Sr. Data Engineer, Clickhouse
Remote - United States
Tealium Inc.
Tealium is a customer data platform (CDP) that connects your data so you can connect with your customers!Tealium is the most trusted and world’s largest independent customer data platform. Tealium connects customer data – spanning web, mobile, offline, and IoT devices — so brands can connect with their customers. Tealium’s turnkey integration ecosystem supports more than 1,300 client-side and server-side vendors and technologies, empowering brands to create a unified, real-time customer data infrastructure.
Tealium’s customer data solutions encompass tag management, an API hub, a customer data platform with machine learning and AI, and data management solutions that make customer data more valuable, actionable, and secure. Tealium has been a trusted provider of customer data solutions for more than a decade, and more than 850 top businesses worldwide including Microsoft, Hyatt, Gap, HSBC and Novartis rely on Tealium to power their customer data strategies.
Team Tealium works and lives across the U.S. and in nearly 20 countries across the world. We are intentional about our culture, our investment in our team members and how we care and connect.
Tealium = Teal + Helium. Teal: a vibrant reflection that evokes authenticity, trustworthiness, reliability, open communication and clarity of thought. Helium: we rise above, a kinetic force that elevates our customers’ and our experiences beyond all others.
We win together with respect and appreciation for the talents required of all positions and the people who contribute to each of these.
As a Senior Data Engineer/architect, you will play a critical role in developing Big Data Analytics applications to fuel ingestion, storage, and optimization of Tealium’s 5 billion daily events into our core data platform that powers Machine Learning and Analytics in our products. You will be responsible for designing, developing, testing, and deploying production-ready solutions aimed at leveraging our petabyte scale data platform to solve real-time customer data delivery challenges. This includes heavy use and understanding of columnar and time series databases, distributed computing frameworks in streaming and batch applications, and low latency querying across trillions of events at scale.
Through seamless inbound and outbound connectors with major digital marketing vendors, Tealium's Cloud-based Customer Data Hub allows our customers to take action on their visitors in real-time based on behavior. This lets them maximize the value from each visitor by creating personalized content and product offers, as well as highly targeted re-marketing campaigns.We are a fast growing, highly disciplined engineering team, working on multiple exciting products across a common cloud platform using the latest technologies and design principles. You will be a senior person on the core foundational data platform team, working across product boundaries.
YOUR DAY TO DAY
- Work with a diverse set of engineers, architects, data scientists, and product owners to develop foundational data solutions to power Machine Learning and Analytics in our products
- Evaluate current architecture and help further define best practices for data consumption, storage, and access in areas such as cost, maintainability, and scalability
- Propose and drive architectural design decisions in accordance with the latest technologies and best practices in cloud based architecture
- Participate in all phases of the Software Development Lifecycle in an Agile environment, including design, development, testing, monitoring, and documentation
WHY YOU ARE A PERFECT FIT
- BS, MS, in Computer Science, Software Engineering, or a related discipline
- 7+ years software development experience related to data engineering
- Strong schema and query optimization skills on nonSQL, Columnar and time-series databases
- Demonstrated knowledge of big data databases such as DynamoDB, Cassandra, BigTable, BigQuery and Presto
- Experience with Data Warehouse and Data Lake technologies such as Delta Lake and Apache Hudi
- Experience working with big data file formats like Parquet, Avro and ORC at a petabyte scale
KEY QUALIFICATIONS
- 4+ years building schemas and optimizing nonSQL, Columnar, time series type databases for massive petabyte low latency querying
- Expert in schema design and data modeling concepts
- Expert in ETL processes, and moving petabytes of data via Apache Spark and other similar tools
- Experience with workflow management tools: Azkaban, Oozie, Airflow etc.
- Expert with streaming and batch data processing using Kafka or similar products
- Ability to learn and research new technologies in rapid fashion
- Software engineering or data engineering experience through the full software development lifecycle
- Excellent problem solving and analytic skills
- Strong communication skills and experience working on multi-functional projects across geo locations
- Good written and oral communication skills
WHAT ARE SOME OF THE WOWs ABOUT TEALIUM
At Tealium, we don’t just offer the ordinary, we provide the extraordinary: - Tealium WOWs (Ways of Work), our award winning culture is how with think, act and connect together at Tealium- Mosaic, our commitment to diversity, equity and inclusion is grounded in our mosaic of diverse perspectives and shared belonging as we live in work across the US and in nearly 20 countries- Tealium Cares, to promote caring in our communities, 15 hours of paid work time for volunteer activities and programs is offered annually- Tealium Connects (remote-first working), enabling many of us to choose where we do our best work and offering new hire stipends to assist with purchasing things we need to support a successful home office environment- Tealium Ownership, share in the success of Tealium by becoming an owner of Tealium beginning with new hire equity grants - Tealium Time, unlimited paid time-off policy to offer flexibility to take time when needed and robust leave programs, including extended paid parental leave and company holidays- Healium, health and wellness programs to help us be our best selves in the experiences of health, physical, mental, social, and even financial well-being and wellness- Healium Be-Well Break, an annual all-company paid shutdown to provide a true break for us all- Tealium LIFT (Learning is Facilitated at Tealium), offering a myriad of professional development opportunities with over 6,000 courses available on demand to best-in-class manager and leadership development programs- Health and Related Benefits Programs, offering market competitive benefits programs
Collectively, we contribute our individual pieces (identity, experiences, heritage, backgrounds, religions, viewpoints, gender and more ) to form the mosaic of Team Tealium. It is our continuing philosophy to recruit and employ the best qualified individuals without regard to race, color, sex, religion, national origin, disability, age, sexual orientation, gender identity, and/or any other protected characteristic. Tealium does not tolerate unlawful discrimination of any kind and strives to be an inclusive and respectful workplace.
The highly relevant and differentiated positioning of Tealium’s solutions makes this a unique and rewarding career opportunity.
#LI-Remote *Offerings vary by level and location.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Agile Airflow APIs Avro Azkaban Big Data BigQuery Bigtable Cassandra Computer Science Data Analytics Data management DynamoDB Engineering ETL Kafka Machine Learning Oozie Parquet Research Spark Streaming Testing
Perks/benefits: Career development Equity Flex vacation Health care Home office stipend Parental leave Team events Unlimited paid time off Wellness
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Lead Data Analyst jobs
- Open MLOps Engineer jobs
- Open AI Engineer jobs
- Open Data Engineer II jobs
- Open Sr Data Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Principal Data Engineer jobs
- Open Data Manager jobs
- Open Data Analytics Engineer jobs
- Open Power BI Developer jobs
- Open Junior Data Scientist jobs
- Open Senior Data Architect jobs
- Open Product Data Analyst jobs
- Open Data Scientist II jobs
- Open Business Intelligence Developer jobs
- Open Sr. Data Scientist jobs
- Open Manager, Data Engineering jobs
- Open Big Data Engineer jobs
- Open Business Data Analyst jobs
- Open Data Quality Analyst jobs
- Open Principal Data Scientist jobs
- Open ETL Developer jobs
- Open Data Analyst Intern jobs
- Open Junior Data Engineer jobs
- Open Research Scientist jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open GCP-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Java-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open Consulting-related jobs
- Open TensorFlow-related jobs
- Open Snowflake-related jobs
- Open PhD-related jobs
- Open NLP-related jobs
- Open CI/CD-related jobs
- Open Data governance-related jobs
- Open Kubernetes-related jobs
- Open Airflow-related jobs
- Open Databricks-related jobs
- Open Hadoop-related jobs
- Open LLMs-related jobs
- Open Data warehouse-related jobs