Sr. Data Engineer, Clickhouse

Remote - United States

Applications have closed

Tealium Inc.

Tealium is a customer data platform (CDP) that connects your data so you can connect with your customers!

View company page

WHO WE ARE
Tealium is the most trusted and world’s largest independent customer data platform. Tealium connects customer data – spanning web, mobile, offline, and IoT devices — so brands can connect with their customers. Tealium’s turnkey integration ecosystem supports more than 1,300 client-side and server-side vendors and technologies, empowering brands to create a unified, real-time customer data infrastructure. 
Tealium’s customer data solutions encompass tag management, an API hub, a customer data platform with machine learning and AI, and data management solutions that make customer data more valuable, actionable, and secure. Tealium has been a trusted provider of customer data solutions for more than a decade, and more than 850 top businesses worldwide including Microsoft, Hyatt, Gap, HSBC and Novartis rely on Tealium to power their customer data strategies.
Team Tealium works and lives across the U.S. and in nearly 20 countries across the world.  We are intentional about our culture, our investment in our team members and how we care and connect. 
Tealium = Teal + Helium. Teal: a vibrant reflection that evokes authenticity, trustworthiness, reliability, open communication and clarity of thought.  Helium: we rise above, a kinetic force that elevates our customers’ and our experiences beyond all others. 
We win together with respect and appreciation for the talents required of all positions and the people who contribute to each of these.
As a Senior Data Engineer/architect, you will play a critical role in developing Big Data Analytics applications to fuel ingestion, storage, and optimization of Tealium’s 5 billion daily events into our core data platform that powers Machine Learning and Analytics in our products. You will be responsible for designing, developing, testing, and deploying production-ready solutions aimed at leveraging our petabyte scale data platform to solve real-time customer data delivery challenges. This includes heavy use and understanding of columnar and time series databases, distributed computing frameworks in streaming and batch applications,  and low latency querying across trillions of events at scale.
Through seamless inbound and outbound connectors with major digital marketing vendors, Tealium's Cloud-based Customer Data Hub allows our customers to take action on their visitors in real-time based on behavior. This lets them maximize the value from each visitor by creating personalized content and product offers, as well as highly targeted re-marketing campaigns.We are a fast growing, highly disciplined engineering team, working on multiple exciting products across a common cloud platform using the latest technologies and design principles. You will be a senior person on the core foundational data platform team, working across product boundaries.

YOUR DAY TO DAY

  • Work with a diverse set of engineers, architects, data scientists, and product owners to develop foundational data solutions to power Machine Learning and Analytics in our products
  • Evaluate current architecture and help further define best practices for data consumption, storage, and access in areas such as cost, maintainability, and scalability
  • Propose and drive architectural design decisions in accordance with the latest technologies and best practices in cloud based architecture
  • Participate in all phases of the Software Development Lifecycle in an Agile environment, including design, development, testing, monitoring, and documentation

WHY YOU ARE A PERFECT FIT

  • BS, MS, in Computer Science, Software Engineering, or a related discipline
  • 7+ years software development experience related to data engineering
  • Strong schema and query optimization skills on nonSQL, Columnar and time-series databases
  • Demonstrated knowledge of big data databases such as DynamoDB, Cassandra, BigTable, BigQuery and Presto
  • Experience with Data Warehouse and Data Lake technologies such as Delta Lake and Apache Hudi
  • Experience working with big data file formats like Parquet, Avro and ORC at a petabyte scale


KEY QUALIFICATIONS

  • 4+ years building schemas and optimizing nonSQL, Columnar, time series type databases for massive petabyte low latency querying
  • Expert in schema design and data modeling concepts
  • Expert in ETL processes, and moving petabytes of data via  Apache Spark and other similar tools
  • Experience with workflow management tools: Azkaban, Oozie, Airflow etc.
  • Expert with streaming and batch data processing using Kafka or similar products
  • Ability to learn and research new technologies in rapid fashion
  • Software engineering or data engineering experience through the full software development lifecycle
  • Excellent problem solving and analytic skills
  • Strong communication skills and experience working on multi-functional projects across geo locations
  • Good written and oral communication skills


WHAT ARE SOME OF THE WOWs ABOUT TEALIUM
At Tealium,  we don’t just offer the ordinary, we provide the extraordinary: - Tealium WOWs (Ways of Work), our award winning culture is how with think, act and connect together at Tealium- Mosaic, our commitment to diversity, equity and inclusion is grounded in our mosaic of diverse perspectives and shared belonging as we live in work across the US and in nearly 20 countries- Tealium Cares, to promote caring in our communities, 15 hours of paid work time for volunteer activities and programs is offered annually- Tealium Connects (remote-first working), enabling many of us to choose where we do our best work and offering new hire stipends to assist with purchasing things we need to support a successful home office environment- Tealium Ownership, share in the success of Tealium by becoming an owner of Tealium beginning with new hire equity grants - Tealium Time, unlimited paid time-off policy to offer flexibility to take time when needed and robust leave programs, including extended paid parental leave and company holidays- Healium, health and wellness programs to help us be our best selves in the experiences of health, physical, mental, social, and even financial well-being and wellness- Healium Be-Well Break, an annual all-company paid shutdown to provide a true break for us all- Tealium LIFT (Learning is Facilitated at Tealium), offering a myriad of professional development opportunities with over 6,000 courses available on demand to best-in-class manager and leadership development programs- Health and Related Benefits Programs, offering market competitive benefits programs
Collectively, we contribute our individual pieces (identity, experiences, heritage, backgrounds, religions, viewpoints, gender and more ) to form the mosaic of Team Tealium. It is our continuing philosophy to recruit and employ the best qualified individuals without regard to race, color, sex, religion, national origin, disability, age, sexual orientation, gender identity, and/or any other protected characteristic. Tealium does not tolerate unlawful discrimination of any kind and strives to be an inclusive and respectful workplace.
The highly relevant and differentiated positioning of Tealium’s solutions makes this a unique and rewarding career opportunity.
#LI-Remote *Offerings vary by level and location.

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: Agile Airflow APIs Avro Azkaban Big Data BigQuery Bigtable Cassandra Computer Science Data Analytics Data management DynamoDB Engineering ETL Kafka Machine Learning Oozie Parquet Research Spark Streaming Testing

Perks/benefits: Career development Equity Flex vacation Health care Home office stipend Parental leave Team events Unlimited paid time off Wellness

Regions: Remote/Anywhere North America
Country: United States
Job stats:  4  0  0
Category: Engineering Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.