Senior Data Engineer (Remote)
United States
Dialexa
At Dialexa, an IBM Company, we engineer and create game-changing technology products across mobile, web, Internet of Things (IoT) and embedded device platforms.Dialexa is expanding its Data Engineering practice and is looking for great talent to join us in building intelligent, next-generation platforms. Are you tired of just pushing out ones and zeros and you yearn to solve hard problems? Do you want to work on an awesome team of data scientists and developers in a full agile process? Do you want your creative technical ideas to be listened to, heard, and implemented? Do you want to create applications that people love?
Dialexa is expanding and growing our reputation for high-quality end-to-end technology product development. We develop for a wide array of technology platforms and frameworks including mobile, web, Internet of Things (IoT), wearables, and embedded devices. We have an awesome culture that includes perks such as healthcare benefits, 401K, PTO, discounts for the gym, and much much more! To learn more about Dialexa, please visit www.dialexa.com.
Responsibilities
- Build and estimate data engineering projects
- Lead projects overseeing data scientists, data engineers, software engineers, and designers
- Define and decide the best approach to executing data engineering projects based on stakeholder business requirements.
- Advise clients on the benefits and risks of different data engineering architectures and designs in order to solve business problems
- Designing and building scalable architectures tailored to client needs
- Write and monitor ETL processes using distributed computing techniques
- Help drive business results and participate in practice and company growth initiatives
- Mentor new data engineers on technical capabilities and career goals
- Analyze, explore, and visualize data to produce actionable information
Expectations (the more the better - we want great talent!)
- Bachelor’s Degree from a 4-year college/university with a relevant degree (computer science, engineering, etc) or equivalent experience and certifications
- Experience leading data engineering projects
- 5+ years in data engineering roles or similar experiences with 3+ years in a client-facing roles
- Portfolio of applications that clearly demonstrates coding proficiency, an eye for quality, understanding of security, and attention to detail such as things like performance characteristics
- Knowledge of common data engineering languages (e.g. Python, Scala, Java, R, etc.)
- Experience with data lake platforms (e.g. Databricks, AWS Lake Formation, Azure Data Lake)
- Experience with data warehouse technologies (e.g. Snowflake, Redshift, BigQuery)
- Experience with:
- Cloud infrastructures, services, and certifications (e.g. AWS, Azure, Google Cloud)
- Data modeling and testing frameworks (DBT, Deequ, Great Expectations)
- Data and pipeline observability (Monte Carlo, Databand, Datafold)
- Distributed processing (Spark, PySpark, Dask, Presto/Athena/Trino)
- Job orchestration (Airflow, Prefect, Argo, Kubeflow, Azure Data Factory)
- Devops and CI/CD tools (Git, Jenkins, GitHub CI/CD, CircleCI, Azure Pipelines, Terraform)
- Messaging platforms (Kafka, Kinesis, GCS pub/sub, Azure Event Hub)
- Must be comfortable being online and available approximately between 8AM to 6PM CST
- Must be able to travel to Dallas, Texas or the client location as requested (at a minimum of once a quarter)
This is a remote eligible posting available everywhere in the United States except for Colorado.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Agile Airflow Athena AWS Azure BigQuery CI/CD Computer Science Databricks DevOps Engineering ETL GCP Git GitHub Google Cloud Kafka Kinesis Monte Carlo Pipelines PySpark Python R Redshift Scala Security Snowflake Spark Terraform Testing
Perks/benefits: Career development Startup environment
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Marketing Data Analyst jobs
- Open MLOps Engineer jobs
- Open AI Engineer jobs
- Open Data Engineer II jobs
- Open Junior Data Scientist jobs
- Open Senior Data Architect jobs
- Open Sr Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Power BI Developer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Manager, Data Engineering jobs
- Open Principal Data Engineer jobs
- Open Product Data Analyst jobs
- Open Business Data Analyst jobs
- Open Data Manager jobs
- Open Data Quality Analyst jobs
- Open Sr. Data Scientist jobs
- Open Data Scientist II jobs
- Open Big Data Engineer jobs
- Open Business Intelligence Developer jobs
- Open Data Analyst Intern jobs
- Open Principal Data Scientist jobs
- Open ETL Developer jobs
- Open Azure Data Engineer jobs
- Open Data Product Manager jobs
- Open Business Intelligence-related jobs
- Open Data quality-related jobs
- Open Privacy-related jobs
- Open Data management-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open ML models-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open Deep Learning-related jobs
- Open APIs-related jobs
- Open PyTorch-related jobs
- Open PhD-related jobs
- Open TensorFlow-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open NLP-related jobs
- Open Data governance-related jobs
- Open Data warehouse-related jobs
- Open Airflow-related jobs
- Open Databricks-related jobs
- Open Hadoop-related jobs
- Open LLMs-related jobs
- Open DevOps-related jobs
- Open CI/CD-related jobs