Data Engineer (Remote)
Cartagena de Indias, CO - Remote
Applications have closed
Blue Orange Digital
Blue Orange utilizes Modern Data Stack infrastructure to unify data, automate tasks, and improve decision making across industries.Company Overview:
Blue Orange Digital is a cloud-based data transformation and
predictive analytics development firm with offices in NYC and
Washington, DC. From startups to Fortune 500’s, we help companies make
sense of their business challenges by applying modern data analytics
techniques, visualizations, and AI/ML. Founded by engineers, we love
passionate technologists and data analysts. Our startup DNA means
everyone on the team makes a direct contribution to the growth of the
company.
Position Overview:
We are seeking a Data Engineer to join our product development team to help build, optimize, maintain and support the data pipeline management components of our product. Your main tasks will include developing, refactoring, customizing, and maintaining our data source integration (DSI) platform with multiple partner-built data observability platforms. You will join a small team of engineers and have a large impact on shaping how the product is built and designed.
Responsibilities:
- Be proficient in server-side development, automation, and optimization of data pipelines, including database creation and management, and debugging.
- Integrate data from various backend services, APIs, and databases.
- Create and maintain software documentation.
- Create and analyze reliable and secure backend functionality.
- Build and maintain infrastructure and automation to support the running of the platform across multiple cloud environments.
- Remain knowledgeable of emerging technologies/industry trends and apply them to operations and activities.
Requirements:
- Expert-level knowledge and experience in Python.
- Advanced working SQL knowledge and experience working with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases.
- Experience building, refactoring, customizing, and optimizing ‘big data’ data pipelines, architectures, and data sets.
- Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.
- Strong analytic skills related to working with both structured and unstructured datasets.
- Build processes supporting data transformation, data structures, metadata, dependency, and workload management.
- A successful history of manipulating, processing, and extracting value from large disconnected datasets.
- Strong project management, organizational, and collaboration skills.
- Experience supporting and working with cross-functional teams in a dynamic environment.
- We are looking for a candidate with 5+ years of experience in a Data Engineer role, who has attained a degree in Computer Science or another related field.
Preferred qualifications:
Experience using the following software/tools:
- Experience with big data tools: Spark, Kafka, etc.
- Experience with relational SQL and NoSQL databases, including Hive, Postgres, and Cassandra.
- Experience with data pipeline and workflow management tools: Meltano, Airflow,, Airbyte, Dagster, Fivetran, etc.
- Experience with AWS cloud services: EC2, EMR, RDS, Redshift
- Experience with stream-processing systems: Flink, Storm, Spark-Streaming, etc.
- Experience with object-oriented/object function scripting languages: Python, Java, C++, Scala, etc.
Our Benefits Include:
- Fully remote
- Flexible Schedule
- Unlimited Paid Time Off (PTO)
- Paid parental/bereavement leave
- Worldwide recognized clients to build skills for an excellent resume
- Top-notch team to learn and grow with
Salary: $5000 - $6000 USD (per month)
Blue Orange Digital is an equal-opportunity employer.
Background checks may be required for certain positions/projects.
Tags: Airflow APIs Architecture AWS Big Data Cassandra Computer Science Dagster Data Analytics Data pipelines EC2 FiveTran Flink Java Kafka Machine Learning NoSQL Pipelines PostgreSQL Python RDBMS Redshift Scala Spark SQL Streaming
Perks/benefits: Career development Flex hours Flex vacation Parental leave Startup environment Team events Unlimited paid time off
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Lead Data Analyst jobs
- Open MLOps Engineer jobs
- Open Data Science Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Manager jobs
- Open Data Engineer II jobs
- Open Power BI Developer jobs
- Open Principal Data Engineer jobs
- Open Sr Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Data Scientist II jobs
- Open Product Data Analyst jobs
- Open Senior Data Architect jobs
- Open Sr. Data Scientist jobs
- Open Business Data Analyst jobs
- Open Big Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Azure Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Data Product Manager jobs
- Open Data Quality Analyst jobs
- Open Junior Data Engineer jobs
- Open Principal Data Scientist jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open GCP-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Java-related jobs
- Open Privacy-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open TensorFlow-related jobs
- Open Snowflake-related jobs
- Open Consulting-related jobs
- Open PhD-related jobs
- Open CI/CD-related jobs
- Open NLP-related jobs
- Open Kubernetes-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Hadoop-related jobs
- Open LLMs-related jobs
- Open Databricks-related jobs
- Open Generative AI-related jobs