Data Engineer
Berkeley, CA
Grabango
Eliminating Lines, Saving Time. Grabango is the leading provider of checkout-free shopping technology for retailers’ existing stores in the U.S.
Who we are:
Grabango is the leading provider of checkout-free shopper technology for existing stores. Founded by Will Glaser (former founder & CTO of Pandora Media), the Grabango team has developed the only enterprise-class solution for large store chains in the market today. Grabango has raised over $75 million in funding since 2017, with $39 million in Series B announced in June 2021. The round was led by Commerce Ventures with participation from Founders Fund, Unilever Ventures, Honeywell Ventures, Rich Products Ventures, and WIND Ventures. Grabango has signed five retail partners, each with over $1 billion in revenue, including a global top-10 grocer and a Fortune-25 multinational. Several multi-store deployments are underway. The company has filed 40 patents, and the earliest ones, which predate most prior art in the category, have already been awarded. We’re a growing group of curious, self-directed people working towards a common goal. We delight in taking risks and testing hypotheses in a collaborative environment. Our ability to celebrate both our successes and failures as milestones of progress opens the door to tremendous breakthroughs.
About The Role
Grabango is looking for a Data Engineer to join our growing team of analytics experts. The hire will join a small team responsible for expanding and optimizing our data pipeline architecture and managing our data warehouse, as well as optimizing data accessibility for cross-functional teams. The ideal candidate is an experienced data pipeline builder and data wrangler who enjoys optimizing data systems and building them from the ground up. The Data Engineer will support our software developers, database architects, data analysts, and data scientists on data initiatives and will ensure that the data delivery architecture stays consistent across ongoing projects. They must be self-directed and comfortable supporting the data needs of multiple teams, systems, and products. The right candidate will be excited by the prospect of optimizing or even re-designing our company’s data architecture to support our next generation of products and data initiatives.
This full-time role reports to the Platform Integrations Manager and is based in Berkeley.
What You’ll Be Doing:
- Create and maintain ETL pipelines that can robustly ingest, transform, and store both external customer data and internally generated data
- Build out our data warehouse and own the data flow of external data into our system, and between production databases and our data warehouse
- Collaborate with computer vision and machine learning teams to create data pipelines for iteration of cutting edge computer vision models
- Implement best practices to ensure data quality and integrity in our pipelines
What You Should Have:
- 4-7 years of experience as a data engineer
- Excellent SQL and Python skills
- Experience with both SQL and NoSQL databases, such as Cassandra and MongoDB
- Experience with data pipelining tools such as dbt, Airflow, Luigi, or Dagster
- Comfort working within systems running Kubernetes, Docker, Linux, Git
- Strong communication skills to collaborate with cross-functional stakeholders
- You can rapidly become familiar with and apply new tools and technologies
Education & Certifications:
- BS in Computer Science, Engineering, or related field