Principal Data Engineer
Remote (USA)
Gemini
Gemini makes crypto simple. Find, Trade and Buy over 80 coins including bitcoin on the best cryptocurrency platform. Start trading crypto here.About the Company
Gemini is a global crypto and Web3 platform founded by Tyler Winklevoss and Cameron Winklevoss in 2014. Gemini offers a wide range of crypto products and services for individuals and institutions in over 70 countries.
Crypto is about giving you greater choice, independence, and opportunity. We are here to help you on your journey. We build crypto products that are simple, elegant, and secure. Whether you are an individual or an institution, we help you buy, sell, and store your bitcoin and cryptocurrency.
At Gemini, our mission is to unlock the next era of financial, creative, and personal freedom.
In the United States, we have a flexible hybrid work policy for employees who live within 30 miles of our office headquartered in New York City. Employees within the New York Metropolitan area are expected to work from the NYC office twice a week, unless there is a job-specific requirement to be in the office every workday. Employees outside of this area are considered part of our remote-first workforce. We believe our hybrid approach for those near our NYC office increases productivity through more in-person collaboration where possible.
The Department: Analytics
The Role: Principal Data Engineer
As a member of our data engineering team, you'll be setting standards for data engineering solutions that have organizational impact. You'll provide Architectural solutions that are efficient, robust, extensible and are competitive within business and industry context. You'll collaborate with senior data engineers and analysts, guiding them towards their career goals at Gemini. Communicating your insights with leaders across the organization is paramount to success.
Responsibilities:
- Focused on technical leadership, defining patterns and operational guidelines for their vertical(s)
- Independently scopes, designs, and delivers solutions for large, complex challenges
- Provides oversight, coaching and guidance through code and design reviews
- Designs for scale and reliability with the future in mind. Can do critical R&D
- Successfully plans and delivers complex, multi-team or system, long-term projects, including ones with external dependencies
- Identifies problems that need to be solved and advocates for their prioritization
- Owns one or more large, mission-critical systems at Gemini or multiple complex, team level projects, overseeing all aspects from design through implementation through operation
- Collaborates with coworkers across the org to document and design how systems work and interact
- Leads large initiatives across domains, even outside their core expertise. Coordinates large initiatives
- Designs, architects and implements best-in-class Data Warehousing and reporting solutions
- Builds real-time data and reporting solutions
- Develops new systems and tools to enable the teams to consume and understand data more intuitively
Minimum Qualifications:
- 10+ years experience in data engineering with data warehouse technologies
- 10+ years experience in custom ETL design, implementation and maintenance
- 10+ years experience with schema design and dimensional data modeling
- Experience building real-time data solutions and processes
- Advanced skills with Python and SQL are a must
- Experience and expertise in Databricks, Spark, Hadoop etc.
- Experience with one or more MPP databases(Redshift, Bigquery, Snowflake, etc)
- Experience with one or more ETL tools(Informatica, Pentaho, SSIS, Alooma, etc)
- Strong computer science fundamentals including data structures and algorithms
- Strong software engineering skills in any server side language, preferable Python
- Experienced in working collaboratively across different teams and departments
- Strong technical and business communication skills
Preferred Qualifications:
- Kafka, HDFS, Hive, Cloud computing, machine learning, LLMs, NLP & Web development experience is a plus
- NoSQL experience a plus
- Deep knowledge of Apache Airflow
- Expert experience implementing complex, enterprise-wide data transformation and processing solutions
- Experience with Continuous integration and deployment
- Knowledge and experience of financial markets, banking or exchanges
- Web development skills with HTML, CSS, or JavaScript
- Competitive starting salary
- A discretionary annual bonus
- Long-term incentive in the form of a new hire equity grant
- Comprehensive health plans
- 401K with company matching
- Annual Learning & Development stipend
- Paid Parental Leave
- Flexible time off
Salary Range: The base salary range for this role is between $172,000 - $241,000 in the State of New York, the State of California and the State of Washington. This range is not inclusive of our discretionary bonus or equity package. When determining a candidate’s compensation, we consider a number of factors including skillset, experience, job scope, and current market data.
At Gemini, we strive to build diverse teams that reflect the people we want to empower through our products, and we are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or Veteran status. Equal Opportunity is the Law, and Gemini is proud to be an equal opportunity workplace. If you have a specific need that requires accommodation, please let a member of the People Team know.
#LI-AH1
Tags: Airflow Banking BigQuery Computer Science Crypto Databricks Data warehouse Data Warehousing Engineering ETL Gemini Hadoop HDFS Informatica JavaScript Kafka LLMs Machine Learning MPP NLP NoSQL Pentaho Python R R&D Redshift Snowflake Spark SQL SSIS
Perks/benefits: Career development Competitive pay Equity Flex hours Flex vacation Health care Home office stipend Parental leave Salary bonus
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Lead Data Analyst jobs
- Open Data Science Manager jobs
- Open MLOps Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Engineer II jobs
- Open Data Manager jobs
- Open Principal Data Engineer jobs
- Open Power BI Developer jobs
- Open Data Scientist II jobs
- Open Junior Data Scientist jobs
- Open Business Data Analyst jobs
- Open Data Analytics Engineer jobs
- Open Sr Data Engineer jobs
- Open Product Data Analyst jobs
- Open Business Intelligence Developer jobs
- Open Data Analyst Intern jobs
- Open Sr. Data Scientist jobs
- Open Senior Data Architect jobs
- Open Big Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Principal Data Scientist jobs
- Open Data Quality Analyst jobs
- Open Research Scientist jobs
- Open Azure Data Engineer jobs
- Open Data Product Manager jobs
- Open Data quality-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open Business Intelligence-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open PhD-related jobs
- Open Deep Learning-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open PyTorch-related jobs
- Open TensorFlow-related jobs
- Open APIs-related jobs
- Open NLP-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open LLMs-related jobs
- Open CI/CD-related jobs
- Open Generative AI-related jobs
- Open Kubernetes-related jobs
- Open Hadoop-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Docker-related jobs