Data Engineer
Mexico
500 Global
500 Global is a venture capital firm with more than $2.4 billion in assets under managementWho We Are500 Global is a venture capital firm with $2.7B in assets under management that invests in founders building fast-growing technology companies. We focus on markets where technology, innovation, and capital can unlock long-term value and drive economic growth. We work closely with key stakeholders and advise governments and corporations on how best to support entrepreneurial ecosystems so startups can thrive. 500 Global has backed over 5,000 founders representing more than 2,700 companies operating in 81 countries. We have invested in 49 companies valued at over $1 billion and 150+ companies valued at over $100 million (including private, public, and exited companies). Our 180+ team members are located in 27 countries and bring experience as entrepreneurs, investors, and operators from some of the world’s leading technology companies. https://500.co/https://latam.500.co/
Job Scope
The Data Engineer will design and develop the infrastructure necessary to capture and make sense of the massive amount of information made available to 500 Global by virtue of its relationships with a portfolio of over 3,000+ startups from their earliest stage. The ideal candidate is 50% thinker and 50% do-er, has a bias toward action, has an insatiable curiosity around ways to apply AI to business processes, and knows where/when to cut corners (and where/when not to.)
Essential Functions
- Lead integrations and develop connectors with various data sources, ensuring smooth and secure data extraction, including Salesforce, AirTable, Skyvia, Allvue, SaplingHR, and proprietary web portals.
- Design and implement scalable and reliable data infrastructure using appropriate technologies (e.g. data lakes, etc.)
- Document processes, best practices, and knowledge for transparency and reproducibility while staying updated on emerging technologies and industry best practices.
- Build and maintain the infrastructure required for machine learning (ML) and NLP models and LLMs, ensuring scalability, reliability, and performance.
- Collaborate closely with domain experts, and other stakeholders to understand and address business needs, participating in relevant and company-wide decisions.
- Additional projects as determined by business needs.
Minimum Qualifications
- Bachelor's degree in Computer Science, Engineering, or a related field.
- 4+ years of experience in designing and implementing scalable and reliable data infrastructure using cloud technologies.
- Solid understanding of data governance, security, and compliance standards in a cloud environment.
- Hands-on experience building API integrations with various data sources
- Proficiency in cloud platforms such as Amazon Web Services (AWS), Microsoft Azure, or Google Cloud Platform (GCP), including cloud services relevant to data storage, processing, and analytics.
- Excellent problem-solving skills and ability to troubleshoot complex data infrastructure and integration issues.
- Demonstrated experience building something from the ground up
- Experience staying updated on emerging technologies and industry best practices relevant to data engineering and analytics.
- Strong documentation skills with a focus on documenting processes, best practices, and knowledge for transparency and reproducibility.
- Proven track record of collaborating closely with cross-functional teams, domain experts, and stakeholders.
Preferred Qualifications
- Experience working Data Lakes, Data warehouses, and ETLs.
- Experience building and maintaining infrastructure for machine learning (ML) and natural language processing (NLP) models, ensuring scalability, reliability, and performance.
- Knowledge of web development concepts (HTML, CSS, JavaScript, TypeScript, Python).Previous experience in PE, VC, or financial sectors.
- Experience working with (or at) startups in fast-pacing environments.
- A public GitHub repository with unlocked packages you have built
500 Global does not discriminate on the basis of race, sex, color, religion, age, national origin, marital status, disability, veteran status, genetic information, sexual orientation, gender identity, or any other reason prohibited by law in provision of employment opportunities and benefits.
500 Global collects and processes personal data in accordance with applicable data protection laws. If you are a European Job Applicant see the privacy notice for further details. If you are a California Job Applicant see the privacy notice for further details.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airtable APIs AWS Azure Computer Science Data governance Engineering ETL GCP GitHub Google Cloud JavaScript LLMs Machine Learning NLP Privacy Python Salesforce Security TypeScript
Perks/benefits: Career development Health care Parental leave Unlimited paid time off
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Data Manager jobs
- Open Lead Data Analyst jobs
- Open MLOps Engineer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Science Manager jobs
- Open Principal Data Engineer jobs
- Open Data Engineer II jobs
- Open Power BI Developer jobs
- Open Sr Data Engineer jobs
- Open Data Scientist II jobs
- Open Product Data Analyst jobs
- Open Business Intelligence Developer jobs
- Open Data Analytics Engineer jobs
- Open Junior Data Scientist jobs
- Open Sr. Data Scientist jobs
- Open Senior Data Architect jobs
- Open Business Data Analyst jobs
- Open Data Analyst Intern jobs
- Open Big Data Engineer jobs
- Open Principal Data Scientist jobs
- Open Junior Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Data Product Manager jobs
- Open Data Quality Analyst jobs
- Open Research Scientist jobs
- Open Data quality-related jobs
- Open GCP-related jobs
- Open Business Intelligence-related jobs
- Open Java-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Privacy-related jobs
- Open Data visualization-related jobs
- Open Finance-related jobs
- Open Deep Learning-related jobs
- Open PhD-related jobs
- Open APIs-related jobs
- Open TensorFlow-related jobs
- Open PyTorch-related jobs
- Open NLP-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open CI/CD-related jobs
- Open LLMs-related jobs
- Open Generative AI-related jobs
- Open Kubernetes-related jobs
- Open Data governance-related jobs
- Open Hadoop-related jobs
- Open Airflow-related jobs
- Open Data warehouse-related jobs