Data Engineer

Mexico

500 Global

500 Global is a venture capital firm with more than $2.4 billion in assets under management

View company page

Our MissionWe uplift people and economies around the world through entrepreneurship.
Who We Are500 Global is a venture capital firm with $2.7B in assets under management that invests in founders building fast-growing technology companies. We focus on markets where technology, innovation, and capital can unlock long-term value and drive economic growth. We work closely with key stakeholders and advise governments and corporations on how best to support entrepreneurial ecosystems so startups can thrive. 500 Global has backed over 5,000 founders representing more than 2,700 companies operating in 81 countries. We have invested in 49 companies valued at over $1 billion and 150+ companies valued at over $100 million (including private, public, and exited companies). Our 180+ team members are located in 27 countries and bring experience as entrepreneurs, investors, and operators from some of the world’s leading technology companies. https://500.co/https://latam.500.co/
Job Scope
The Data Engineer will design and develop the infrastructure necessary to capture and make sense of the massive amount of information made available to 500 Global by virtue of its relationships with a portfolio of over 3,000+ startups from their earliest stage. The ideal candidate is 50% thinker and 50% do-er, has a bias toward action, has an insatiable curiosity around ways to apply AI to business processes, and knows where/when to cut corners (and where/when not to.)

Essential Functions

  • Lead integrations and develop connectors with various data sources, ensuring smooth and secure data extraction, including Salesforce,  AirTable, Skyvia, Allvue, SaplingHR, and proprietary web portals.
  • Design and implement scalable and reliable data infrastructure using appropriate technologies (e.g. data lakes, etc.)
  • Document processes, best practices, and knowledge for transparency and reproducibility while staying updated on emerging technologies and industry best practices.
  • Build and maintain the infrastructure required for machine learning (ML) and NLP models and LLMs, ensuring scalability, reliability, and performance.
  • Collaborate closely with domain experts, and other stakeholders to understand and address business needs, participating in relevant and company-wide decisions.
  • Additional projects as determined by business needs.

Minimum Qualifications

  • Bachelor's degree in Computer Science, Engineering, or a related field.
  • 4+ years of experience in designing and implementing scalable and reliable data infrastructure using cloud technologies.
  • Solid understanding of data governance, security, and compliance standards in a cloud environment.
  • Hands-on experience building API integrations with various data sources
  • Proficiency in cloud platforms such as Amazon Web Services (AWS), Microsoft Azure, or Google Cloud Platform (GCP), including cloud services relevant to data storage, processing, and analytics.
  • Excellent problem-solving skills and ability to troubleshoot complex data infrastructure and integration issues.
  • Demonstrated experience building something from the ground up
  • Experience staying updated on emerging technologies and industry best practices relevant to data engineering and analytics.
  • Strong documentation skills with a focus on documenting processes, best practices, and knowledge for transparency and reproducibility.
  • Proven track record of collaborating closely with cross-functional teams, domain experts, and stakeholders.

Preferred Qualifications

  • Experience working Data Lakes, Data warehouses, and ETLs.
  • Experience building and maintaining infrastructure for machine learning (ML) and natural language processing (NLP) models, ensuring scalability, reliability, and performance.
  • Knowledge of web development concepts (HTML, CSS, JavaScript, TypeScript, Python).Previous experience in PE, VC, or financial sectors.
  • Experience working with (or at) startups in fast-pacing environments.
  • A public GitHub repository with unlocked packages you have built
Benefits:Health Care Plan for employees and immediate family (spouse and children), Unlimited PTO, Family Leave (Maternity, Paternity),Training & Development, Work From Home, Monthly credits for food delivery through ComeBien.MX (Only available in Mexico City), Monthly reimbursement for cellphone carrier
500 Global does not discriminate on the basis of race, sex, color, religion, age, national origin, marital status, disability, veteran status, genetic information, sexual orientation, gender identity, or any other reason prohibited by law in provision of employment opportunities and benefits.
500 Global collects and processes personal data in accordance with applicable data protection laws. If you are a European Job Applicant see the privacy notice for further details. If you are a California Job Applicant see the privacy notice for further details.
Apply now Apply later
  • Share this job via
  • or

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: Airtable APIs AWS Azure Computer Science Data governance Engineering ETL GCP GitHub Google Cloud JavaScript LLMs Machine Learning NLP Privacy Python Salesforce Security TypeScript

Perks/benefits: Career development Health care Parental leave Unlimited paid time off

Region: North America
Country: Mexico
Job stats:  8  0  0
Category: Engineering Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.