Data Engineer

Nairobi, Nairobi, Kenya

Applications have closed

Sama

Sama provides ML Professionals and AI team Leads with an indispensable solution for Computer Vision data labeling.

View company page

About the Job:

As a Data Engineer at Sama, you will be responsible for building and optimizing data architectures and data pipelines. You will be responsible for building and maintaining ETL, ELT data flows for different cross functional teams. As a data engineer, you will provide support to our data analysts, data scientists and other stakeholders on data initiatives. Your primary goal is to ensure optimal and consistent data availability, data quality and data delivery architecture.

Key Responsibilities: 

  • Create and maintain optimal data pipeline architectures that serve key business stakeholders
  • Assemble large, complex data sets that meet business requirements for different stakeholders and teams.
  • Build and maintain the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources.
  • Develop and maintain a data catalog of data sets, scripts, tools and pipelines as part of documentation.
  • Work with stakeholders to identify their data needs and provide consistent data availability and quality to meet those needs.
  • Work with business analytics to build ETL pipelines that serve various areas of business.
  • Identify any bottlenecks or challenges in the current data pipelining approaches and suggest areas of improvement.
  • Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
  • Build analytics tools that utilize the data pipeline to provide actionable insights on key business metrics
  • Maintain the daily relationship with stakeholders to understand their data needs and communicate results intuitively.

Minimum Qualifications:

  • Advanced working knowledge of SQL.
  • Experience with Google Cloud Platform and its services.
  • Experience working with relational and non-relational databases - Big query, AWS, Postgres.
  • Working knowledge of data pipelining tools such as Hevo
  • Experience with transformation tools such as Data Form, Database Tools (DBT).
  • Experience with object-oriented/object function scripting languages: Python, JavaScript, Java, C++.
  • Experience working on CI/CD processes and source control tools such as GitHub.
  • A successful history of manipulating, processing and extracting value from large disconnected datasets.

Preferred Qualifications:

  • Outstanding communication skills, and the ability to stay self-motivated and work with little or no supervision.
  • Great communication and collaboration skills.
  • Excellent time management and organizational abilities

About Sama:

25% of the Fortune 50 trust Sama to deliver secure, high-quality training data and validation for the technology teams driving humanity forward. From self-driving cars to smart hardware, Sama fuels AI. Founded over a decade ago, we’re experts in image, video and sensor data annotation and validation for machine learning algorithms in industries including automotive, navigation, AR/VR, biotech, agriculture, manufacturing, and e-commerce. Our staff are driven by a mission to expand opportunity for low-income people through the digital economy, and our social business model has helped lift over 50,000 people out of poverty.  

 

Our Culture:

Sama is quite unique. We are a technology company with a social mission. People that thrive in a high growth environment, love working on the bleeding edge of technology, and really care about having a positive impact on the world are a great fit for the Sama culture.

Our core values are;

  • One Team,One Goal
  • Deliver.Period.
  • Trust & Transparency
  • Customer First
  • Humanity
     

Our benefits:

Sama offers competitive compensation commensurate with experience and a full benefits package, including: medical, dental, and vision insurance, short- and long-term disability insurance, AD&D insurance, employer-matching 401K, generous holiday and vacation policies, sabbaticals, paid disability and baby bonding leave, and professional development opportunities.

 

At Sama, we pride ourselves in being a diverse and equal opportunity employer.

About Sama:

Sama is the leading training data provider for Fortune 2000 companies such as Google, Walmart, Ford, Microsoft, and Marriott. We help these organizations get their machine learning models to production more quickly by providing accurate annotation and validation for their datasets.

80% of AI project time is spent on aggregating, cleaning, and labeling data for machine learning models Sama drives the very important task of data annotation from companies building state-of-the-art AI. 

Sama offers the highest quality SLAs in the industry, along with cutting-edge ML-assisted annotation tools, QA processes, and security and compliance standards. 

Sama has a social mission driven by the belief that “talent is equally distributed but opportunity is not.” As a certified B corp, Sama supports an ethical supply chain, and our impact was validated by MIT with a randomized control study. 

Sama has provided worker training programs to increase economic opportunity for more than 13,000 people from underserved communities. By connecting our customers with amazing talent in East Africa, we've impacted more than 59,000 workers and their dependents.

Our Culture:

Sama is quite unique. We are a technology company with a social mission. People that thrive in a high-growth environment, love working on the bleeding edge of technology, and really care about having a positive impact on the world are a great fit for the Sama culture. Our core values are One Team, One Goal - Deliver. Period. - Trust & Transparency - Customer First - Humanity.

Our Benefits:

Sama offers competitive compensation commensurate with experience and a full benefits package, including: medical, dental, and vision insurance, long-term disability insurance, life, and AD&D insurance, employer-matching Group RRSP, generous holiday and vacation policies, sabbaticals, a monthly fitness stipend, and professional development opportunities.

At Sama, we pride ourselves in being a diverse and equal opportunity employer.

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: Architecture AWS BigQuery Business Analytics CI/CD Data pipelines Data quality E-commerce ELT ETL GCP GitHub Google Cloud JavaScript Machine Learning ML models Pipelines PostgreSQL Python RDBMS Security SQL VR

Perks/benefits: Career development Competitive pay Health care Insurance Medical leave Startup environment Transparency

Region: Africa
Country: Kenya
Job stats:  49  12  0
Category: Engineering Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.