Machine Learning & Data Engineer

Remote - India or Germany

CommentSold

View company page

About CommentSold

CommentSold is the North American leader in live selling technology (ranked by G2), having enabled over 7,000 small to mid-sized retailers with live-selling tools, generating over 166 million items sold with $3.8B+ in lifetime GMV. CommentSold’s technology continues to provide businesses and creators of all sizes with best-in-class solutions for delivering engaging live video commerce experiences across all of their sales channels simultaneously. CommentSold moved into direct-to-consumer commerce via the acquisition of assets of Popshoplive, a community-driven livestream shopping marketplace app at the intersection of social, e-commerce and entertainment. In 2022, CommentSold debuted its lightweight video commerce plugin technology, Videeo, which gives any retailer or brand the ability to embed and go live with engaging, branded live video commerce experiences within days by easily integrating into an existing e-commerce stack.

About the role

The ML & Data Engineer in our AI & Data Team will serve as a senior Data engineering expert and cross-departmental liaison in our Company, responsible for further developing the existing platform of both structured data Data Warehouse as well as vast unstructured data in Data Lake. Tasks include setting new data pipelines, data crawlers and transformations, monitoring the performance and cost-effectiveness of existing data jobs, as well as data integration and docking of the Machine learning processes into the existing data landscape of the company. The ML & Data engineer will represent the Data team in multi-disciplinary projects and is the go-to-person for business teams to consult the new data sources onboarding as well as data availability for products and customer enabling. 

This person is a member of the Data team and reports to our EVP of AI & Data. This is a fully remote role (based in Germany or India), with the need to work (at least partially) for EST or CST time zones in the US.

Main Responsibilities

  • As part of the AI & Data team, build a company-wide data platform. Drive data democracy and literacy within the company.
  • Develop and maintain central Data warehouse and its staging layers, oversee and adapt the ingestion and ETL jobs, enable seamless flow of structured data. 
  • Scan the landscape of both internal and external data sources, propose extensions and updates of the data platform. Document data dictionary and ETL processes.
  • Own and upgrade the company's Data Lake in the cloud. Integrate event tracking and data off-loading into Data Lake, including text, images and video files. Ensure integrations with API gateways and down-streaming consuming services. 
  • Design and man the API integrations and automated data robots for external data ingestion. Design internal API microservices to support and enable the data exchange among products, systems and external 3rd party applications. 
  • Dock the Machine learning and Computer vision models into data pipelines, design the data flow for those AI services. 
  • Work closely with Engineering teams and Data team members, to steer or support projects aimed at data tools and data product creation.
  • Interact with many stakeholders, incl. department leads and senior executives, to translate their business needs into extensions or adaptations of our internal data troves.

Skills, Qualifications & Education 

  • Bachelor’s degree in Computer Science, Machine Learning or Artificial Intelligence
  • At least 5 years of work experience in Business Intelligence, Data analytics, Controlling (or similar analytical roles)
  • Have demonstrated aptitude for working with data both on a structured and unstructured basis. Does not shy to troubleshoot failed ETL processes or API integrations. 
  • Robust Python literacy (esp. for data handling) is a must.
  • Skills with Spark or Typescript are a  plus
  • Strong expertise in SQL (will be tested during the recruitment process). 
  • Well-versed in cloud AWS services, especially around different data handling services from AWS suite. Understands the mutual interdependencies, navigates in linking and tracking individual data handling components. 
  • Hands-on  experience with implementing and gearing APIs for data transfers
  • Working knowledge of Machine learning, NLP and Computer vision algorithms and solutions
  • Experience with Deep learning, GAI and Data crawling automation are a plus 
  • Outstanding data structure blueprinting skills
  • Strong ability to translate ideas between technical and non-technical audiences
  • Solid  business acumen, and experience working with non-technical stakeholders or E-commerce experience (from past roles) are a strong plus
  • Keen to work in a collaborative team environment, eager to give-and-take exchange with other team members and/or regular knowledge sharing
  • Curious, insights and root-causes hungry. Thriving for visible business impact of own tasks and processes. Eager to test-and-learn new approaches.
  • Flexible, self-motivated.  Organized and structured, able to work on multiple projects simultaneously, sometimes against tight deadlines.

We love our values

We’re building a community, our chosen circle, around a set of values that guide how we work and interact with the world around us. Our cultural norms at work can’t be turned off when the computer’s away -- we live these in every part of our lives. Our team isn’t for everyone, so if you’re right for it, the following values should resonate strongly with how you live your life.

Deliver for our customer COMMUNITY: We are committed to making our customers successful.

Do it as a TEAM: We actively listen to diverse perspectives and respond empathetically.

Help each other GROW: We are willing to get uncomfortable for the sake of our growth.

OWN it: We do our part to reach the team's shared goals and hold ourselves and others accountable.

DRIVE forward: We are determined to innovate for impact.

Apply now Apply later
  • Share this job via
  • or

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Job stats:  57  12  0

Tags: APIs AWS Business Intelligence Computer Science Computer Vision Data Analytics Data pipelines Data warehouse Deep Learning E-commerce Engineering ETL Machine Learning Microservices NLP Pipelines Python Spark SQL Streaming TypeScript Unstructured data

Perks/benefits: Career development Flex hours Flex vacation

Regions: Remote/Anywhere Asia/Pacific Europe
Countries: Germany India

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.