Data Engineer
New York City / Remote
Sotheby's
Sotheby's is the premier destination for auctions and private sales of Contemporary, Modern & Impressionist, Old Master Paintings, Jewelry, Watches, Wine, Decorative Arts, Asian Art & moreABOUT SOTHEBY'S
Established in 1744, Sotheby’s is the world’s premier destination for art and luxury. Synonymous with innovation, Sotheby’s promotes access, connoisseurship and preservation of fine art and rare objects through auctions, private sales and retail locations. Our trusted global marketplace is supported by a network of specialists spanning 40 countries and 50 categories, which include Contemporary Art, Modern and Impressionist Art, Old Masters, Chinese Works of Art, Jewelry, Watches, Wine and Spirits, and Interiors, among many others.
THE ROLE
Sotheby’s Data Engineering team is transforming a 200 year old business by using machine learning, cloud technologies, and real time analytics. Our platform processes multiple types of data including 10+ million images, 1+ billion transactions, and 10+ million objects. We work with data scientists, business analysts, and software engineers to store, process, and retrieve data. This role will be responsible for building data pipelines, developing new tools, and implementing access controls.
RESPONSIBILITIES
- Develop new data ingestion pipelines from external and internal API’s using python
- Create data processing, monitoring, and alerting tools for business analysts, data scientists, and software engineers
- Implement data access controls in GCP using terraform
- Improve system stability and code quality using version control, automated testing, and continuous integration
- Identify, resolve, and prevent failures
IDEAL EXPERIENCE & COMPETENCIES
- Bachelor’s degree in a quantitative field or equivalent experience
- Programming experience, preferably in Python and SQL
- Familiarity with cloud development environments such as AWS and GCP
- Understanding of Role Based Access Controls (RBAC)
- Able to use orchestration frameworks such as Airflow, Kubernetes, and Docker Swarm
- Working knowledge of containers
- Willing to learn new programming languages, tools, and frameworks
- Curiosity about problems and a desire to solve them
To view our Candidate Privacy Notice for the US, please click here.
To view our Candidate Privacy Notice for the UK, Hong Kong, France and Switzerland, please click here.
The Company is an equal opportunity employer and considers all applicants for employment without regard to race (including, without limitation, traits historically associated with race, such as natural hair, hair texture, and protective and treated or untreated hairstyles), color, creed, religion, sex, sexual orientation, marital or civil partnership/union status, national origin, age, disability, pregnancy, genetic predisposition, genetic information, reproductive health decision, sexual orientation, gender identity or expression, alienage or citizenship status, domestic violence victim status, military or veteran status, or any other characteristic protected by federal, state/province or local law. The Company complies with applicable state and local laws prohibiting discrimination in employment in every jurisdiction in which it operates.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Airflow APIs AWS Data pipelines Docker Engineering GCP Kubernetes Machine Learning Pipelines Privacy Python SQL Terraform Testing
Perks/benefits: Career development
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Lead Data Analyst jobs
- Open MLOps Engineer jobs
- Open Data Science Manager jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Manager jobs
- Open Data Engineer II jobs
- Open Power BI Developer jobs
- Open Principal Data Engineer jobs
- Open Sr Data Engineer jobs
- Open Data Analytics Engineer jobs
- Open Business Intelligence Developer jobs
- Open Junior Data Scientist jobs
- Open Data Scientist II jobs
- Open Product Data Analyst jobs
- Open Senior Data Architect jobs
- Open Sr. Data Scientist jobs
- Open Business Data Analyst jobs
- Open Big Data Engineer jobs
- Open Data Analyst Intern jobs
- Open Manager, Data Engineering jobs
- Open Azure Data Engineer jobs
- Open Data Product Manager jobs
- Open Data Quality Analyst jobs
- Open Junior Data Engineer jobs
- Open Principal Data Scientist jobs
- Open Data quality-related jobs
- Open Business Intelligence-related jobs
- Open GCP-related jobs
- Open ML models-related jobs
- Open Data management-related jobs
- Open Java-related jobs
- Open Privacy-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open APIs-related jobs
- Open Deep Learning-related jobs
- Open PyTorch-related jobs
- Open TensorFlow-related jobs
- Open PhD-related jobs
- Open Consulting-related jobs
- Open Snowflake-related jobs
- Open NLP-related jobs
- Open CI/CD-related jobs
- Open Kubernetes-related jobs
- Open Data governance-related jobs
- Open Airflow-related jobs
- Open Hadoop-related jobs
- Open LLMs-related jobs
- Open Generative AI-related jobs
- Open Databricks-related jobs