Senior Data Engineer

Remote - Warsaw, Masovian Voivodeship, Poland

Sunscrapers

Software development and consulting company that enables businesses to grow ➤ Hire top Python & Django developers from Poland ➤ Let’s talk!

View company page

Sunscrapers is an elite Python and JavaScript development company that helps clients set up dedicated development teams in Poland, made of the most talented, experienced and motivated developers.

Since 2010, we’ve been working with most ambitious US and European scaleups, SMBs and enterprises on delivering digital products and extending in-house development teams.

Clutch.io currently ranks us as #1 Python/Django development company under 50 people worldwide.

The project:

We are carrying out the project for our client, an American private equity and investment management fund - listed on the Forbes 500 list - based in New York.

We support them in the area of data pipeline, infrastructure and data engineering team. They operate very widely in the world of finance, loans, investments and real estate.

As a Senior Data Engineer you’ll design and implement core systems that enable data science and data visualization at companies that implement data-driven decision process to create a competitive advantage. You’ll build data lakes, data warehouses and data pipelines using:

  • Technologies: Python, Terraform, SQL, Pandas, NumPy, Shell scripts
  • Tools: Apache Airflow, Jupyter Notebook, Docker, Kubernetes, Vagrant, Snowflake, Gitlab, TeamCity/Jenkins, Artifactory, Windows 10, SQLAlchemy
  • AWS: EC2, ELB, IAM, RDS, Route53, S3
  • Best Practices: Continuous Integration, Code Reviews, Scrum

The ideal candidate will be well organized, eager to constantly improve and learn, driven and, most of all - a team player!


Your responsibilities will include:

  • Developing data technology stacks including data lakes, data warehouses and ETL pipelines
  • Building data flows for fetching, aggregation and data modeling using batch and streaming pipelines
  • Developing reusable library code, infrastructure and toolsets for data scientists
  • Designing datasets and schemes for consistency and easy access
  • Creating solutions that enable data scientists and business analysts to be self-sufficient as much as possible.
  • Documenting design decisions before implementation

Requirements

What's important for us?

  • At least 5+ years of professional experience in data-related role
  • Undergraduate or graduate degree in Computer Science, Engineering, Mathematics, or similar
  • Expertise in Python and SQL languages
  • Experience in designing data warehouses (Snowflake or Redshift)
  • Experience with different types of database technologies (RDBMS, noSQL, etc.)
  • Experience in building ETL processes and data pipelines with platforms like Airflow or Luigi
  • Expertise in AWS stack and services
  • Proficiency in using Docker
  • Great analytical skills and attention to detail - asking questions and proactively searching for answers
  • Excellent command in spoken and written English, at least C1
  • Creative problem-solving skills
  • Excellent technical documentation and writing skills
  • Ability to work with both Windows and Unix-like operating systems as the primary work environments


You will score extra points for:

  • Experience with infrastructure-as-code tools, like Terraform
  • Familiarity with data visualization in Python using either Matplotlib, Seaborn or Bokeh
  • Proficiency in statistics and machine learning, as well as Python libraries like Pandas, NumPy, matplotlib, seaborn, scikit-learn, etc
  • Knowledge of any Python web framework, like Django or Flask with SQLAlchemy
  • Experience in operating within a secure networking environment, like a corporate proxy
  • Experience in working with repository manager, for example Jfrog Artifactory

Benefits

What do we offer?

  • Working alongside a talented team of software engineers who are changing the image of Poland abroad
  • Culture of teamwork, professional development and knowledge sharing (https://www.youtube.com/user/sunscraperscom)
  • Flexible working hours and remote work possibility
  • Comfortable office in central Warsaw, equipped with all the necessary tools for conquering the universe (Macbook Pro/Dell, external screen, ergonomic chairs)


Sounds like a perfect place for you? Don’t hesitate to click apply and submit your application today!

* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰

Tags: Airflow AWS Computer Science Data pipelines Data visualization Django Docker EC2 Engineering ETL Finance Flask GitLab JavaScript Jupyter Kubernetes Machine Learning Mathematics Matplotlib NoSQL NumPy Pandas Pipelines Python RDBMS Redshift Scikit-learn Scrum Seaborn Snowflake SQL Statistics Streaming Terraform

Perks/benefits: Career development Flex hours Gear

Regions: Remote/Anywhere Europe
Country: Poland
Job stats:  3  3  0
Category: Engineering Jobs

More jobs like this

Explore more AI, ML, Data Science career opportunities

Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.