Data Science Engineer

Remote - US / Canada

Applications have closed
GitHub
Build better together

Posted 4 months ago

As the world’s largest social coding platform, a home for Open Source development, and a core tool in the DevOps toolkit of many Fortune 500 companies, GitHub has some of the world’s most interesting data. 

GitHub’s Data Science team is looking for a data-curious individual to join us and leverage this wealth of business-, ecosystem-, and community-critical data for organization-wide impact. You will work with a diverse team of engineers and data scientists to design and build reusable data pipelines, patterns, and tooling that unlock insights for the company. You’ll also enable a diverse set of stakeholders across all levels of the company to make data-informed decisions about our products, strategy, and community trends.

Responsibilities

  • Identify business needs and translate them into requirements for unified data schemas, pipelines, and tools for company-wide impact
  • Design, develop, and own holistic, robust, and high-quality data pipelines (from ETL to Business Intelligence tools) that power internal datasets for other data scientists, product, engineering, and other business teams
  • Maintain and expand forecasting capabilities for the business at scale 
  • Help the Data Science team scale statistical models to large datasets
  • Develop and maintain tools that support internal analytics and data science needs, such as advanced visualizations, graph data structures, storage and querying, and data dictionaries

Minimum Qualifications

  • 3+ years of related experience in a data engineering or software engineering capacity, including experience in, or in close proximity to, a data science or data analytics capacity
  • Experience designing robust unified data schemas in a denormalized environment, and ETL pipelines in a distributed data framework (Hive, Hadoop, Spark, Presto, etc.)
  • Capable of developing reusable programmatic solutions for internal use, such as front-end applications and bots
  • Experience articulating business questions and using mathematical techniques to arrive at an answer using available data
  • Demonstrated leadership and self-direction
  • Demonstrated willingness to both teach others and learn new techniques
  • Demonstrated effective written and verbal communication skills
  • Experience doing analysis in either R or Python, knowledge of a SQL variant, and familiarity with software development in a Python stack (e.g., Flask)

Who We Are:

GitHub is the developer company. We make it easier for developers to be developers: to work together, to solve challenging problems, and to create the world’s most important technologies. We foster a collaborative community that can come together—as individuals and in teams—to create the future of software and make a difference in the world.

Leadership Principles:

Customer Obsessed - Trust by Default - Ship to Learn - Own the Outcome - Growth Mindset - Global Product, Global Team - Anything is Possible - Practice Kindness

Why You Should Join:

At GitHub, we constantly strive to create an environment that allows our employees (Hubbers) to do the best work of their lives. We've designed one of the coolest workspaces in San Francisco (HQ), where many Hubbers work, snack, and create daily. The rest of our Hubbers work remotely around the globe. Check out an updated list of where we can hire here: https://github.com/about/careers/remote

We are also committed to keeping Hubbers healthy, motivated, focused, and creative. We've designed our top-notch benefits program with these goals in mind. In a nutshell, we've built a place where we truly love working, and we think you will too.

GitHub is made up of people from a wide variety of backgrounds and lifestyles. We embrace diversity and invite applications from people of all walks of life. We don't discriminate against employees or applicants based on gender identity or expression, sexual orientation, race, religion, age, national origin, citizenship, disability, pregnancy status, veteran status, or any other differences. Also, if you have a disability, please let us know if there's any way we can make the interview process better for you; we're happy to accommodate!

Please note that benefits vary by country. If you have any questions, please don't hesitate to ask your Talent Partner.


Job tags: Business Intelligence, Data Analytics, Engineering, ETL, Hadoop, Open Source, Python, R, Spark, SQL