Senior Data Engineer (Scala/Spark) - Remote (copy)

Detroit, MI

Applications have closed
SemanticBits logo
SemanticBits

Posted 1 month ago

SemanticBits is a leading company specializing in the design and development of digital health services, and the work we do is just as unique as the culture we’ve created. We develop cutting-edge solutions to complex problems for commercial, academic, and government organizations. The systems we develop are used in finding cures for deadly diseases, improving the quality of healthcare delivered to millions of people, and revolutionizing the healthcare industry on a nationwide scale. There is a meaningful connection between our work and the real people who benefit from it; and, as such, we create an environment in which new ideas and innovative strategies are encouraged. We are an established company with the mindset of a startup and we feel confident that we offer an employment experience unlike any other and that we set our employees up for professional success every day.
SemanticBits is looking for a talented Data Engineer who is eager to apply computer science, software engineering, databases, and distributed/parallel processing frameworks to prepare big data for the use of data analysts and data scientists.
You will use Spark to build data processing pipelines that derive information from large sets of government data. You will be a subject matter expert for Spark, the Spark Engine, and the Spark Dataframe API. You will use that knowledge of Spark to teach others, inform design decisions, and debug runtime problems.

Tools & Technology

  • Spark, Hadoop, Scala, Python, and AWS EMR
  • Jupyter and Zeppelin
  • Airflow, Jenkins, and AWS Step Functions
  • AWS S3, AWS Redshift, and Teradata
  • GSuite, Slack, Jira, Confluence, Git, and Github

Responsibilities

  • Build scalable data processing pipelines in Spark
  • Debug Spark jobs and do performance tuning
  • Write unit and integration tests for all data processing code
  • Work with DevOps engineers on CI, CD, and IaC
  • Read specs and translate them into code and design documents
  • Perform code reviews and develop processes for improving code quality

Required Qualifications:

  • Bachelor's degree required, strong preference for Computer Science field of study
  • A minimum of 5 years of related professional experience
  • Highly Competent with Scala, Spark, the Spark Engine, and the Spark Data frame API
  • Experience with Agile methodology, using test-driven development.
  • Excellent command of written and spoken English
  • Candidate must reside in the United States
  • Flexible and willing to accept a change in priorities as necessary

Nice to have:

  • Experience working in the healthcare industry
  • Federal Government contracting work experience
  • Prior experience working remotely full-time

Physical and emotional requirements for the job:

  • This position is to be performed remotely from an individual’s home office and involves sedentary work. Employees in this role can be expected to exert up to 10 pounds of force on occasion in order to lift, carry, push, pull or otherwise move standard electronic equipment. Employees are expected to make decisions in a timely manner and display emotional intelligence during occasional stressful situations.
BenefitsCompetitive salaryThree weeks of PTOTen paid holiday daysComprehensive health benefits (medical with HSA option, dental, and vision)401k retirement plan with matching benefit100% paid short-term and long-term disability100% paid life insuranceFlexible Spending Accounts (FSA)Casual working environmentFlexible working hours
SemanticBits, LLC is an equal opportunity, affirmative action employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, protected veteran status, disability, or any other characteristic protected by law. We are also a veteran-friendly employer.
If you are an individual with a disability and require a reasonable accommodation to complete any part of the application process, or are limited in the ability or unable to access or use this online application process and need an alternative method for applying, you may contact 703-787-9656 x257 or HR@semanticbits.com for assistance.
Job tags: Airflow AWS Big Data Engineering Hadoop Healthcare Python Redshift Scala Spark
Job region(s): North America