Data Engineer - Intern
Dallas, Texas, United States
UWorld is a worldwide leader in online test prep for college entrance, undergraduate, graduate, and professional licensing exams throughout the United States. Since 2003, over 2 million students have trusted us to help them prepare for high-stakes examinations.
UWorld is seeking a Data Engineering Intern who is passionate about creating an excellent user experience and enjoys taking on new challenges. The intern will be responsible for understanding data, building, testing, and deploying of Data Analytics pipelines and reports, and making improvements to our Data warehouse platform.
Minimum Experience
- Masters/Bachelor's degree in Computer Science or a related field.
- Strong in: Python, Spark/PySpark, Big Data Platforms (Databricks/Delta Lake), Database/Data warehousing fundamentals (MS SQL Server/MySQL), Unix/Linux Shell scripting
- Hands on experience in SQL, Stored procedures, and Relational Databases (MS SQL Server/MySQL/Oracle), Data warehousing.
- Hands on experience with Data Analysis, ingestion, cleansing, transformations, validation, verification, and presentation (Reports/Dashboards)
- Experience with Tableau/Power BI, NoSQL (MongoDB), and Kafka is a plus.
- Strong in Programming, Problem solving, Analytical ability, Applications of Data structures and Algorithms
- Familiarity with REST API, Web Services, JSON, and Cloud environments (Azure, AWS, GCS), and machine learning.
Job Responsibilities
The Data Engineer will perform the following duties:
- Understand Data Services and Analytics needs across the organization and work on the Data warehouse and Reporting infrastructure to empower them with accurate information for decision-making.
- Develop and maintain a Data warehouse that aggregates data from multiple content sources including Salesforce, NoSQL DBs, RDBMS, social media, other 3rd party web services (RESTful, JSON), flat-file stores, and Application databases (OLTPs).
- Use Python, Spark/PySpark, Databricks, Delta Lake, SQL Server, Maria DB, Mongo DB, Jira, Git/Bit Bucket, Confluence, Databricks/Delta Lake, REST services, Tableau, Unix/Linux shell scripting, Azure Cloud for data ingestion, processing, transformations, warehousing, and reporting.
- Develop scalable data pipelines using Data connectors, distributed processing transformations, schedulers, and data warehouse
- Understanding of data structures, analytics, data modeling, and software architecture
- Develop, modify, and test algorithms that can be used in scripts to store, locate, cleanse, verify, validate, and retrieve specific documents, data, and information
- Develop analytics to understand product sales, marketing impact, and application usage for UWorld products and applications
- Employ best practices for code sharing and development to ensure common code base abstraction across all applications. Continuously be up to date on the industry standard practices on big data and analytics and adopt solutions to the UWorld data warehousing platform.
- Work with QA engineers to ensure the quality and reliability of all reports, extracts, and dashboards by process of continuous improvement.
- Collaborate with technical architects, developers, subject matter experts, QA team, and customer care team to drive new enhancements or fix bugs in a timely manner.
- Work in an agile environment such as Scrum
Soft Skills
- Working proficiency and communication skills in verbal and written English
- Excellent attention to detail and organization skills and ability to articulate ideas clearly and concisely
- Ability to work effectively within a changing environment that is going through high growth
- Exceptional follow-through, personal drive, and ability to understand direction and feedback
- Positive attitude with a willingness to put aside ego for the sake of what is best for the team
At UWorld, we believe strength is derived from the talents, ideas, and experiences of a diverse workforce. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, or any other protected class. UWorld is proud to be an equal opportunity employer providing a drug-free workplace. If you have a disability or special need that requires accommodation, please let us know.
Tags: Agile APIs Architecture AWS Azure Big Data Computer Science Confluence Data analysis Data Analytics Databricks Data pipelines Data warehouse Data Warehousing Engineering Git Jira JSON Kafka Linux Machine Learning MongoDB MS SQL MySQL NoSQL Oracle Pipelines Power BI PySpark Python RDBMS REST API Salesforce Scrum Shell scripting Spark SQL Tableau Testing
Perks/benefits: Career development
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Marketing Data Analyst jobs
- Open MLOps Engineer jobs
- Open Junior Data Scientist jobs
- Open AI Engineer jobs
- Open Data Engineer II jobs
- Open Senior Data Architect jobs
- Open Power BI Developer jobs
- Open Senior Business Intelligence Analyst jobs
- Open Data Analytics Engineer jobs
- Open Sr Data Engineer jobs
- Open Manager, Data Engineering jobs
- Open Principal Data Engineer jobs
- Open Business Data Analyst jobs
- Open Product Data Analyst jobs
- Open Data Quality Analyst jobs
- Open Data Manager jobs
- Open Sr. Data Scientist jobs
- Open Big Data Engineer jobs
- Open Data Scientist II jobs
- Open Business Intelligence Developer jobs
- Open Data Analyst Intern jobs
- Open ETL Developer jobs
- Open Principal Data Scientist jobs
- Open Azure Data Engineer jobs
- Open Data Product Manager jobs
- Open Business Intelligence-related jobs
- Open Data quality-related jobs
- Open Privacy-related jobs
- Open Data management-related jobs
- Open GCP-related jobs
- Open Java-related jobs
- Open ML models-related jobs
- Open Finance-related jobs
- Open Data visualization-related jobs
- Open Deep Learning-related jobs
- Open APIs-related jobs
- Open PyTorch-related jobs
- Open PhD-related jobs
- Open Consulting-related jobs
- Open TensorFlow-related jobs
- Open Snowflake-related jobs
- Open NLP-related jobs
- Open Data governance-related jobs
- Open Data warehouse-related jobs
- Open Airflow-related jobs
- Open Hadoop-related jobs
- Open Databricks-related jobs
- Open LLMs-related jobs
- Open DevOps-related jobs
- Open CI/CD-related jobs