Data Engineer
Monterrey, Nuevo Leon, Mexico
Applications have closed
DigitalOnUs
DATA ENGINEER
We are looking to hire a data analyst to join our data team and take responsibility for managing the master data set, developing reports, and troubleshooting data issues. To do well in this role, you need a very fine eye for detail, experience as a data analyst, and a deep understanding of the popular data analysis tools and databases.
Responsibilities:
• Use data wrangling techniques to convert data from one "raw" form into another, including data visualization, data aggregation, training a statistical model, etc.
• Work with various relational and non-relational data sources with the target being Azure based SQL Data Warehouse & Cosmos DB repositories
• Clean, unify and organize messy and complex data sets for easy access and analysis
• Create different levels of abstractions of data depending on analytics needs
• Perform hands-on data preparation using the Azure technology stack; experience with Azure Databricks is strongly preferred
• Implement discovery solutions for high-speed data ingestion
• Work closely with the Data Science team to perform complex analytics and data preparation tasks
• Work with the Sr. Data Engineers on the team to develop APIs
• Source data from multiple applications, then profile, cleanse and conform it to create master data sets for analytics use
• Experience with Complex Data Parsing (Big Data Parser) and Natural Language Processing (NLP) Transforms on Azure is a plus
• Design solutions for managing highly complex business rules within the Azure ecosystem
Minimum Requirements:
• Mid- to advanced-level knowledge of Python and PySpark is an absolute must.
• Knowledge of Azure, Hadoop 2.0 ecosystems, HDFS, MapReduce, Hive, Pig, Sqoop, Mahout, Spark, etc. is a must
• Experience with Web Scraping frameworks (Scrapy or Beautiful Soup or similar)
• Significant programming experience (with above technologies as well as Java, R and Python on Linux) a must
• Extensive experience working with Data APIs (Working with RESTful endpoints and/or SOAP)
• Excellent working knowledge of relational databases, MySQL, Oracle etc.
• Experience with Complex Data Parsing (Big Data Parser) a must. Should have worked on XML, JSON and other custom Complex Data Parsing formats
• Knowledge of High-Speed Data Ingestion, Real-Time Data Collection and Streaming is a plus
• Bachelor’s degree in computer science or a related field
• 3-5 years of solid experience in Big Data technologies a must
If you apply for this opportunity, we will receive your resume, which contains personal data whose processing has been authorized by its owner for Digital OnUs, S. de RL de CV (the "Company"). If you are not the owner of this information or have no relation whatsoever to the matters addressed in it, you are kindly requested not to make copies of it and/or its attached files and to delete it immediately, at the risk of being held responsible for the unauthorized processing of personal data in accordance with the Federal Law on Protection of Personal Data Held by Private Parties, its Regulations, and other applicable regulations. If you are the owner of personal data in the Company's possession and wish to obtain further information regarding the processing of your personal data or the exercise of your ARCO rights, please consult our comprehensive privacy notice at https://www.digitalonus.com/privacy-policy/
Tags: APIs Azure Big Data Computer Science Data analysis Databricks Data visualization Data warehouse HDFS JSON Linux MySQL NLP Oracle Privacy PySpark Python R RDBMS Spark SQL Statistics Streaming XML