Observability Architect/Lead Engineer - DevOps, ELK, GCP, AWS, GHE, CICD
Bengaluru (Gopalan Axis SEZ)
Become a better developer by building things that help local businesses around the world thrive
Are you a passionate, energetic technology enthusiast eager to work at a rapid pace with the flexibility to work across a broad tech stack?
Groupon is an experiences marketplace that brings people more ways to get the most out of their city or wherever they may be. By enabling real-time mobile commerce across local businesses, live events, and travel destinations, Groupon helps people find and discover experiences––big and small, new and familiar––that make for a full, fun, and rewarding life. Groupon helps local businesses grow and strengthen customer relationships––resulting in strong, vibrant communities. With employees spread across multiple continents, we still maintain a culture that inspires innovation, rewards risk-taking, and celebrates success. Our culture encourages employees to embrace change, adapt to new circumstances, and find creative solutions to the challenges we face. Does that sound like a great way to grow your career? Let’s get into the details:
We are looking for a DevOps engineer who can help in IST Time zone. This person will own the deployment of Groupon’s ELK, CICD, monitor and troubleshoot production issues.
Groupon’s Platform team, consisting of 7 departments, creates innovative solution, Keep The Lights On, and help migrating services between cloud provider. This particular DevOPS department has 4 team members already, we are looking for new and experienced addition.
Position Summary:
The Observability Architect is responsible for designing and implementing comprehensive observability solutions to ensure the health, performance, and reliability of enterprise applications and infrastructure. This role involves defining strategies and frameworks for monitoring, logging, tracing, and alerting, enabling proactive issue detection and resolution. The Observability Architect will collaborate with cross-functional teams to build scalable observability systems that support the organization’s business objectives.
Key Responsibilities:
Develop and maintain the observability strategy, ensuring alignment with business goals and technology standards.
Create frameworks and best practices for observability, including monitoring, logging, tracing, and alerting.
Design and implement scalable observability solutions across various platforms and technologies.
Integrate observability tools and platforms (e.g., Prometheus, Grafana, ELK Stack, Jaeger) into existing infrastructure.
Ensure end-to-end visibility into system performance, health, and reliability.
Administer GitHub Enterprise Server including upgrade and maintenance.
Ability to design CI/CD flows, develop maintainable & extensible code/pipelines
Work closely with DevOps, development, and IT operations teams to integrate observability practices into the software development lifecycle.
Partner with stakeholders to understand requirements and translate them into observability solutions.
Analyze monitoring and logging data to identify trends, patterns, and potential issues.
Develop dashboards and reports to provide insights into system performance and reliability.
Use observability data to drive continuous improvement initiatives.
Establish and maintain alerting and escalation processes for timely issue detection and resolution.
Lead incident response efforts, utilizing observability tools to diagnose and resolve issues.
Stay updated with the latest trends and advancements in observability and monitoring technologies.
Continuously evaluate and enhance observability tools and practices to improve system reliability and performance.
We’re excited about you if you have:
Bachelor’s degree in Computer Science, Information Technology, or a related field.
Minimum of 9 years of experience in observability, monitoring, or related fields.
Proven experience in designing and implementing observability solutions.
Proficiency in observability tools and platforms (e.g., Prometheus, Grafana, ELK Stack, Jaeger, Splunk).
Strong understanding of cloud infrastructure (AWS, Azure, Google Cloud) and containerization technologies (Docker, Kubernetes).
Familiarity with scripting and automation (e.g., Python, Shell, Ansible).
Excellent problem-solving and analytical skills.
Strong communication and collaboration abilities.
Ability to work independently and as part of a team.
Attention to detail and a commitment to delivering high-quality work.
Certifications in relevant technologies (e.g., AWS Certified DevOps Engineer, Certified Kubernetes Administrator).
Experience with AIOps and machine learning for observability.
Working Conditions:
This role may require on-call responsibilities for incident management and resolution.
Groupon’s purpose is to build strong communities through thriving small businesses. To learn more about the world’s largest local ecommerce marketplace, click here. You can also find out more about us in the latest Groupon news as well as learning about our DEI approach. If all of this sounds like something that’s a great fit for you, then click apply and join us on a mission to become the ultimate destination for local experiences and services.
Beware of Recruitment Fraud: Groupon follows a merit-based recruitment process without charging job seekers any fees. We've noticed an increase in recruitment fraud, including fake job postings and fraudulent interviews and job offers aimed at stealing personal information or money. Be cautious of individuals falsely representing Groupon's Talent Acquisition team with fake job offers. If you encounter any suspicious job offers or interview calls demanding money, recognize these as scams. Groupon is not responsible for losses from such dealings. For legitimate job openings, always check our official careers website at grouponcareers.com.
* Salary range is an estimate based on our AI, ML, Data Science Salary Index 💰
Tags: Ansible AWS Azure CI/CD Computer Science DevOps Docker E-commerce ELK GCP GitHub Google Cloud Grafana Kubernetes Machine Learning Pipelines Python Security Splunk
Perks/benefits: Career development Team events
More jobs like this
Explore more AI, ML, Data Science career opportunities
Find even more open roles in Artificial Intelligence (AI), Machine Learning (ML), Natural Language Processing (NLP), Computer Vision (CV), Data Engineering, Data Analytics, Big Data, and Data Science in general - ordered by popularity of job title or skills, toolset and products used - below.
- Open Data Science Manager jobs
- Open Research Scientist jobs
- Open Junior Data Analyst jobs
- Open Business Data Analyst jobs
- Open Principal Data Scientist jobs
- Open Data Scientist II jobs
- Open Sr Data Engineer jobs
- Open BI Analyst jobs
- Open Business Intelligence Engineer jobs
- Open Data Science Intern jobs
- Open Sr. Data Scientist jobs
- Open Lead Data Analyst jobs
- Open Senior Business Intelligence Analyst jobs
- Open Azure Data Engineer jobs
- Open Software Engineer, Machine Learning jobs
- Open Junior Data Scientist jobs
- Open MLOps Engineer jobs
- Open Manager, Data Engineering jobs
- Open Marketing Data Analyst jobs
- Open Data Analytics Engineer jobs
- Open Data Engineer III jobs
- Open Junior Data Engineer jobs
- Open Data Engineering Manager jobs
- Open Product Data Analyst jobs
- Open Senior Software Engineer jobs
- Open Power BI-related jobs
- Open GCP-related jobs
- Open Tableau-related jobs
- Open Excel-related jobs
- Open ML models-related jobs
- Open Data pipelines-related jobs
- Open APIs-related jobs
- Open PhD-related jobs
- Open PyTorch-related jobs
- Open Finance-related jobs
- Open LLMs-related jobs
- Open TensorFlow-related jobs
- Open Deep Learning-related jobs
- Open Consulting-related jobs
- Open Data visualization-related jobs
- Open Generative AI-related jobs
- Open Business Intelligence-related jobs
- Open Data governance-related jobs
- Open CI/CD-related jobs
- Open NLP-related jobs
- Open DevOps-related jobs
- Open Kubernetes-related jobs
- Open Docker-related jobs
- Open Git-related jobs
- Open Snowflake-related jobs