Monitoring & Alerting Intern
Sundayapp
👋 About Us
At sunday, we’re on a mission to make payments in hospitality seamless. By allowing customers to pay in under 10 seconds with a simple QR code, we help restaurants improve service, increase turnover, and focus on what matters: great food and experiences.
We’re building the fastest way to pay in restaurants — and we’re just getting started.
💼 About the Role
We are looking for a driven Monitoring & Alerting Intern to join our Engineering team for a 4–6 month mission focused on improving the reliability and observability of our production systems.
Recent incidents in our production environment have revealed a need to significantly improve our monitoring and alerting capabilities. This internship will offer hands-on experience in infrastructure observability, while directly contributing to improved platform stability.
This is a high-impact role for someone eager to get into the nuts and bolts of modern DevOps and help build scalable systems that power real-time user experiences.
🔥 Key Responsibilities
- Audit Our Existing Setup: Conduct a thorough analysis of our current monitoring system (Datadog), identifying coverage gaps and improvement areas.
- Improve Alerting Logic: Design and implement smarter, more tailored alerts for critical parts of our infrastructure to reduce noise and improve incident response.
- Collaborate Across Teams: Work closely with Engineering, SREs, and Product to ensure alignment and adoption of new monitoring tools and workflows.
- Build Effective Dashboards: Create user-friendly dashboards and performance reports to surface key metrics and empower faster decision-making.
- Document Best Practices: Define and share monitoring guidelines and alerting standards to ensure long-term maintainability and scalability.
😊 About You
- A student or recent graduate in Computer Science, Engineering, or a related field
- Passionate about system reliability, DevOps practices, and building solid technical foundations
- Familiar with observability tools like Datadog, Prometheus, Grafana, etc. (Datadog is a plus!)
- Analytical and organized, with a keen attention to detail
- Comfortable working independently and managing your own roadmap
- Curious, eager to learn, and unafraid to ask questions
- Fluent in English (French is a plus)
⛳️ Compensation, Perks & Benefits
- 1200 euros per month
- 50 euros for remote working costs
- 50% of public transportation costs
- The chance to work on a mission-critical project from day one
- Exposure to real production infrastructure and incident management practices
- A fun, fast-moving, and supportive team
- Flexible working hours and location
- Learning and mentorship opportunities from seasoned engineers
💫 Why It Matters
By strengthening our monitoring and alerting systems, you’ll help reduce incident resolution times, improve platform reliability, and ultimately deliver a better experience for our users. This internship will not only elevate our tech but also contribute to a safer and more efficient production environment.
Thank you for taking the time to apply, and looking forward to getting to know you!
Sunday is an equal opportunity employer and does not discriminate and all qualified applicants will receive consideration for employment without regard to race, creed, color, sex, affectional or sexual orientation, gender identity or expression, gender, ethnicity, religion, national origin, ancestry, nationality, age, disability, marital status, veteran status, genetic information, or on any other basis prohibited by law (except where an attribute is a bona fide occupational qualification).