Senior Site Reliability Developer
Vena Solutions
Software Engineering
Indore, Madhya Pradesh, India
INR 2,542,914-3,440,412 / year
Posted on Apr 3, 2026
Department: Cloud Engineering
Location: India - Resident (0008)
Description
We are seeking a Senior Site Reliability Developer to join the SaaS Technology & Operations (STO) team supporting the Acterys platform. This is a senior individual contributor role focused on designing, building, and operating highly reliable, scalable, and automated cloud infrastructure on Microsoft Azure.
This role requires strong technical depth, sound engineering judgment, and the ability to translate customer business requirements into durable technical solutions. The ideal candidate combines software engineering discipline with operational rigor, driving reliability improvements, automation, and observability across a globally distributed SaaS platform operating in a SOC 2 Type II–compliant environment.
How You'll Make An Impact
Please note this reflects only a portion of our current technical stack, and we are constantly evolving and revisiting our stack as we grow:
Location: India - Resident (0008)
Description
We are seeking a Senior Site Reliability Developer to join the SaaS Technology & Operations (STO) team supporting the Acterys platform. This is a senior individual contributor role focused on designing, building, and operating highly reliable, scalable, and automated cloud infrastructure on Microsoft Azure.
This role requires strong technical depth, sound engineering judgment, and the ability to translate customer business requirements into durable technical solutions. The ideal candidate combines software engineering discipline with operational rigor, driving reliability improvements, automation, and observability across a globally distributed SaaS platform operating in a SOC 2 Type II–compliant environment.
How You'll Make An Impact
- Help build and evolve scalable, resilient cloud systems using best practices in automation, reliability engineering, and developer enablement.
- Support services before they go live through system design consultation, infrastructure design, CI/CD implementation, production readiness reviews, and reliability assessments.
- Design, implement, and maintain infrastructure-as-code (Terraform) and deployment pipelines to ensure repeatable, consistent environments.
- Define, document, and continuously improve runbooks and standard operating procedures governing platform operations.
- Maintain production services by measuring and monitoring availability, latency, performance, and overall system health.
- Design and implement observability solutions leveraging Azure-native telemetry, logging, and monitoring tools.
- Drive high standards around incident response, root cause analysis, and post-incident remediation, with a focus on automation and systemic improvement.
- Craft and refine operational procedures that define how the Acterys platform is deployed, monitored, supported, and maintained.
- Provide mentorship and technical guidance to other engineers across STO and Development.
- Participate in technical interviews for engineering roles as needed.
- Participate in the on-call rotation and contribute to ensuring operational readiness and reliability.
- Other duties, as assigned.
Please note this reflects only a portion of our current technical stack, and we are constantly evolving and revisiting our stack as we grow:
- Microsoft Azure cloud infrastructure (global footprint across EU, APAC, US)
- Azure App Service
- Azure SQL
- Azure PowerBI and Microsoft Fabric
- Infrastructure-as-Code using Terraform
- CI/CD pipelines leveraging Azure DevOps
- Multi-deployment architecture:
- SaaS
- Hybrid (SaaS + customer on-premise database
- Fully on-premise deployments
- Observability and monitoring leveraging Azure-native telemetry and monitoring solutions
- 6+ years of experience in DevOps, Site Reliability Engineering, Cloud Operations, or Software Engineering roles.
- Strong expertise operating and supporting production workloads on Microsoft Azure.
- Demonstrated experience implementing and operating infrastructure-as-code using Terraform.
- Deep understanding of core SRE concepts including SLOs, SLIs, error budgets, and reliability metrics.
- Experience building and maintaining modern CI/CD pipelines.
- Strong programming capability in at least one language, with production-grade implementation experience.
- Strong experience with observability practices including telemetry, centralized logging, alerting, and performance monitoring.
- Experience operating within regulated or compliance-driven environments (SOC 2 Type II preferred).
- Azure Associate- or Professional-level certifications.
- Experience supporting hybrid or on-premise deployment models.
- Familiarity with ITIL-aligned incident and change management practices.
- Exposure to AWS environments in addition to Azure.
- Ability to independently scope, design, and deliver technical solutions in environments with evolving or ambiguous requirements.
- Our salaries are tailored to roles, levels and locations. Your individual pay within this range is influenced by factors like work location, skills, experience and education. As you progress in your role, your compensation may adapt, offering flexibility for growth beyond initial levels. For specifics, your recruiter will provide details and address any questions during the hiring process.