KR
Kishore Kumar Raju
Senior Site Reliability Engineer · Kubernetes · AWS
Overview
Skills
Experience
Education
Results-oriented Senior Site Reliability Engineer with 7+ years of experience driving infrastructure reliability, cost optimization, and automation in high-availability cloud environments. Specialist in Kubernetes, AWS, SRE best practices, and mentoring technical teams for improved operability and onboarding. Proven record of reducing deployment times, improving system observability, and leading migration projects for enterprise-scale systems.
Cloud & Infrastructure
AWS
OCI
High availability
Disaster Recovery
Containers & Orchestration
Kubernetes
Docker
EKS
Monitoring & Observability
PagerDuty
Datadog
Kibana
ELK Stack
Automation & CI/CD
Jenkins
Terraform
Atlantis
Shell scripting
Python
Messaging & Serverless
Kafka
Oracle Functions
API Gateway
SRE & Leadership
Incident management
RCA
Post-mortems
On-call
Technical mentoring
Process improvement
Lead AWS cost optimization, including migration from Intel to Graviton instances for significant savings.
Architected debugging processes and improved deployment observability across teams.
Managed and scaled Kafka clusters for high-throughput, low-latency event streaming.
Automated backup log delivery to S3 using Python tools across production and non-prod environments.
Owned incident management with PagerDuty; built strong communication and on-call processes.
Mentored and onboarded junior engineers, enhancing SRE culture and knowledge sharing.
Migrated infrastructure components, notably Elasticsearch, and led tooling and process improvements.
Identified and resolved deployment bottlenecks, reducing deployment times by 40%.
Automated AMI creation for Redis, QueueWorkers, and Elasticsearch, accelerating infrastructure recovery and DR.
Developed dashboards and alerts for real-time application and log monitoring.
Led RCA documentation, implemented monitors to prevent recurring infrastructure issues.
Automated Docker/ECR workflows with Jenkins, including version management and EKS deployment.
Designed scalable backup log tools using Python and integrated daily S3 backups.
Supported feature release planning and infrastructure scaling based on historical load metrics.
Collaborated on optimizing Elasticsearch queries and ELK stack logging pipelines.
Led migration from monolithic to serverless architectures; reduced maintenance and costs using Oracle Functions.
Automated migration of 300+ resources from Ravello to OCI; enhanced resource security via API Gateway and Fn Project.
Improved infrastructure provisioning using Atlantis and Terraform pull request automation.
Supported RESTful service development with JSON-based integrations.
Contributed to an in-house migration tool for OCI-C to OCI resource transitions.
B.Tech in Computer Science
Amrita Vishwa Vidyapeetham, Coimbatore
May 2018 · CGPA: 7.8