Company

Honeywell

Address: Secunderabad, Telangana
Category: IT

Job description

An SRE engineer collaborates with Development, Test, and IT Operations to build and deploy scalable, reliable software systems for on-premises and cloud-based deployments (preferably Azure), with a strong command of microservices deployments.

The SRE engineer is responsible for operating applications in mission-critical production systems and taking the actions needed to keep the site up and running.

Key focus areas of SRE:

  • Automation
  • Monitoring (application monitoring and log monitoring), e.g., ELK, EFK, OpenSearch, PLG, etc.
  • Tracing expertise in managed Kubernetes clusters (preferably Azure)
  • Service mesh tools expertise (for service-to-service communication), e.g., Istio, Cilium, Nginx, etc.
  • Service Level Objectives
  • Eliminating toil
  • Release engineering process
  • Embracing risk
  • Team collaboration
  • Championing culture

Roles and Responsibilities of an SRE engineer:

  • Work blamelessly, always assuming the best intentions, and finding systemic causes together.
  • Create an SRE culture that reinforces our SRE principles. Enhance and maintain uptime.
  • Celebrate failure as an investment in reliability. Learn from each one in ways that are empathetic and fair.
  • Treat reliability as a feature. Put reliability goals in specifications right from the start.
  • Share information within the teams and organization, and work in collaboration with other teams.
  • Monitor infrastructure using SRE tools and suggest tools as necessary.
  • Build monitoring alerts and incident response processes, using the monitoring systems (for alerting and dashboards).
  • Improve operational processes and team practices.
  • Code infrastructure automation across the CI/CD pipeline.
  • As the solution scales, ensure reliability through designing, building, and maintaining the core infrastructure.
  • Demonstrate strong programming skills and thorough knowledge of systems.
  • Bring about cultural shifts to provide a foundation for process changes.
  • After incidents, document the actions taken in runbooks so they can be turned into automated solutions for incident response.
  • Participate in the on-call rotation for incident response and proactive incident measures.
  • Administer production jobs and understand debugging information.
  • Drain traffic away from a cluster, block or rate-limit unwanted traffic, and bring up additional serving capacity.
  • Roll back a bad software push with minimal downtime.
  • Describe the architecture, components, and dependencies of the services to teams.
  • Provide visibility into the performance of the application and reduce the cost of failure to lower new feature cycle time.

Key Skills of an SRE engineer:

  • Familiarity with at least two coding/scripting languages (Python, Go, Java, .NET, C, C++, PowerShell, etc.).
  • Cloud competency (Microsoft Azure is preferred).
  • Deep understanding of key Azure services such as Azure Kubernetes Service (AKS), Databricks, Data Factory, API Management, Function Apps, Application Gateway, etc.
  • CI/CD process and tools (Jenkins, GitHub Actions, Azure DevOps, etc.).
  • Experience with service mesh and message broker services (Kafka).
  • Deep understanding of the containerization approach: Docker, Helm, Kubernetes.
  • IaC tools (Ansible, Terraform, Chef, Puppet, etc.).
  • Web servers (Apache HTTP and Nginx).
  • Version control (Git, GitHub, Bitbucket).
  • Experience in infrastructure monitoring (Datadog, Prometheus, Grafana, etc.).
  • Experience in log and performance monitoring (Splunk, ELK, New Relic, etc.).
  • Deep understanding of databases (SQL, NoSQL, PostgreSQL Flexible Server, etc.).

Candidate Profile

Honeywell is looking for an SRE engineer.

Education: Any Graduate / Post Graduate

Key Skills

Apache HTTP, Nginx, GitHub, Data Factory, API Management, C, ELK, Chef, Prometheus, Go, .NET, NoSQL, Docker, Terraform, Microsoft Azure, Python, Java, C++, PowerShell, Bitbucket, Git, Ansible, Databricks, Puppet, Helm, Kubernetes
Reference code: 997526. Honeywell - 2024-04-15 10:26
