V ery good knowledge of Log analysing monitoring tools like Prometheus, Loki, Dynatrace, Grafana SolarWinds. Understanding of Infrastructure environments Cloud, VMware, storage, networks, databases, etc.
What ll be Responsible for
- Design and development of security policies, standards, and procedures in accordance with organization goals.
- Responsible for proactive monitoring of alerts (Network, Infra, Applications) and taking corrective actions.
- Responsible for Incident Management life cycle Service requests fulfilment
- Responsible for Incident logging, accurately tracks and documents all incidents.
- Adherence to the process compliance
- Adherence to the SLAs defined for the platform, Service uptime.
- Coordination with cross-group peers both proactively and reactively produces quality documentation and share with the appropriate team members.
- Responsible to develop SOP documents.
- Ability to deep dive into identifying the root cause of various service-impacting events and optimizing.
- Act as a First Point of Contact for incidents, escalations, and business-impacting technology issues
- To ensure the maximum possible service availability and performance of the platforms
- Responsible for continuous improvement of the process science.
What you d have
- Experience of 4-6 years in NOC
- Experience in Alert/Incident Management and a good understanding of SLAs
- Troubleshooting, Problem-solving Strong presentation skills
- Analytical and communication skills
- Strong knowledge of Linux, Network database querying
- Knowledge of asset management
- Very good knowledge of Log analysing monitoring tools like Prometheus, Loki, Dynatrace, Grafana SolarWinds
- Understanding of Infrastructure environments Cloud, VMware, storage, networks, databases, etc.
- Strong Linux, Networking, Log analysing, and database querying skills.
- Must have experience with monitoring tools like Prometheus, Loki, Grafana, and Dynatrace building monitoring dashboards.
- Experience in alerts mitigation optimization - Knowledge of the ITIL framework
- Hands-on exp with observability tools will be an added advantage.
- Must have expertise in maintaining/updating asset management.
- Certifications: ITIL foundation, AZ-900, Shell Scripting, Python, Hardware networking.
Skills: Asset Management, Prometheus, Shell Scripting, Grafana, Itil, Solarwinds, Hardware Networking, Linux, Database, Network, Dynatrace, Python
Experience: 4.00-6.00 Years