Position Details: Senior Infrastructure / NOC - Engineer (DevOps)

Location: Chennai, Tamil Nadu
Openings: 1
Salary Range:
Job Type: Full Time


Job Summary
Shift: Afternoon (2:00 PM to 11:00 PM IST); candidates must be flexible

Must have good communication skills
Minimum of 10 years of work experience

Specific Job Responsibilities:

  • Act as a primary interface to business users for all IT support issues
  • Record and triage incoming client requests and system monitoring alert notifications
  • Perform access management activities and conduct patch/remediation efforts
  • Troubleshoot and resolve software, hardware, security and network issues via defined runbooks; escalate unresolved issues to senior engineers in a timely manner
  • Extend personal capabilities through training, reading and technical project work
  • Contribute to technical infrastructure and improve internal tooling.
  • Improve automation by writing new tools and contributing to existing ones.
  • Design, build and maintain highly scalable, fault-tolerant and performant systems in conjunction with our engineering teams.
  • Provision, optimize and maintain our cloud-based infrastructure.
  • Provide end-to-end support from load balancers to databases.
  • Support internal services such as configuration management, monitoring, load testing, and deployment/continuous integration workflows.
  • Perform cloud-based migrations with (near-)zero downtime.
  • Analyze, assess and remediate infrastructure performance bottlenecks.
  • Research and utilize new technologies and processes to enhance workflows.
  • Contribute to the continual improvement of information security.
  • Trigger and audit vulnerability scans, conduct data/log forensics and analysis to detect security incidents.
  • Remediate security concerns raised by the above activities, informed by consultation with relevant engineering teams.
  • Troubleshoot issues and outages across the entire technology stack.
  • Participate in a 24/7 on-call rotation.

Position Requirements:

  • Experience with Linux server configuration, administration and monitoring
  • Exposure to automation for streamlining repetitive tasks for increased efficiency in execution
  • Knowledge of cloud-based infrastructure and operations services (e.g. New Relic, Rollbar, PagerDuty, Slack)
  • Capable of isolating root cause for issues based on known data flows and service integrations
  • Ability to manage multiple priorities in a fast-paced environment
  • Experience supporting enterprise IT environments
  • Detail-oriented in following procedures and documenting steps taken to resolve issues
  • Amenable to working all hours as needed to provide service coverage (NOC operations intended to provide 24/7 support)
  • Experience with AWS and related orchestration tools.
  • Experience with MySQL, Postgres, Redis and/or Elasticsearch administration.
  • Strong experience with Docker for packaging and deployment workflows.
  • Strong experience with automation for streamlining repetitive tasks for increased efficiency.
  • A strong familiarity with the principles and theories of distributed and high-performance systems.
  • Ability to reason about performance, security, and user interactions in complex systems.
  • Exposure to HashiCorp tools (Packer, Consul, Terraform, Vault, etc.).
  • Professional verbal and written communication skills.
  • Ability to recommend procedural changes and defend those recommendations logically.
  • Self-motivated in acquiring knowledge and skills intended to enhance effectiveness in their role.

5 years of directly related experience in:

  • Proficiency in one or more programming languages, along with operations experience.
  • AWS and related orchestration tools.
  • Linux server administration and configuration.
  • Monitoring alert management
  • Customer issue resolution and support

> Experience working with GitHub / version control
> Must have experience with CI/CD tools like Travis, Jenkins.
> Experience with / exposure to Kubernetes

Key Skills
Kubernetes, monitoring, GitHub, Linux, Docker, shell scripting, AWS, NOC
