Westlake, Texas (posted 19 days ago)
Skills and Knowledge:
Candidate must also possess:
- Demonstrated Expertise (DE) performing DevOps and Site Reliability Engineering (SRE) by managing Continuous Integration/Continuous Delivery (CI/CD) pipelines; automating build, test, and deployment processes, using GitHub, Jenkins, Maven, SonarQube, Artifactory, and Kubernetes; supporting cloud infrastructure provisioning and configuration, using Amazon Web Services (AWS), Azure, Terraform, Ansible, and CloudFormation; and implementing monitoring and logging solutions for performance and availability, using Datadog, Prometheus, Grafana, Splunk, and CloudWatch.
- DE supporting Java-based applications and Elastic Kubernetes Service (EKS) infrastructure by analyzing and debugging Java thread and heap dumps to resolve performance bottlenecks, using JConsole, VisualVM, and Eclipse MAT; automating operational tasks, using Python, shell scripting, and SQL optimization; and ensuring high application reliability and continuity through proactive monitoring, incident triage, and problem management, using ServiceNow, Salesforce, and Jira.