Site Reliability Engineer - $120k+ - Uptown Charlotte

Vaco Technology Uptown Charlotte Full-Time

The Site Reliability Engineer (SRE) is responsible for all non-production environments. The ideal candidate should have hands-on experience learning, triaging (both proactively and reactively), and documenting application stacks, as well as using monitoring tools (Splunk, AppDynamics, UI-session replay, Sentry, and/or others), and should have expert-level proficiency in at least one area such as content delivery, application development (Java, JavaScript), networking, or infrastructure. They should understand how web traffic moves through all layers of infrastructure, including F5 load balancers and firewalls.

The SRE will partner with application development and API teams to gain an understanding of the application stacks, triage environment issues, design monitoring methods, and provide reporting to executive leadership. The SRE will also lead a small tactical team that serves as the single point of contact for our Agile development and product teams regarding all non-production environment issues.

Job Responsibilities

* Partner with the Agile development teams to learn and assume responsibility for documentation, logging, and monitoring for various systems

* Partner with DevOps on CI/CD improvements using Bitbucket, Jenkins, & OpenShift

* Implement monitoring on various online applications using solutions such as Splunk, UI-session replay, and AppDynamics, and determine the right toolset to accomplish monitoring goals on net-new application stacks

* Apply strong knowledge of custom alerts and integrate data housed in disparate data sources to create workflow-driven alerting

* Understand the administration of application servers such as Node.js, NGINX, JBoss, Apache, and Spring Boot

* Continuously tune and validate the quality of current tools for network and system monitoring, UI-session replay, and log file parsing, and implement a toolkit that works

* Assist in vulnerability scanning and root cause analysis (RCA) proposals for defects in Scrum team backlogs

* Participate in routine Agile and Scrum ceremonies


* Must have expert-level knowledge of:

o Content Delivery Networks (CDN)

o NGINX

o Customer Facing Web Applications (Full Stack)

* Must have direct experience with:

o Leading Triages

o Monitoring tools (Splunk, AppDynamics, and/or others)

o SQL, Linux, Scripting, file manipulation, reporting and Visio

o Big data elements like server logs, user URLs, etc.

o CI/CD tools such as Bitbucket, Jenkins, & OpenShift

* Ability to communicate effectively to various levels of Sr. Management -- Technology and Business

* Experience and capability to lead small teams

* Ability to work off-hours and/or weekends as needed

Additional Desired Knowledge & Skills:

* Experience with complex multi-system environments

* Working knowledge of Agile methodologies (Scrum, Kanban, Lean, XP)

* Experience supporting hybrid server environments (on-premise, AWS, Azure, etc.)

* Good understanding of financial industry operations metrics and reporting practices is a plus

* Passion, positive attitude, engagement and desire to take over challenging assignments as part of a team to make things WORK

1. What does the term DevOps mean to you? Give an example of how you have used its concepts in the past.

2. What does the term Site Reliability Engineering (SRE) mean to you? Give an example of how you have used its concepts in the past.

3. Give an example of how you worked across different teams in your previous organizations, such as product owners, development, or architecture teams, to solve an issue.

4. Give an example of how you automated a previously manual task, workflow, or business function. What impact did that have on the business?


Job ID: SITER73052



Vaco Technology is a customer-focused, premier provider of IT employment solutions. Our strength is our ability to match talented and experienced information technology professionals to the unique business needs of the client. Our team-based approach allows us to combine our experiences in the IT industry to develop effective, customized solutions quickly and efficiently. Since we aim to build long-term relationships, we are always striving to provide higher quality service and produce better results.

Vaco Technology is a division of Vaco, LLC. Our sister divisions, Vaco Resources, Vaco Financial, and Continuum Search, provide highly skilled, seasoned professionals for finance, accounting, and executive search needs.
