DevOps Manager

Location: Pittsburgh

Job Type: Full Time / Permanent

The DevOps Manager is responsible for personnel management, including work assignment, mentorship, and evaluation. They will also help expand the team when the time comes.

Our infrastructure stack consists of AWS (ECS, EKS, EC2, RDS, ElastiCache, Fargate), Docker, and Cloudflare. Our build pipelines are in Atlassian Bitbucket. For production insight, we use AWS services such as CloudWatch, as well as third-party services such as Sumo Logic for log aggregation and Datadog for application monitoring. We also rely on Cloudflare as a WAF and for caching.

The biggest challenges this role should solve are:
• Central ownership and accountability for work in progress and the planned roadmap. We’ve gotten by doing what needs done, when it needs done. Looking forward, we need a more structured approach to how we manage and measure our systems.
• Bridging the gap between application-specific understanding and general “infrastructure” understanding. We need help determining what the DevOps group should be aware of, and how they gain this knowledge. Or perhaps it’s enough for DevOps to govern the IaC produced by the development teams? How do we move towards more developer-produced IaC?
• Switching from a reactive mindset to a proactive one. We need to project the DevOps plan into the future to align with the needs of the business instead of only focusing on what’s immediately in front of us.

The Right Fit: You are fascinated by the Cloud and are hungry to learn as much as you can. There are always new technologies and patterns, and you’re on the lookout for what we can use to make our infrastructure better. You have great organizational skills. Seeing well-executed processes for incident management, production support, and change requests is a great reward. You can look holistically at a situation and assess the shape of the problem and how to satisfy future needs, without tripping over the immediate tactical needs of the team. You can speak well at both altitudes: you know the right way to frame things when talking high-level with a non-technical audience, and you can also dive deep into the nuts and bolts of the infrastructure. You appreciate data. Whether it’s the throughput of your team or the performance of your application, you know your numbers and, more importantly, the bottlenecks that the numbers reveal.

Responsibilities:
• Manage the team responsible for the AWS Cloud Infrastructure for 3 eCommerce applications and all supporting services
• Support and advise on fulfillment center infrastructure (Linux, Docker, RabbitMQ)
• Own the DevOps roadmap, defining the scope of projects and the resources required for them
• Lead a team of DevOps and Site Reliability Engineers, including day-to-day work, performance evaluations, and mentorship
• Work closely with the security team to identify and remediate findings during PCI audits
• Implement and manage on- and off-hours production support rotations
• Hire and grow the DevOps team as the business and technology footprint expands
• Manage production incidents, postmortems, and Root Cause Analysis
• Assign and monitor system metrics, including uptime and outage response time
• Manage overlap between Ops and Developers
• Perform hands-on infrastructure work; all of our managers contribute to the activities of their teams

Qualifications:
• 6+ years in Cloud infrastructure
• 2+ years managing a team

Preferred Qualifications:
• Experience building a DevOps team
• Demonstrated use of infrastructure or application metrics to improve the operations of applications
• An evangelist of Infrastructure as Code
• Experience organizing a production support rotation
• Experience with incident tracking systems, like VictorOps

Equal opportunity employer. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential job functions. Company does not sponsor individuals for the purpose of obtaining H-1 Visas.