AA Cloud is hiring a Site Reliabilty. In this role, you will work closely with a talented team of dynamic and passionate architects and engineers to deliver automated infrastructure and cloud solutions to AA Cloud customers. You will be responsible for leading AA Cloud customers through the process of transforming from a legacy operations model to a true SRE engineering model. In this role, you will work closely with AA Cloud customer executives, architects and engineers to architect and automate the infrastructure and application deployment pipeline to engineer scalable sites.
We are obsessed with adding value and providing unprecedented levels of customer service, so you should be as well!
Responsibilities
Communicate and work with CxO and Executive Level customers on DevOps transformation strategies around people, process and technology
Work closely with DevOps and Infrastructure Architects to design, implement and manage secure, scalable and reliable cloud infrastructure environments for AA Cloud customers
Assist AA Cloud customers in selecting, implementing, and tuning configuration management (CM) and continuous integration (CI) technology platforms
Lead AA Cloud teams to deliver CM, CI/CD, and DevOps consulting engagements with mid and large sized enterprise customers
Assist AA Cloud customers in ensuring operational readiness for launching workloads into public, private, and hybrid cloud environments
Work with AA Cloud customers to validate and troubleshoot existing CI and CM practices, and make recommendations for improvements and optimization
Work with AA Cloud customers to validate existing infrastructure performance, and make and recommendations for improvements and optimization
Implement infrastructure best practices for AA Cloud customers in areas such as CI, CM, performance, scalability, security, and availability
Qualifications
Excellent verbal and written communication and presentation skills suitable for CxO and Executive level stakeholder and strategy meetings
Ability to facilitate the process of gathering requirements and providing alternative DevOps architecture options for a variety of environments including multiple flavors of IaaS and private cloud environments
Keen focus on customer service and an understanding that technology solutions must provide business value
10+ years of experience managing large scale, production Linux environments
5+ years of development engineering and DevOps team leadership experience
5+ years of experience with common application stacks (e.g. Apache, Nginx, Tomcat, Rails, NodeJS, PHP)
5+ years of experience with configuration management and continuous integration tools and concepts (e.g. Chef, Ansible, Docker, Puppet, Jenkins, Travis CI, Hudson)
Experience with relational and NoSQL database and related technologies (e.g. MySQL, Redis, Memcached, MongoDB)
Strong working knowledge of Amazon Web Services, Microsoft Azure, and/or Google Cloud Platform
Experience with networking and security concepts and technologies in a production environment, preferably production AWS environment
Experience with log collection tools and analysis, as well as infrastructure performance monitoring tools and optimization practices