Site Reliability Engineer

Job Description
AA Cloud is hiring a Site Reliabilty. In this role, you will work closely with a talented team of dynamic and passionate architects and engineers to deliver automated infrastructure and cloud solutions to AA Cloud customers. You will be responsible for leading AA Cloud customers through the process of transforming from a legacy operations model to a true SRE engineering model. In this role, you will work closely with AA Cloud customer executives, architects and engineers to architect and automate the infrastructure and application deployment pipeline to engineer scalable sites.
We are obsessed with adding value and providing unprecedented levels of customer service, so you should be as well!
Responsibilities
  • Communicate and work with CxO and Executive Level customers on DevOps transformation strategies around people, process and technology
  • Work closely with DevOps and Infrastructure Architects to design, implement and manage secure, scalable and reliable cloud infrastructure environments for AA Cloud customers
  • Assist AA Cloud customers in selecting, implementing, and tuning configuration management (CM) and continuous integration (CI) technology platforms
  • Lead AA Cloud teams to deliver CM, CI/CD, and DevOps consulting engagements with mid and large sized enterprise customers
  • Assist AA Cloud customers in ensuring operational readiness for launching workloads into public, private, and hybrid cloud environments
  • Work with AA Cloud customers to validate and troubleshoot existing CI and CM practices, and make recommendations for improvements and optimization
  • Work with AA Cloud customers to validate existing infrastructure performance, and make and recommendations for improvements and optimization
  • Implement infrastructure best practices for AA Cloud customers in areas such as CI, CM, performance, scalability, security, and availability
Qualifications
  • Excellent verbal and written communication and presentation skills suitable for CxO and Executive level stakeholder and strategy meetings
  • Ability to facilitate the process of gathering requirements and providing alternative DevOps architecture options for a variety of environments including multiple flavors of IaaS and private cloud environments
  • Keen focus on customer service and an understanding that technology solutions must provide business value
  • 10+ years of experience managing large scale, production Linux environments
  • 5+ years of development engineering and DevOps team leadership experience
  • 5+ years of experience with common application stacks (e.g. Apache, Nginx, Tomcat, Rails, NodeJS, PHP)
  • 5+ years of experience with configuration management and continuous integration tools and concepts (e.g. Chef, Ansible, Docker, Puppet, Jenkins, Travis CI, Hudson)
  • Experience with relational and NoSQL database and related technologies (e.g. MySQL, Redis, Memcached, MongoDB)
  • Strong working knowledge of Amazon Web Services, Microsoft Azure, and/or Google Cloud Platform
  • Experience with networking and security concepts and technologies in a production environment, preferably production AWS environment
  • Experience with log collection tools and analysis, as well as infrastructure performance monitoring tools and optimization practices