The Principal AWS Cloud Engineer leads and manages a team of cloud and DevOps engineers, fostering collaboration and innovation to deliver complex cloud solutions. This role involves coordinating with various stakeholders to plan, design, develop, test, and maintain cloud-based applications and infrastructure. The engineer provides technical leadership in AWS architecture, ensures the scalability, security, and cost-efficiency of cloud systems, and maintains high standards of quality.
Key responsibilities include overseeing the implementation of new systems, ensuring timely project delivery, and continually updating technical knowledge. The role also involves maintaining detailed documentation, understanding the release process, and performing system maintenance.
- Leading and managing a team of AWS cloud engineers, fostering a collaborative and innovative environment to deliver complex cloud solutions.
- Coordinating with scrum masters, project managers, and other engineers to plan, design, develop, test, and maintain cloud-based applications and infrastructure.
- Providing technical leadership in all aspects of AWS cloud architecture and solutions, ensuring best practices and efficient use of AWS services.
- Overseeing the design, development, and implementation of new cloud systems and enhancements to existing systems, ensuring scalability, security, and cost-efficiency.
- Ensuring all cloud engineering projects are delivered on time and meet the agreed-upon quality standards and performance metrics.
- Continually updating technical knowledge and skills, particularly in AWS services and cloud technologies, and sharing them with the team.
- Maintaining high standards of cloud solution quality within the team by establishing and promoting good practices and habits.
- Providing regular performance feedback, coaching, and career development opportunities to team members to support their professional growth.
- Facilitating root cause analysis of cloud system issues, implementing resolution strategies, and ensuring proactive measures are taken to prevent future occurrences.
- Developing and maintaining detailed, comprehensive documentation and diagrams.
- Understanding the release process and overall architecture of our systems to ensure smooth deployment and integration of new features and updates.
- Performing system maintenance procedures, including during non-business hours and on weekends.
- Participating in on-call coverage to support systems outside of normal business hours.
- Ensuring high availability of internal and production systems.
- Ensuring that company and business unit standards for server technologies (Linux and Windows), performance, optimization, and capacity are implemented.