This is a home-based position but you would be required to visit the office once every few weeks for team meetings etc.
Key Responsibilities and Accountabilities:
- Design, implement, and maintain the organization's and customers infrastructure to meet performance and reliability requirements.
- Deploy servers and services using industry best practices and automation tools.
- Evaluate and recommend new technologies and solutions to enhance the infrastructure.
- Set up and configure monitoring systems to proactively identify and address potential issues for all customers.
- Analyze system performance data to identify areas for improvement and optimization.
- Work closely with support teams to investigate and resolve complex infrastructure-related incidents.
- Define and implement alerting mechanisms to ensure timely response to incidents.
- Create and maintain comprehensive documentation for infrastructure components, configurations, and operational procedures.
- Develop and maintain runbooks for common troubleshooting scenarios.
- Ensure documentation is up-to-date and accessible to relevant team members.
- Implement and maintain failover mechanisms to ensure continuous availability of critical systems.
- Collaborate with cross-functional teams to design and execute disaster recovery plans.
- Conduct regular testing of failover and disaster recovery processes.
- Manage and coordinate system patching and updates to ensure security and performance.
- Develop and execute strategies for minimizing downtime during maintenance activities.
- Stay informed about the latest security vulnerabilities and apply patches promptly.
- Monitor resource utilization and plan for capacity expansion as needed.
- Collaborate with stakeholders to forecast infrastructure needs and propose scalable solutions.
- Implement and maintain tools for capacity planning and performance monitoring.
The ideal candidate will play a crucial role in designing, deploying, and maintaining our infrastructure, ensuring optimal performance, reliability, and security. The Infrastructure Engineer will work collaboratively with cross-functional teams to implement and support a robust technology foundation. If you have a passion for building and maintaining scalable infrastructure, along with expertise in monitoring, deploying servers, and implementing failover mechanisms, we want to hear from you.
Core Competencies
- Proven experience as an Infrastructure Engineer or similar role.
- Strong knowledge of server deployment, configuration, and maintenance.
- Expertise in implementing and managing monitoring and alerting systems.
- Familiarity with automation tools for infrastructure deployment.
- Experience in designing and implementing failover mechanisms for high availability.
- Proficient in system patching and updating processes.
- Excellent documentation skills.
- Strong problem-solving and troubleshooting abilities.
- Bachelor's degree in Computer Science, Information Technology, or a related field, or equivalent work experience.
- Ability to interpret written requirements and technical specification documents