We are seeking a highly skilled and experienced Lead Software Engineer to join our dynamic infrastructure team. The ideal candidate will be a hands-on expert in managing Linux environments, strongly emphasizing cloud infrastructure automation, particularly within the Google Cloud Platform (GCP). This role requires deep knowledge of networking services, email protocols, web proxy solutions, and a proven ability to leverage Infrastructure as Code (IaC) principles using Terraform and manage code versions with GitHub. A cloud certification is essential for this position.
Responsibilities:
DevOps Engineering:
- Design, implement, and manage Infrastructure as Code (IaC) using Terraform across multi-cloud environments (e.g., AWS, Azure, GCP).
Want more jobs like this?
Get jobs delivered to your inbox every week.
- Collaborate with development and operations teams to define infrastructure requirements and ensure scalable, secure, and reliable cloud solutions.
- Implement blue-green deployments and zero-downtime release strategies.
- Manage and version control infrastructure configurations and scripts using Git and GitHub.
- Collaborate with development and operations teams to build and improve CI/CD pipelines.
- Participate in code reviews and contribute to improving IaC standards and practices.
- Leverage DNS routing strategies to enable zero-downtime deployments and fast rollback capabilities.
Linux System Administration:
- Install, configure, maintain, and troubleshoot Linux servers (e.g., RHEL, CentOS, Ubuntu) in a production environment.
- Perform system monitoring, performance tuning, and capacity planning.
- Implement and maintain system security best practices, including firewall (iptables) configurations and SELinux.
- Automate routine tasks using shell scripting (Bash, Python).
- Oversee logical volume management (LVM) and file system administration.
- Configure and administer SMTP services (e.g., Postfix, Sendmail) for reliable email delivery and reception.
- Manage and troubleshoot internal DNS (Domain Name System) infrastructure, including zone file management and recursive/authoritative DNS servers.
- Implement and maintain automation for cloud resource provisioning, configuration, and deployment.
- Monitor and optimize GCP resource utilization and costs.
Security & Compliance:
- Ensure compliance with security standards and policies across Linux and GCP environments.
- Implement and maintain robust backup and disaster recovery strategies.
- Participate in security audits and vulnerability assessments.
Documentation & Support:
- Create and maintain comprehensive technical documentation for systems, configurations, and procedures.
- Provide support and troubleshoot issues, responding to incidents and resolving problems efficiently.
- Participate in an on-call rotation as required.
Basic Qualifications:
- 5+ years of experience as a Linux System Administrator.
- GCP Certified (Associate Cloud Engineer or Professional Cloud Architect preferred).
- 5+ years of experience with Terraform for infrastructure provisioning and management on GCP.
- Proficiency with Git and GitHub for source code management and collaboration.
- Bachelor's degree in computer science, Information Technology, or a related field, or equivalent practical experience.
Preferred Qualifications:
- Strong expertise in SMTP configuration and troubleshooting.
- In-depth knowledge of internal DNS administration and best practices.
- Proven experience with Web Proxy solutions (e.g., Nginx as a reverse proxy, Squid).
- Experience with scripting languages (Bash, Python).
- Familiarity with monitoring tools (e.g., DataDog, Grafana).