We are seeking an Infrastructure Architect and Deployment Specialist, responsible for planning, designing, and implementing robust infrastructure setups for both staging and production environments. The ideal candidate will have a strong understanding of infrastructure requirements, networking, security, deployment automation, and AI development environments.
Key Responsibilities:
1. Infrastructure Planning and Design:
- Assess requirements for compute, storage, network, and security.
- Design infrastructure layout ensuring high availability and disaster recovery.
- Select suitable data center providers meeting operational needs and compliance standards.
- Provide on-prem managed hosting services with adherence to SLAs.
2. Network Setup:
- Configure internal networking, subnets, firewalls, and routing.
- Establish robust external connectivity for API and user access.
- Define connectivity to required cloud and third-party services (e.g., Google Tesseract, OpenAI).
3. Server and Hardware Setup:
- Provision servers according to specified requirements.
- Configure storage solutions optimized for performance and cost.
- Set up necessary databases and backup systems.
4. Security Implementation:
- Install firewalls and IDS/IPS for network protection.
- Implement data encryption at rest and in transit.
- Ensure compliance with industry standards and data protection laws.
5. Software and Platform Setup:
- Install and configure operating systems, build & release agents, and application pools.
- Configure HTTPS (TLS certificates) and set up container orchestration with Kubernetes.
- Install and configure databases ensuring replication, backups, and failover systems.
6. Deployment Automation:
- Set up CI/CD pipelines using GitHub Actions for automated testing and deployment.
- Define a Git branching policy and enforce a pull request approval process.
- Implement Infrastructure as Code (IaC) using tools like Terraform or Ansible.
7. Monitoring and Maintenance:
- Implement monitoring tools and centralized logging solutions.
- Set up automated alerts for critical system metrics and events.
8. API and Microservices Deployment:
- Configure API gateway for effective management and authentication.
- Deploy and configure microservices, integrating with APIs and frontend.
9. Prepare Environment for AI Development:
- Set up servers/environments for AI and LLM development.
- Ensure secure and efficient data access for AI training.
- Manage Python code hosting and integration with the application.
10. Documentation and Training:
- Create detailed documentation of architecture and setup procedures.
- Provide training to operational teams on system management and troubleshooting.
Server Requirements:
Production (7 Servers):
- (2) Web Servers
- (2) App Servers
- (1) Vector DB Server
- (1) Local LLM Server
- (1) Postgres DB Server
Staging (5 Servers):
- (1) Web Server
- (1) App Server
- (1) Vector DB Server
- (1) Local LLM Server
- (1) Postgres DB Server
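For candidates who prefer to see the footprint as data, the server counts above can be sketched as a small inventory. This is only an illustration: the dictionary keys and the `total_servers` helper are hypothetical names chosen for this sketch, not part of any existing tooling.

```python
# Hypothetical inventory mirroring the server counts listed above.
# Role names (web, app, vector_db, ...) are illustrative labels only.
SERVER_REQUIREMENTS = {
    "production": {
        "web": 2,
        "app": 2,
        "vector_db": 1,
        "local_llm": 1,
        "postgres_db": 1,
    },
    "staging": {
        "web": 1,
        "app": 1,
        "vector_db": 1,
        "local_llm": 1,
        "postgres_db": 1,
    },
}


def total_servers(environment: str) -> int:
    """Return the number of servers planned for one environment."""
    return sum(SERVER_REQUIREMENTS[environment].values())


if __name__ == "__main__":
    # Print one summary line per environment.
    for env in SERVER_REQUIREMENTS:
        print(f"{env}: {total_servers(env)} servers")
```

A structure like this could later feed an Infrastructure as Code definition, so that the per-environment totals stay in sync with what is actually provisioned.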
Qualifications:
- Bachelor’s degree in Computer Science, Information Technology, or related field.
- Strong experience in infrastructure planning, deployment, and automation.
- Proficiency in networking, security, and deployment tools.
- Experience with AI development environments and microservices architecture.
- Excellent documentation and communication skills.
If you’re passionate about designing and implementing scalable and secure infrastructure setups, and you thrive in a dynamic environment, we’d love to hear from you! 😎💪