Enterprise Architect (42496)
Take the next step in your career as a Enterprise Architect working on complex AI and cloud platforms. You will design, implement and maintain network environments using InfiniBand, Ethernet and advanced routing technologies while ensuring high availability and performance. The role combines network operations, automation, security and collaboration across technical teams. We expect strong Linux networking knowledge, experience with scripting, CI/CD, Kubernetes and firewall management, as well as a Master’s degree in IT. If you enjoy working with modern infrastructure and driving innovation, this opportunity offers a dynamic and technically challenging environment.
🚀 Project
- coordinate operations across Data Center, IaaS and PaaS layers including network lifecycle activities such as installs, upgrades, changes and firmware updates
- manage network interconnections and maintain related documentation
- provision and maintain InfiniBand switches in line with ITIL standards
- develop and maintain automation scripts and perform configuration changes across the project lifecycle
- maintain network environments including patching and firmware upgrades at scale
- follow and improve ITIL processes including incident, problem and change management and adhere to ZERO Outage guidelines
- collaborate with Platform Engineers and AI solution teams to ensure smooth deployments and operations
- manage high-speed network fabric using InfiniBand and Ethernet / RoCE technologies
- design, develop, test and support ICT components and applications for AI Factory Cloud platform
- build concepts and methods for automation, optimization and standardization
- provide technical consulting, project deliverables and support innovation initiatives
- design and implement service architecture and lead technical activities across teams
- mentor team members and contribute to research and development activities
🎯 Skills
- master’s degree in Information Technologies
- hands-on experience with network installation, maintenance and operations
- deep knowledge of InfiniBand, RoCE and low-latency high-throughput networking
- experience with NVIDIA/Mellanox switches and UFM management
- strong knowledge of data center routing and protocols including BGP and OSPF and Cisco or Juniper routers
- strong Linux networking skills including Cumulus OS, Ubuntu or Debian and configuration of bridges, VLANs and routing
- experience with network tools such as iperf, ethtool, nvidia-smi, perfquery and Mellanox/NVIDIA diagnostics
- experience with monitoring, incident detection and root cause analysis in large-scale environments
- knowledge of NOC/SOC operations and on-call models
- experience in firewall and security management including FortiGate administration, NAT, VPNs, IDS/IPS and HA
- experience with configuration and lifecycle management including provisioning, upgrades and patching
- knowledge of ITIL processes
- 5+ years of experience with IaaS, PaaS and SaaS systems
- 3+ years of experience with CI/CD pipelines and serverless architectures
- knowledge of AI Factory Cloud platform and NVIDIA GPU platforms
- knowledge of Kubernetes or similar container technologies
- experience with VMware environments
- scripting skills in Go, Python or Bash
- experience with automation tools such as Ansible, SaltStack, Terraform and Helm
- experience with Git and CI/CD tools such as GitHub or GitLab
- knowledge of Software-Defined Networking principles
- experience with monitoring tools such as Grafana and Prometheus
- advanced English level
💡 Nice to have
- german language