Job Description
            
                Who You Are Sr. Network Architect with HPC Experience- Hybrid on site Santa Clara, CA HPC Network architect: Lead the architecture, design, and deployment of global-scale DCs interconnects and fabric for HPC, AI, and GPU computing clusters.
Develop high-performance data center fabric using InfiniBand, Ultra Ethernet, and related technologies.
Optimize carrier interconnects, intra and inter DC routing, and dark fiber deployments to ensure low latency and high reliability.
Partner with system, OS, GPU, and HPC teams to deliver scalable, highly available networks for extreme-performance workloads.
Expert in networking technologies: InfiniBand, Ultra Ethernet, ROCEv2, DCQCN, TCP/UDP, IPv4/IPv6, BGP/MP-BGP, VPN, L2 switching, EVPN, VxLAN, Segment Routing, MPLS.
Experience automating network infrastructure
Master's preferred
10+ Years
 Sr. Network Automation Engineer - Hybrid on site 2 days/week Santa Clara, CA About the Role We’re seeking a hands-on Infrastructure Systems Developer who thrives in building full-stack systems with a focus on network automation.
This is not your traditional network engineering role — we’re looking for someone who comes from the DevOps or systems development world and has ventured into networking by building tools, platforms, and automation frameworks that interact with network infrastructure.
You will own the architecture and development of a full-stack system that ingests, stores, and acts on network telemetry and configuration data — from backend frameworks to frontend UI to device interaction and automation.
Key Responsibilities Design and architect an end-to-end automation system for network configuration and telemetry.
Choose and implement the right technologies: Database: SQL, NoSQL, or MDM SQL solutions.
Backend: Python (FastAPI, Flask), Go, or similar.
Frontend: React, Vue, or modern JS framework.
Build integrations with CI/CD pipelines (e.g., Jenkins).
Implement configuration management and telemetry collection using Ansible, SaltStack, or similar tools.
Create APIs and services to interface with network devices.
Ensure scalable data storage and retrieval for network metadata and telemetry.
Collaborate with network engineers, DevOps, and security teams.
Requirements Strong programming skills in Python, Go, or equivalent languages.
Experience designing and building production-level infrastructure systems.
Deep understanding of system architecture and software lifecycle.
Familiarity with network automation concepts and tools, even if not a traditional network engineer.
Hands-on experience with: CI/CD: Jenkins or equivalent.
Config Management: Ansible, Salt.
Database systems: SQL and NoSQL (MongoDB, PostgreSQL, etc.).
Frontend frameworks: React, Vue, or similar.
Comfort working across the full stack and owning the entire lifecycle of a system.
Nice to Have Exposure to network protocols and device-level APIs (e.g., NETCONF, RESTCONF).
Experience in telemetry collection, parsing, and visualization.
Contributions to open-source DevOps or automation tools.
Experience with MDM/metadata modeling.