Job Description
            
Our technology has no boundaries! NVIDIA is building the most modern and groundbreaking compute platforms globally for widespread use.
It’s because of our work that scientists, researchers and engineers can advance their ideas.
At its core, our visual computing technology not only enables an amazing computing experience, but it is also energy efficient! We pioneered a supercharged form of computing loved by the most demanding computer users in the world - scientists, designers, artists, and gamers.
It’s not just technology, though! It is our people, some of the brightest in the world, and our diverse company culture that make NVIDIA one of the most fun, innovative and dynamic places to work in the world! At the center of NVIDIA's culture are our core values, like innovation, excellence, determination and team, which guide us to be the best we can be.
  
  
We are looking for a Senior Network Validation Engineer to lead and contribute hands-on to network validation activities in the Datacenter Systems Engineering team.
You'll work closely with solutions, Network & Storage architects, HW system engineers, validation engineers, OEM/ODMs, and AE teams to ensure product validation and test coverage are optimal for Data Center scale AI products.
The ideal candidate is self-motivated, works well with different teams, is very comfortable in a lab environment and demonstrates a passion for product-level validation.
They should have strong debugging and analysis fundamentals as well as automation and scripting experience.
They must be capable of thriving in a fast-paced environment with evolving product definitions.
  
  
What you’ll be doing:
+ Design validation plans ranging from bare-metal tests to at-scale data center integration tests.
+ Debug, triage issues, perform root cause analysis, verify fixes, define new tests, and improve product test plans.
+ Configure, administer, troubleshoot, and oversee the qualification of Ethernet and InfiniBand networks in large-scale datacenter environments.
+ Perform server function and network validation, including Ethernet and InfiniBand protocol tests, system-level reliability tests, and end-to-end application tests.
+ Design, develop, and maintain automation frameworks and test automation suites, including automated reporting, while consistently increasing end-to-end automation coverage with each release cycle.
+ Track and coordinate all validation activities from bring up to production release.
+ Collaborate with multi-functional teams, including application teams, HW designers, networking, FW, and security, to debug any HW/SW product issues.
+ Provide inputs to architecture teams for next generation Data Center networking design.
  
  
  
What we need to see:
+ M.S. degree in Engineering/Computer Science/related field (or equivalent experience).
+ 10+ years of experience.
+ Over 5 years of proven experience in Software Quality Engineering and Network Testing, including significant contributions to QA strategies and test documentation.
+ Strong skills in Python (preferred) or other scripting languages such as Perl or Shell, and hands-on experience with Jenkins or similar CI/CD-based pipelines
+ Strong technical abilities, problem solving, designing, coding and debugging skills
+ Extensive hands-on experience in configuring and troubleshooting data center networking, including Layer 2/Layer 3 protocols such as VLAN, BGP, and EVPN, and spine-leaf topologies; InfiniBand network experience desired.
+ Experience with using test tools from Ixia or Spirent and working experience in test management
+ Hands-on experience working on Unix- or Linux-based operating systems
+ Great team player with multi-tasking ability and good interpersonal & documentation skills
+ Solid foundation in and understanding of software engineering practices
+ Excellent design, debugging and problem-solving skills, with a strong bias for action, quality and engineering excellence.
  
  
  
Ways to stand out from the crowd:
+ CCIE certification (Routing & Switching / Service Provider / Data Center).
+ Demonstrated experience with RDMA (Remote Direct Memory Access) technologies and related protocols such as InfiniBand or RoCE.
+ Knowledge of or experience with AI Data Center validation with GPU clusters.
+ Experience with REST APIs and Kubernetes, and a background in network automation tools like Ansible, Jenkins and Robot Framework.
+ Experience with IPv6 and telemetry at Data Center scale, using observability tools like Grafana and Prometheus, preferred.
  
  
  
NVIDIA is widely considered one of the technology world’s most desirable employers.
We employ some of the most forward-thinking and talented people in the world.
Are you passionate about joining our life's work to amplify human imagination and intelligence?
If you are creative, collaborative, and have a passion for creating custom silicon solutions that power forward-looking computing systems, we want to hear from you.
  
  
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.
The base salary range is 160,000 USD - 253,000 USD.

You will also be eligible for equity and benefits (https://www.nvidia.com/en-us/benefits/).
  
  
  
  
  
Applications for this job will be accepted at least until October 18, 2025.
  
  
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer.
As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.