  • Senior GPU Cluster

    NVIDIA (Santa Clara, CA)
    …working with distributed system software architecture + Basic understanding of HPC GPU cluster, Slurm + Basic understanding of machine learning concepts and ... experience for customer as well as engineers supporting the cluster. Much of our software development focuses ... running and instrumenting distributed LLM training on a multi-GPU HPC cluster + Knowledge of LLM…
    NVIDIA (08/13/24)
  • Senior High Performance Computing…

    NVIDIA (Santa Clara, CA)
    …for a deeply technical HPC cluster administrator to lead a diverse cluster of GPU-accelerated systems and provide architectural mentorship to product teams ... team, you will provide leadership in the design and implementation of groundbreaking GPU compute cluster that runs demanding deep learning, high performance…
    NVIDIA (06/26/24)
  • Senior Software Test Development…

    NVIDIA (Santa Clara, CA)
    We are looking for a highly experienced AI Senior Software Test development engineer in NVIDIA's Deep Learning SWQA team. The position is in NVIDIA Deep Learning ... to validate robustness and measure the performance of NVIDIA's Deep Learning software and GPU Infrastructure for autonomous driving, healthcare, speech…
    NVIDIA (09/06/24)
  • Senior Software Test Development…

    NVIDIA (Santa Clara, CA)
    …to validate robustness and measure the performance of NVIDIA's Deep Learning software and GPU Infrastructure for autonomous driving, healthcare, speech ... We are looking for a Software Test development engineer in NVIDIA's Deep Learning ... improve test automation. + Experience in validating Data Center GPU based infrastructure (multi-GPUs, multi-nodes, cluster). +…
    NVIDIA (09/05/24)
  • Senior Software Engineer, Server…

    NVIDIA (Santa Clara, CA)
    NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More ... recently, GPU deep learning ignited modern deep learning - the ... by doing failure analysis for whole system and architecting software and firmware to be fault resilient. You will…
    NVIDIA (08/22/24)
  • Senior Software Architect - Data…

    NVIDIA (Santa Clara, CA)
    software and firmware stack for these systems. We are looking for a Senior Software Architect who has deep expertise in designing server platforms and has ... We are building innovative server systems for GPU accelerated applications, such as Deep Learning. Data ... customers. What you'll be doing: + You will lead software activities for NVIDIA's deep learning server platforms, from…
    NVIDIA (07/16/24)
  • Senior DevOps Engineer - DGX Cloud

    NVIDIA (Santa Clara, CA)
    …be used for a variety of AI workloads. This includes working on custom software related to GPU asset provisioning, configuration, and lifecycle management across ... deployments and toil elimination. We view DevOps as a software engineering discipline and expect significant contributions to our ... You will be harnessing multiple data streams, ranging from GPU hardware diagnostics to cluster and network…
    NVIDIA (08/29/24)
  • Senior Software QA Test Development…

    NVIDIA (Santa Clara, CA)
    NVIDIA is the world leader in GPU Computing. We are passionate about markets include gaming, automotive, vision, HPC, datacenters and networking in addition to our ... Computing Company', and NVIDIA GPUs are the brains powering Deep Learning software frameworks, analytics, data centers, and driving autonomous vehicles. We have some…
    NVIDIA (09/05/24)
  • Senior Site Reliability Engineer - Internal…

    NVIDIA (Santa Clara, CA)
    …DPUs NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern ... computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next ... fixing problems before they occur + Building automation for Cluster bring up and scaled up operation. + Improving…
    NVIDIA (09/12/24)
  • Senior Cloud Services Software

    NVIDIA (Santa Clara, CA)
    …seeking a distributed software engineer to join our team! As a Senior engineer, you'll be instrumental in developing and optimizing AI infrastructure services to ... resiliency for DGX Cloud. Your expertise in cloud services software architecture that drives the full resilience stack that ... that allows the framework to be integrated with the cluster scheduler visibly to the users + Strong understanding…
    NVIDIA (09/18/24)