Talent.com
This job offer is not available in your country.
Deep Learning Solutions Architect – Inference Optimization

Deep Learning Solutions Architect – Inference Optimization

NVIDIAGeneral Trias, Cavite, Philippines
4 hours ago
Job description

NVIDIA’s Worldwide Field Operations (WWFO) team is seeking a Solution Architect with a deep understanding of neural network inference. As our customers adopt increasingly complex inference pipelines on state of the art infrastructure, there is a growing need for experts who can guide the integration of advanced inference techniques such as speculative decoding, request scheduler optimizations or FP4 quantization. The ideal candidate will be proficient using tools such as TRT LLM, vLLM, SGLang or similar, and have strong systems knowledge, enabling customers to fully use the capabilities of the new GB300 NVL72 systems (for example work on efficient KV cache offloading or help with inference of new architectures like hybrid or diffusion models, or architect the pre‑ and post‑processing pipelines).

Solutions Architects work with the most exciting computing hardware and software, driving the latest breakthroughs in artificial intelligence! We need individuals who can enable customer productivity and develop lasting relationships with our technology partners, making NVIDIA an integral part of end‑user solutions. We are looking for someone always passionate about artificial intelligence, someone who can maintain understanding of a fast paced field, someone able to coordinate efforts between corporate marketing, industry business development and engineering. Solutions Architects are the first line of technical expertise between NVIDIA and our customers. Your duties will vary from working on proof‑of‑concept demonstrations, to driving relationships with key executives and managers in order to promote adoption of NVIDIA based AI technology. Engaging with developers, scientific researchers, data scientists, IT managers and senior leaders is a significant part of the Solutions Architect role.

What you will be doing

  • Work directly with key customers to understand their technology and provide the best AI solutions.
  • Perform in‑depth analysis and optimization to ensure the best performance on GPU architecture systems (in particular Grace / ARM based systems). This includes support in optimization of large scale inference pipelines.
  • Partner with Engineering, Product and Sales teams to develop, plan best suitable solutions for customers. Enable development and growth of product features through customer feedback and proof‑of‑concept evaluations.

What we need to see

  • Excellent verbal, written communication, and technical presentation skills in English.
  • MS / PhD or equivalent experience in Computer Science, Data Science, Electrical / Computer Engineering, Physics, Mathematics, other Engineering fields.
  • 5+ years work or research experience with Python / C++ / other software development.
  • Work experience and knowledge of modern NLP including good understanding of transformer, state space, diffusion, MOE model architectures. This can include either expertise in training or optimization / compression / operation of DNNs.
  • Understanding of key libraries used for NLP / LLM training (such as Megatron‑LM, NeMo, DeepSpeed etc.) and / or deployment (e.g. TensorRT‑LLM, vLLM, Triton Inference Server).
  • Enthusiastic about collaborating with various teams and departments—such as Engineering, Product, Sales, and Marketing—this person thrives in dynamic environments and stays focused amid constant change.
  • Self‑starter with demeanor for growth, passion for continuous learning and sharing findings across the team.
  • Ways to Stand Out from The Crowd

  • Demonstrated experience in running and debugging large‑scale distributed deep learning training or inference processes.
  • Experience working with larger transformer‑based architectures for NLP, CV, ASR or other.
  • Applied NLP technology in production environments.
  • Proficient with DevOps tools including Docker, Kubernetes, and Singularity.
  • Understanding of HPC systems : data center design, high speed interconnect InfiniBand, Cluster Storage and Scheduling related design and / or management experience.
  • Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family

    NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

    #J-18808-Ljbffr

    Create a job alert for this search

    Solution Architect • General Trias, Cavite, Philippines

    Related jobs
    Senior AI Research Engineer, Model Inference (100% Remote)

    Senior AI Research Engineer, Model Inference (100% Remote)

    Tether Operations LimitedManila, 00, PH
    Join Tether and Shape the Future of Digital Finance.At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from ex...Show moreLast updated: 16 days ago
    • Promoted
    Principal Solutions Architect - Applied AI

    Principal Solutions Architect - Applied AI

    Searce Inc, Metro Manila, Philippines, Metro Manila, Philippines
    Principal Solutions Architect - Applied AI.Searce's AI Architect Leader responsible for designing and delivering complex, high-value AI solutions. accountable for technical excellence, client advis...Show moreLast updated: 21 days ago
    • Promoted
    Machine Learning Engineer, Supportability

    Machine Learning Engineer, Supportability

    StripeSan Pablo, Laguna, Philippines
    Stripe is a financial infrastructure platform for businesses.Millions of companies—from the world’s largest enterprises to the most ambitious startups—use Stripe to accept payments, grow their reve...Show moreLast updated: 18 days ago
    • Promoted
    BPO - AI Solutions Architect (Hybrid Setup)

    BPO - AI Solutions Architect (Hybrid Setup)

    Unity Communications LLCParañaque, Metro Manila, Philippines
    Beyond AI agent deployments, this role will also support the Business Intelligence Department Head on wider initiatives in Data Analytics and automation across the organization.The Business Intelli...Show moreLast updated: 5 days ago
    • Promoted
    Solutions Architect (Artificial Intelligence / Machine Learning)

    Solutions Architect (Artificial Intelligence / Machine Learning)

    JPMorganChase, Metro Manila, Philippines, Metro Manila, Philippines
    Solutions Architect (Artificial Intelligence / Machine Learning).Solutions Architect (Artificial Intelligence / Machine Learning). Are you passionate about shaping the future of artificial intelligence ...Show moreLast updated: 6 days ago
    • Promoted
    Customer Support Specialist - WFH - $4.50 USD per hour

    Customer Support Specialist - WFH - $4.50 USD per hour

    ETechAlaminos, Calabarzon
    In a world of disruption and increasingly complex business challenges, our professionals bring truth into focus with the Etech Lens. Our sharp analytical skills, paired with the latest technology, a...Show moreLast updated: 1 day ago
    BPO - AI Solutions Architect (Hybrid Setup) #M1

    BPO - AI Solutions Architect (Hybrid Setup) #M1

    Unity CommunicationsParañaque, Metro Manila, Philippines
    Quick Apply
    Beyond AI agent deployments, this role will also support the Business Intelligence Department Head on wider initiatives in Data Analytics and automation across the organization.Business Intelligenc...Show moreLast updated: 3 days ago
    • Promoted
    Remote Customer Support Specialist - $4.50 USD per hour

    Remote Customer Support Specialist - $4.50 USD per hour

    ETechSan Pablo, Laguna
    In a world of disruption and increasingly complex business challenges, our professionals bring truth into focus with the Etech Lens. Our sharp analytical skills, paired with the latest technology, a...Show moreLast updated: 1 day ago
    • Promoted
    Solutions architect

    Solutions architect

    SoftwareONE Deutschland GmbHAntipolo, Rizal, Philippines
    SoftwareOne is a leading global software and cloud solutions provider that helps organizations build, buy, and manage cloud solutions. The company supports migration and modernization of workloads, ...Show moreLast updated: 30+ days ago
    Tech Solutions Architect

    Tech Solutions Architect

    Shae GroupQuezon City, Metro Manila, PH
    Quick Apply
    We are a rapidly growing software and healthtech group building scalable platforms across AI, digital health, and low-code / no-code systems. Our work spans SaaS, custom enterprise builds, and consume...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Engineer, Cloudforce One Threat Intelligence

    Machine Learning Engineer, Cloudforce One Threat Intelligence

    CloudflareCavite City, Cavite, Philippines
    At Cloudflare, we are on a mission to help build a better Internet.Today the company runs one of the world’s largest networks that powers millions of websites and other Internet properties for cust...Show moreLast updated: 14 days ago
    • Promoted
    Solutions Architect

    Solutions Architect

    Stratpoint TechnologiesMandaluyong, Metro Manila, Philippines
    Be among the first 25 applicants.Direct message the job poster from Stratpoint Technologies.We are trusted, modern technology leaders in Agile Software Development, Quality Assurance, Cloud Consult...Show moreLast updated: 14 days ago
    • Promoted
    Solutions architect

    Solutions architect

    SoftwareONEMakati, Metro Manila, Philippines
    SoftwareOne is a leading global software and cloud solutions provider that helps organizations build, buy, and manage cloud solutions. The company assists clients to migrate and modernize workloads ...Show moreLast updated: 28 days ago
    • Promoted
    Machine Learning Engineer, Identity Product

    Machine Learning Engineer, Identity Product

    StripeSan Pablo, Laguna, Philippines
    Who we are : Stripe is a financial infrastructure platform for businesses.Millions of companies use Stripe to accept payments, grow their revenue, and accelerate new business opportunities.Our missi...Show moreLast updated: 11 days ago
    • Promoted
    Data Solutions Architect

    Data Solutions Architect

    NovareTaguig, Metro Manila, Philippines
    In this role, you will define how data flows across our systems, ensuring interoperability between tools and enabling a scalable, secure, and modern data platform. You are an expert in cloud-native ...Show moreLast updated: 30+ days ago
    • Promoted
    Tech Solutions Architect

    Tech Solutions Architect

    ShaeWellnessMandaluyong, Metro Manila, Philippines
    We are a rapidly growing software and healthtech group building scalable platforms across AI, digital health, and low-code / no-code systems. Our work spans SaaS, custom enterprise builds, and consume...Show moreLast updated: 13 days ago
    • Promoted
    Solutions Architect

    Solutions Architect

    HR TechX Corp.Taguig, Metro Manila, Philippines
    Get AI-powered advice on this job and more exclusive features.Provide support to other Solution Architects for elements such as completing scoping questionnaires with clients.Have a high-level unde...Show moreLast updated: 6 days ago
    BPO - AI Solutions Architect (Hybrid Setup)

    BPO - AI Solutions Architect (Hybrid Setup)

    Unity CommunicationsParañaque, Metro Manila, Philippines
    Quick Apply
    Beyond AI agent deployments, this role will also support the Business Intelligence Department Head on wider initiatives in Data Analytics and automation across the organization.Business Intelligenc...Show moreLast updated: 30+ days ago