Search results for all job vacancies on Jobstore

IN02 NVIDIA GraphicsPLtd,Pune

Senior System Software Engineer, Conversational AI

Full-time

India, Pune

Information Technology

10 months ago

NVIDIA's technology is at the heart of the AI revolution, touching people across the planet by powering everything from self-driving cars to robotics and intelligent assistants. Come join the team and see how you can make a lasting impact on the world! We're looking to grow our company and build our teams with the smartest people in the world. Join us at the forefront of technological advancement. NVIDIA is looking for a System Software Engineer to develop tools for building powerful, flexible, multi-modal AI agents driven by Large Language Models (LLMs) and to improve the experience of millions of customers. If you're creative and passionate about solving real-world conversational AI problems, come join us.

What you’ll be doing:

Architect, implement and optimize GPU-accelerated, scalable Retrieval Augmented Generation (RAG) workflows. Build a scalable, microservice-based architecture deployable in multi-node, multi-cloud environments.
Design, implement and test domain-specific agents and workflows, along with a framework that supports multi-turn, multi-modal, multi-user conversations with LLM-driven agents.
Develop knowledge discovery and reasoning capabilities, including but not limited to disambiguation, clarification, and anticipation, for dialogue systems.
Analyze the end-to-end accuracy and limitations of RAG and conversational AI agents, and recommend next steps and improvements.
Characterize performance and quality metrics across platforms for various AI and system components
Collaborate with various teams on new product features and improvements of existing products. Customize and integrate the conversational AI framework with other NVIDIA products
Participate in developing and reviewing code, design documents, use case reviews, and test plan reviews and help innovate, identify problems, recommend solutions and perform triage in a collaborative team environment.

What we need to see:

Bachelor's degree or Master’s degree (or equivalent experience) in Computer Science, Electrical Engineering, Artificial Intelligence, or Applied Math
8+ years of experience
Excellent programming skills in Python
Working knowledge of Large Language Model applications
Familiarity with microservices, Docker, Helm, Kubernetes, etc.
Experience working across the end-to-end software lifecycle, release packaging and CI/CD pipelines
Hands-on experience with conversational AI technologies such as Large Language Models, Information Retrieval, Natural Language Processing, Dialogue Systems (including system integration, state tracking and action prediction), Question Answering, etc.
General background in version control and code review tools such as Git, Gerrit and GitLab.
Strong collaborative and interpersonal skills, specifically a proven ability to effectively guide and influence within a dynamic environment

Ways to stand out from the crowd:

Strong fundamentals in programming, optimization and software design
Strong knowledge of ML/DL techniques, algorithms and tools, with exposure to CNNs, RNNs (LSTM), Transformers (BERT, GPT, Megatron) and language models
Familiarity with GPU-based technologies such as CUDA, cuDNN and TensorRT
Background in deploying machine learning models in data center, cloud, and embedded systems

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

IN02 NVIDIA GraphicsPLtd,Pune

System Software Engineer, Conversational AI

Full-time

India, Pune

Information Technology

10 months ago

NVIDIA's technology is at the heart of the AI revolution, touching people across the planet by powering everything from self-driving cars, robotic.....

What you’ll be doing:

Build GPU-accelerated, scalable, LLM-driven Retrieval Augmented Generation (RAG) workflows and a scalable, microservice-based architecture deployable in multi-node, multi-cloud environments.
Build domain-specific agents and workflows, along with a framework that supports multi-turn, multi-modal, multi-user conversations with LLM-driven agents.
Develop knowledge discovery and reasoning capabilities, including but not limited to disambiguation, clarification, and anticipation, for dialogue systems.
Evaluate and benchmark end-to-end RAG and conversational AI agent pipelines for accuracy as well as system performance.
Analyze the end-to-end accuracy and limitations of RAG and conversational AI agents, and recommend next steps and improvements.
Characterize performance and quality metrics across platforms for various AI and system components
Collaborate with various teams on new product features and improvements of existing products. Customize and integrate the conversational AI framework with other NVIDIA products
Participate in developing and reviewing code, design documents, use case reviews, and test plan reviews and help innovate, identify problems, recommend solutions and perform triage in a collaborative team environment.

What we need to see:

Bachelor's degree or Master’s degree (or equivalent experience) in Computer Science, Electrical Engineering, Artificial Intelligence, or Applied Math
5+ years of experience and excellent programming skills in Python
Knowledge of Large Language Model applications
Familiarity with microservices, Docker, Helm, Kubernetes, etc.
Experience working across the end-to-end software lifecycle, release packaging and CI/CD pipelines
Hands-on experience with conversational AI technologies such as Large Language Models, Information Retrieval, Natural Language Processing, Dialogue Systems (including system integration, state tracking and action prediction), Question Answering, etc.
Knowledge of vector databases and embedding models
General background in version control and code review tools such as Git, Gerrit and GitLab.
Strong collaborative and interpersonal skills, specifically a proven ability to effectively guide and influence within a dynamic environment

Ways to stand out from the crowd:

Strong fundamentals in programming, optimization and software design
Strong knowledge of ML/DL techniques, algorithms and tools, with exposure to CNNs, RNNs (LSTM), Transformers (BERT, GPT, Megatron) and language models
Familiarity with GPU-based technologies such as CUDA, cuDNN and TensorRT
Background in deploying machine learning models in data center, cloud, and embedded systems

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

IN02 NVIDIA GraphicsPLtd,Pune

Senior Deep Learning Scientist, LLM and Tools

Full-time

India, Pune

Education / Training

10 months ago

NVIDIA is an industry leader with groundbreaking developments in High-Performance Computing, Artificial Intelligence and Visualization. The GPU, our i.....

What You’ll Be Doing:

Develop, train, fine-tune, and deploy multimodal large language models for retrieval augmented generation.
Develop an LLM agent framework for orchestrating large-scale RAG applications.
Build an LLM agent framework for reasoning and action prediction in multimodal environments.
Apply instruction tuning, reinforcement learning from human feedback (RLHF), and parameter-efficient fine-tuning methods such as p-tuning, adapters and LoRA to improve LLMs for different RAG use cases.
Measure and benchmark model and application performance.
Analyze model accuracy and bias, and recommend next steps and improvements.
Maintain model evaluation systems.
Drive the gathering, building, and annotation of domain specific datasets to train LLMs for different tasks and applications.
Characterize performance and quality metrics across platforms for various AI and system components.
Participate in developing and reviewing code, design documents, use case reviews, and test plan reviews.
Help innovate, identify problems, recommend solutions and perform triage in a collaborative team environment and collaborate with various teams on new product features and improvements of existing products.

What We Need To See:

Master’s degree (or equivalent experience) or PhD in Computer Science, Electrical Engineering, Artificial Intelligence, or Applied Math with 5+ years of experience.
Excellent programming skills in Python with strong fundamentals in programming, optimizations and software design.
Solid understanding of ML/DL techniques, algorithms and tools, with exposure to CNNs, RNNs (LSTM), Transformers (BERT, BART, GPT/T5, Megatron, LLMs).
Hands-on experience with conversational AI technologies such as Natural Language Understanding, Natural Language Generation, Dialog Systems (including system integration, state tracking and action prediction), Information Retrieval, Question Answering, Machine Translation, etc.
Experience training BERT, GPT and Megatron models for different NLP and dialog system tasks using the PyTorch deep learning framework, and performing NLP data wrangling and tokenization.
Experience developing large-scale multimodal information retrieval systems using open-source frameworks such as LlamaIndex, LangChain, FAISS, Haystack and so on.
Experience developing production LLM-powered applications and tools with natural language interfaces.
Understanding of the MLOps life cycle and experience with MLOps workflows, traceability and versioning of datasets, including working knowledge of database management and queries (in SQL, MongoDB, etc.).
Experience using end-to-end MLOps platforms such as Kubeflow, MLflow and Airflow.
Strong collaborative and interpersonal skills, specifically a proven ability to effectively guide and influence within a dynamic matrix environment.

Ways To Stand Out From The Crowd:

Fluency in a non-English language - Spanish / Mandarin / German / Japanese / Russian / French / UK English / Arabic / Korean / Italian / Portuguese
Familiarity with GPU-based technologies such as CUDA, cuDNN and TensorRT
Background in Docker and Kubernetes, and strong C++ programming skills
Background in deploying machine learning models in data center, cloud, and embedded systems, as well as experience developing document extraction for different document types and sources, and indexing at scale
Experience adapting LLMs to different domains such as automotive, health care, finance etc.

With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology industry's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working with us, and our engineering teams are growing fast in some of the hottest state-of-the-art fields: Deep Learning, Artificial Intelligence, and Large Language Models. If you're a creative engineer with a real passion for robust and enjoyable user experiences, we want to hear from you. NVIDIA is committed to encouraging a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

IN02 NVIDIA GraphicsPLtd,Pune

Senior Deep Learning Scientist, Conversational AI

Full-time

India, Pune

Education / Training

10 months ago

NVIDIA is an industry leader with groundbreaking developments in High-Performance Computing, Artificial Intelligence and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, autonomous cars and conversational AI that can perceive and understand the world. Today, we are increasingly known as “the AI computing company.” We're looking to grow our company, and build our teams. Join us at the forefront of technological advancement!

NVIDIA is looking for a Senior Deep Learning Scientist, Conversational AI, who is passionate about areas such as embodied AI, conversational AI, robotics (navigation, manipulation), AR/VR/MR, egocentric computer vision, grounded 3D perception, simulation and sim2real transfer, pre-training for embodied agents, and human-AI interaction, and who brings to bear foundational knowledge from areas such as deep learning, reinforcement learning, computational statistics, and applied mathematics. You will have an opportunity to make core algorithmic advances and apply your ideas at scale using our NeMo LLM MLOps platform. You will develop high-impact, high-visibility large language model products and improve the experience of millions of customers. If you're creative and passionate about solving real-world embodied conversational AI problems, come join our Digital Human LLM team. For more details on the NeMo Framework for LLMs, see: https://www.nvidia.com/en-us/ai-data-science/generative-ai/nemo-framework/

What You’ll Be Doing

Develop, train, fine-tune, and deploy LLMs for driving embodied conversational AI systems, including multimodal understanding, speech synthesis, image generation, UI and animation rendering and control, environment interaction, and dialog reasoning and tool systems.
Apply innovative fundamental and applied research to develop products for embodied conversational artificial intelligence.
Build novel data-driven paradigms for embodied intelligence, including customization recipes for different domains and enterprise use cases.
Develop systems and frameworks using various data modalities (images, video, text, audio, tactile, etc.) and the roles they play in different levels of embodied reasoning and decision making.
Explore paradigms that can deliver a spectrum of embodied behaviors - from simulated characters to real robots, and from short-horizon, low-level to long-horizon, high-level.
Enable long-horizon reasoning and facilitate low-level skills for embodied AI tasks.
Apply alignment techniques such as instruction tuning, reinforcement learning from human feedback (RLHF), and parameter-efficient fine-tuning methods such as p-tuning, adapters and LoRA to improve use cases.
Measure and benchmark model and application performance, analyze model accuracy and bias, and recommend next steps and improvements.
Drive the gathering, building, and annotation of domain-specific datasets to train LLMs for different embodied tasks and applications; maintain model evaluation systems; and characterize performance and quality metrics across platforms for various AI and system components.
Collaborate and innovate with various teams on new product features and improvements to existing products, and participate in developing and reviewing code, design documents, use case reviews, and test plan reviews.

What We Need To See

Master’s degree (or equivalent experience) or PhD in Computer Science, Electrical Engineering, Artificial Intelligence, or Applied Math with 5+ years of experience.
Excellent programming skills in Python with strong fundamentals in optimization and software design.
Solid understanding of ML/DL techniques, algorithms and tools, with exposure to CNNs, RNNs (LSTM), Transformers (ViT, BERT, BART, GPT/T5, Megatron, LLMs).
Hands-on experience with conversational AI technologies.
Experience training ViT, BERT, GPT and Megatron models for different computer vision, NLP and dialog system tasks using the PyTorch deep learning framework, and performing data wrangling and tokenization.
Solid understanding of the MLOps life cycle and experience with MLOps workflows, traceability and versioning of datasets, including working knowledge of database management and queries (in SQL, MongoDB, etc.).
Strong collaborative and interpersonal skills, with the ability to guide and influence effectively within a dynamic matrix environment.

Ways To Stand Out From The Crowd

Fluency in a non-English language - Spanish / Mandarin / German / Japanese / Russian / French / UK English / Arabic / Korean / Italian / Portuguese.
Familiarity with GPU-based technologies such as CUDA, cuDNN and TensorRT.
Background in Docker and Kubernetes and in deploying machine learning models in data center, cloud, and embedded systems, plus strong C++ programming skills.
Experience developing all aspects of large language models.
Experience integrating embodied AI systems with various sensor inputs.