The Full Body Vectorization (FBV) team is the engine powering semantic search across Microsoft. We own and operate the leading embedding models that generate the core vector representations for all Bing index content (web, fresh, multimedia, and ads) and that affect multiple stacks from retrieval to ranking.
We are now building a next-generation search engine and grounding system to lead the next technical wave and improve the quality of Search, Copilot, and the many other tools that benefit from AI. This is a great opportunity to join us, aim high, and use our collective intelligence to help customers across the world.
We are looking for a Senior Applied Scientist who is passionate about leveraging the latest innovations and can independently lead exploration from idea and design through experiments to landing in production. You should have an understanding of and interest in retrieval/search, LLMs, multimodal models, and NLP, with deep experience in DL/ML.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Starting January 26, 2026, Microsoft AI (MAI) employees who live within a 50-mile commute of a designated Microsoft office in the U.S. or 25-mile commute of a non-U.S., country-specific location are expected to work from the office at least four days per week. This expectation is subject to local law and may vary by jurisdiction.
Responsibilities
- Apply state-of-the-art research and industry innovations to improve Search/Grounding quality with measurable business impact.
- Explore big bets with mature judgment, and drive the direction from idea through to shipping in production.
- Collaborate with product, algorithm, and engineering partners across the world, communicating clearly to move projects forward.
- Improve productivity and agility by optimizing model training/inference code, data pipelines and tools, and the shipping process.
- Mentor and guide junior team members to grow into role models, not only technically but also in their attitude and working methodology.
- Keep learning, and help build a team culture of sharing through discussion and ideas drawn from new innovations.
- Be sensitive to data. Help review designs, experiments, and pipelines with your expertise.
- Drive data quality improvements.
- Have solid coding ability to maintain and update online systems and implement feature changes.
Qualifications
Required Qualifications:
- Bachelor’s Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 2+ years related experience (e.g., statistics, predictive analytics, research)
- OR Master’s Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 1+ year(s) related experience (e.g., statistics, predictive analytics, research)
- OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field OR equivalent experience.
- Prior experience in research, applied science, search/recommendation, or other solid experience in deep model training.
- Experience with common machine learning and deep learning frameworks and concepts, including the use of LLMs and prompting.
- Experience with PyTorch or TensorFlow.
Preferred Qualifications:
- Bachelor’s Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 5+ years related experience (e.g., statistics, predictive analytics, research)
- OR Master’s Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research)
- OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 1+ year(s) related experience (e.g., statistics, predictive analytics, research)
- OR equivalent experience.
- Solid experience with and understanding of deep models and retrieval.
- Solid research experience and publications.
- Familiarity with model optimization for training/inference and on the GPU side.
#MicrosoftAI
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Principal Technical Program Manager
The Microsoft AI Monetization team is responsible for driving innovative monetization strategies across Microsoft’s AI products including Copilot, Bing, Edge, and Microsoft Advertising. We focus on delivering highly relevant and engaging experiences for consumers while maximizing value for advertisers and partners.
We are seeking an accomplished Principal Technical Program Manager to drive critical monetization initiatives for Bing Search. This execution-focused role requires a seasoned leader who can orchestrate complex, cross-organizational programs involving multiple CVPs, finance partners, and engineering teams to unlock new revenue opportunities while delivering exceptional user experiences.
The ideal candidate will bring deep expertise in monetization and experimentation and analysis, with proven ability to drive results across highly matrixed organizations. You will build end-to-end user experiences across platforms and lead data-driven experimentation to improve relevance, engagement, and overall product performance. Experience with Search and Ads ecosystems is highly valued.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Starting January 26, 2026, Microsoft AI (MAI) employees who live within a 50-mile commute of a designated Microsoft office in the U.S. or 25-mile commute of a non-U.S., country-specific location are expected to work from the office at least four days per week. This expectation is subject to local law and may vary by jurisdiction.
Responsibilities
- Own end-to-end execution of large-scale monetization programs, driving delivery against aggressive timelines and measurable business outcomes.
- Define execution strategy, milestones, and success metrics for complex technical programs across ad delivery, AI/ML-driven targeting and bidding, traffic optimization, whole-page monetization, and cross-platform user experiences.
- Orchestrate execution across multiple engineering, product, and partner teams, proactively managing dependencies, risks, and tradeoffs in highly matrixed environments.
- Establish solid operational rigor through clear review cadences, executive-level communication, and cross-team alignment to maintain velocity and quality at scale.
- Partner closely with Finance, Product, Engineering, Data Science, and Design to model revenue impact, track performance, and translate technical complexity into business-aligned outcomes.
- Lead data-driven experimentation and analysis to validate monetization hypotheses, measure incremental impact, and inform prioritization across initiatives.
- Apply deep understanding of the Search and Ads ecosystem—including SERP rendering, ad marketplaces, ranking systems, and monetization infrastructure—to guide technical and product decisions.
- Influence senior stakeholders across Bing, Microsoft Advertising, AI Platform, and partner organizations to drive alignment, unblock decisions, and deliver cohesive solutions.
Qualifications
Required Qualifications:
- Bachelor’s Degree AND 6+ years experience in engineering, product/technical program management, data analysis, or product development
- OR equivalent experience.
- 3+ years of experience managing cross-functional and/or cross-team projects.
Additional or Preferred Qualifications:
- Bachelor’s Degree AND 12+ years experience in engineering, product/technical program management, data analysis, or product development
- OR equivalent experience.
- Experience with Search and Ads ecosystems including search engines (Bing, Google), advertising platforms (Microsoft Advertising, Google Ads), ad serving systems, or related technologies.
- 8+ years of experience managing cross-functional and/or cross-team projects.
- 1+ year(s) of experience reading and/or writing code (e.g., sample documentation, product demos)
#MicrosoftAI #MAI
Technical Program Management IC5 – The typical base pay range for this role across the U.S. is USD $139,900 – $274,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000 – $304,200 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Member of Technical Staff – Pretraining Text Data
We are seeking engineers and researchers to join our Pretraining Text Data team, where we are building the next generation of foundation large language models. If you are passionate about designing and curating high-quality datasets to power frontier AI models, this role is for you.
In this role, you’ll work at the intersection of data and innovation—collaborating with scientists, engineers, and annotators to curate, analyze, and evaluate diverse text datasets critical to model development. You will lead efforts to:
- Develop novel data collection strategies
- Improve dataset quality and integrity
- Understand data-driven model behaviors
- Train models to understand the impact of data and data mixes
- Align datasets with ethical and societal values
This is a cross-disciplinary, high-impact role ideal for engineers and researchers who want to push the boundaries of what AI can learn from data.
Microsoft Superintelligence Team
Microsoft Superintelligence team’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
This role is part of Microsoft AI’s Superintelligence Team. The MAIST is a startup-like team inside Microsoft AI, created to push the boundaries of AI toward Humanist Superintelligence—ultra-capable systems that remain controllable, safety-aligned, and anchored to human values. Our mission is to create AI that amplifies human potential while ensuring humanity remains firmly in control. We aim to deliver breakthroughs that benefit society—advancing science, education, and global well-being.
We’re also fortunate to partner with incredible product teams, giving our models the chance to reach billions of users and create immense positive impact. If you’re a brilliant, highly ambitious, and low-ego individual, you’ll fit right in. Come and join us as we work on our next generation of models!
Responsibilities
- Create high-quality datasets for training and evaluation; run experiments on new datasets (data ablations) to assess their impact and determine the most effective data.
- Develop and maintain scalable data pipelines for text data ingestion, preprocessing, filtering, and annotation.
- Analyze real-world text datasets to assess quality, diversity, relevance, and identify areas for improvement.
- Build lightweight tools and workflows for dataset auditing, visualization, and versioning.
- Collaborate with Safety, Ethics, and Governance teams to ensure datasets meet standards for quality, privacy, and responsible AI practices.
- Embody our culture and values.
Qualifications
Required Qualifications:
- Bachelor’s Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)
- OR equivalent experience.
Preferred Qualifications:
- Master’s Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)
- OR Bachelor’s Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND 12+ years technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)
- OR equivalent experience.
- 2+ years of experience in data analysis or data engineering, including work with large-scale datasets that are unstructured or semi-structured.
- Proficiency in statistics and exploratory data analysis methods.
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Member of Technical Staff -Member of Technical Staff – Pretraining Text Data
Member of Technical Staff -Member of Technical Staff – Pretraining Text Data
- Location
- Job Number
- City
- Team
- Country
- Discipline
We are seeking engineers and researchers to join our Pretraining Text Data team, where we are building the next generation of foundation large language models. If you are passionate about designing and curating high-quality datasets to power frontier AI models, this role is for you.
In this role, you’ll work at the intersection of data and innovation—collaborating with scientists, engineers, and annotators to curate, analyze, and evaluate diverse text datasets critical to model development. You will lead efforts to:
Develop novel data collection strategies
Improve dataset quality and integrity
Understand data-driven model behaviors
Train models to understand the impact of data and data mixes
Align datasets with ethical and societal values
This is a cross-disciplinary, high-impact role ideal for engineers and researchers who want to push the boundaries of what AI can learn from data.
Microsoft Superintelligence Team
Microsoft Superintelligence team’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
This role is part of Microsoft AI’s Superintelligence Team. The MAIST is a startup-like team inside Microsoft AI, created to push the boundaries of AI toward Humanist Superintelligence—ultra-capable systems that remain controllable, safety-aligned, and anchored to human values. Our mission is to create AI that amplifies human potential while ensuring humanity remains firmly in control. We aim to deliver breakthroughs that benefit society—advancing science, education, and global well-being.
We’re also fortunate to partner with incredible product teams giving our models the chance to reach billions of users and create immense positive impact. If you’re a brilliant, highly-ambitious and low ego individual, you’ll fit right in—come and join us as we work on our next generation of models!
Responsibilities
- Create high-quality datasets for training and evaluation; run experiments on new datasets (data ablations) to assess their impact and determine the most effective data.
- Develop and maintain scalable data pipelines for text data ingestion, preprocessing, filtering, and annotation.
- Analyze real-world text datasets to assess quality, diversity, relevance, and identify areas for improvement.
- Build lightweight tools and workflows for dataset auditing, visualization, and versioning.
- Collaborate with Safety, Ethics, and Governance teams to ensure datasets meet standards for quality, privacy, and responsible AI practices.
- Embody our culture and values.
Qualifications
Required Qualifications:
- Bachelor’s Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)
- OR equivalent experience.
Preferred Qualifications:
- Master’s Degree in in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)
- OR Bachelor’s Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND 12+ years technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)
- OR equivalent experience.
- 2+ years of experience in data analysis or data engineering, including work with large-scale datasets that are unstructured or semi-structured.
- Proficiency in statistics and exploratory data analysis methods.
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Senior Software Engineer
The MAI-A Fundamentals team is dedicated to building one of the world’s largest distributed systems to power Bing Search, Copilot Search, Grounding APIs, and drive advancements in relevance through cutting-edge deep learning techniques. We are currently developing the next generation of grounding APIs and their hosting platform services, with the goal of creating the fastest and most modern search API business.
Our efforts focus on constructing a system with a tens-of-billions-level index, an orchestration engine capable of operating within hundreds of milliseconds, and an online GPU inference system. These innovations will enable rapid, scalable, and intelligent search capabilities for a wide range of applications.
Starting January 26, 2026, Microsoft AI (MAI) employees who live within a 50-mile commute of a designated Microsoft office in the U.S. or 25-mile commute of a non-U.S., country-specific location are expected to work from the office at least four days per week. This expectation is subject to local law and may vary by jurisdiction.
Responsibilities
- Understand User Requirements
  - Collaborates with appropriate stakeholders (e.g., relevance customer teams, project managers, other team members, and other service owners) to determine user requirements and service gaps for each scenario. Understands the detailed asks of different scenarios and builds efficient functionality and services to fulfill those requirements.
- Design, coding, and implementation
  - Takes part in discussions on the architecture of products/solutions and creates architecture proposals by testing design hypotheses and helping to refine code plans. Provides feedback, proposed solutions, and input to architects.
  - Independently creates a clear plan for testing and assuring quality of solutions, and defines success for outcomes of tests (e.g., unit tests).
  - Writes extensible and maintainable code. Optimizes, debugs, refactors, and reuses code to improve performance, maintainability, effectiveness, and ROI. Reviews code from other team members to ensure it meets the team’s and Microsoft’s quality standards.
- Scenario onboarding and online support
  - Collaborates closely with scenario owners to facilitate their onboarding process and offers essential support to ensure their business success.
  - Delivers efficient toolsets designed to enhance the platform’s usability, debugging capabilities, and monitoring functionalities.
Qualifications
Required Qualifications:
- Bachelor’s Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- At least 4 years of experience in designing, developing and maintaining distributed information management systems.
- Familiarity with cloud computing platforms such as Azure and Kubernetes (K8s).
- Good communication, collaboration, and problem-solving skills; fluent spoken and written English.
Preferred Qualifications:
- Master’s Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR Bachelor’s Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- 6+ years of experience designing, developing, and maintaining distributed computing platforms. Experience writing high-quality code and conducting code reviews.
- Experience with Bing search, other search engine platform services, or other large-scale platform services.
- Nice to have: knowledge of deep learning techniques and frameworks, and experience with LLM prompt engineering.
Senior Software Engineer
We are seeking an expert Senior GPU Engineer to join our AI Infrastructure team. In this role, you will architect and optimize the core inference engine that powers our large-scale AI models. You will be responsible for pushing the boundaries of hardware performance, reducing latency, and maximizing throughput for Generative AI and Deep Learning workloads.
You will work at the intersection of Deep Learning algorithms and low-level hardware, designing custom operators and building a highly efficient training/inference execution engine from the ground up.
Starting January 26, 2026, Microsoft AI (MAI) employees who live within a 50-mile commute of a designated Microsoft office in the U.S. or 25-mile commute of a non-U.S., country-specific location are expected to work from the office at least four days per week. This expectation is subject to local law and may vary by jurisdiction.
Responsibilities
- Custom Operator Development: Design and implement highly optimized GPU kernels (CUDA/Triton) for critical deep learning operations (e.g., FlashAttention, GEMM, LayerNorm) to outperform standard libraries.
- Inference Engine Architecture: Contribute to the development of our high-performance inference engine, focusing on graph optimizations, operator fusion, and dynamic memory management (e.g., KV Cache optimization).
- Performance Optimization: Deeply analyze and profile model performance using tools like Nsight Systems/Compute. Identify bottlenecks in memory bandwidth, instruction throughput, and kernel launch overheads.
- Model Acceleration: Implement advanced acceleration techniques such as Quantization (INT8, FP8, AWQ), Kernel Fusion, and continuous batching.
- Distributed Computing: Optimize communication primitives (NCCL) to enable efficient multi-GPU and multi-node inference (Tensor Parallelism, Pipeline Parallelism).
- Hardware Adaptation: Ensure the software stack fully utilizes modern GPU architecture features (e.g., NVIDIA Hopper/Ampere Tensor Cores, Asynchronous Copy).
Qualifications
Required Qualifications:
- Bachelor’s Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- Professional Depth: 4+ years of experience in systems programming, HPC, or GPU software development, including at least 5 years of hands-on CUDA/C++ kernel development.
- Architectural Mastery: Expertise in the CUDA programming model and NVIDIA GPU architectures (specifically Ampere/Hopper). Deep understanding of the memory hierarchy (Shared Memory, L2 cache, Registers), warp-level primitives, occupancy optimization, and bank conflict resolution.
- Familiarity with advanced hardware features: Tensor Cores, TMA (Tensor Memory Accelerator), and asynchronous copy.
- Programming & Systems: Proven ability to navigate and modify complex, large-scale codebases (e.g., PyTorch internals, Linux kernel).
- Experience with build and binding ecosystems: CMake, pybind11, and CI/CD for GPU workloads.
- Performance Engineering: Mastery of NVIDIA Nsight Systems/Compute. Ability to mathematically reason about performance using the Roofline Model, memory bandwidth utilization, and compute throughput.
Preferred Qualifications:
- Master’s Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR Bachelor’s Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- Professional Depth: 5+ years of experience in systems programming, HPC, or GPU software development, including at least 5 years of hands-on CUDA/C++ kernel development.
- Engine & Framework Expertise: Working knowledge of state-of-the-art inference/training stacks: SGLang, vLLM, TensorRT-LLM, DeepSpeed, or Megatron-LM. Deep understanding of optimization patterns: PagedAttention, RadixAttention (Prefix Caching), continuous batching, and speculative decoding.
- Operator & GEMM Optimization: Practical experience with CUTLASS, CuTe, or OpenAI Triton. Expertise in high-performance linear algebra (GEMM) optimization, including tiling strategies, data layouts, and mixed-precision accumulation.
- Distributed Systems: Proficiency in multi-GPU/multi-node scaling using NCCL and parallelism strategies (Tensor, Pipeline, and Sequence parallelism).
- Vibe Coding & AI-Native Velocity: An AI-native mindset: expert at using vibe coding tools to bypass boilerplate and accelerate the development lifecycle. The technical intuition to architect systems rapidly, moving from “vibe” to “highly optimized production code” with extreme velocity.
Member of Technical Staff, Software Co-Design AI HPC Systems – MAI Superintelligence Team
Our team’s mission is to architect, co-design, and productionize next-generation AI systems at datacenter scale. We operate at the intersection of models, systems software, networking, storage, and AI hardware, optimizing end-to-end performance, efficiency, reliability, and cost. Our work spans today’s frontier AI workloads and directly shapes the next generation of accelerators, system architectures, and large-scale AI platforms. We pursue this mission through deep hardware–software co-design, combining rigorous systems thinking with hands-on engineering. The team invests heavily in understanding real production workloads (large-scale training, inference, and emerging multimodal models) and translating those insights into concrete improvements across the stack: from kernels, runtimes, and distributed systems, all the way down to silicon-level trade-offs and datacenter-scale architectures.
This role sits at the boundary between exploration and production. You will work closely with internal infrastructure, hardware, compiler, and product teams, as well as external partners across the hardware and systems ecosystem. Our operating model emphasizes rapid ideation and prototyping, followed by disciplined execution to drive high-leverage ideas into production systems that operate at massive scale.
In addition to delivering real-world impact on large-scale AI platforms, the team actively contributes to the broader research and engineering community. Our work aligns closely with leading communities in ML systems, distributed systems, computer architecture, and high-performance computing, and we regularly publish, prototype, and open-source impactful technologies where appropriate.
Responsibilities
- Lead the co-design of AI systems across hardware and software boundaries, spanning accelerators, interconnects, memory systems, storage, runtimes, and distributed training/inference frameworks.
- Drive architectural decisions by analyzing real workloads, identifying bottlenecks across compute, communication, and data movement, and translating findings into actionable system and hardware requirements.
- Co-design and optimize parallelism strategies, execution models, and distributed algorithms to improve scalability, utilization, reliability, and cost efficiency of large-scale AI systems.
- Develop and evaluate what-if performance models to project system behavior under future workloads, model architectures, and hardware generations, providing early guidance to hardware and platform roadmaps.
- Partner with compiler, kernel, and runtime teams to unlock the full performance of current and next-generation accelerators, including custom kernels, scheduling strategies, and memory optimizations.
- Influence and guide AI hardware design at system and silicon levels, including accelerator microarchitecture, interconnect topology, memory hierarchy, and system integration trade-offs.
- Lead cross-functional efforts to prototype, validate, and productionize high-impact co-design ideas, working across infrastructure, hardware, and product teams.
- Mentor senior engineers and researchers, set technical direction, and raise the overall bar for systems rigor, performance engineering, and co-design thinking across the organization.
Qualifications
Required/Minimum Qualifications
- Bachelor’s Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- Master’s Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR Bachelor’s Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- Strong background in one or more of the following areas:
  - AI accelerator or GPU architectures
  - Distributed systems and large-scale AI training/inference
  - High-performance computing (HPC) and collective communications
  - ML systems, runtimes, or compilers
  - Performance modeling, benchmarking, and systems analysis
  - Hardware–software co-design for AI workloads
- Proficiency in systems-level programming (e.g., C/C++, CUDA, Python) and performance-critical software development.
- Proven ability to work across organizational boundaries and influence technical decisions involving multiple stakeholders.
- Experience designing or operating large-scale AI clusters for training or inference.
- Deep familiarity with LLMs, multimodal models, or recommendation systems, and their systems-level implications.
- Experience with accelerator interconnects and communication stacks (e.g., NCCL, MPI, RDMA, high-speed Ethernet or InfiniBand).
- Background in performance modeling and capacity planning for future hardware generations.
- Prior experience contributing to or leading hardware roadmaps, silicon bring-up, or platform architecture reviews.
- Publications, patents, or open-source contributions in systems, architecture, or ML systems are a plus.
Software Engineering IC5 – The typical base pay range for this role across the U.S. is USD $139,900 – $274,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000 – $304,200 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay
Software Engineering IC6 – The typical base pay range for this role across the U.S. is USD $163,000 – $296,400 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $220,800 – $331,200 per year.