Member of Technical Staff, AI Systems Engineer – Microsoft Superintelligence
We are building next-generation customized AI silicon designed to accelerate AI workloads with unprecedented efficiency. We are looking for an exceptional Systems Engineer to bridge the gap between our custom hardware and modern AI inference frameworks.
We build foundational AI infrastructure that enables large-scale training and inference across diverse workloads and rapidly evolving hardware generations. Our work directly shapes how AI systems are designed, deployed, and scaled today and into the future. Engineers on this team operate with end-to-end ownership, deep technical rigor, and a strong bias toward real-world impact.
The Microsoft Superintelligence team’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
This role is part of Microsoft AI’s Superintelligence Team. The MAIST is a startup-like team inside Microsoft AI, created to push the boundaries of AI toward Humanist Superintelligence: ultra-capable systems that remain controllable, safety-aligned, and anchored to human values. Our mission is to create AI that amplifies human potential while ensuring humanity remains firmly in control. We aim to deliver breakthroughs that benefit society by advancing science, education, and global well-being.
We’re also fortunate to partner with incredible product teams, giving our models the chance to reach billions of users and create immense positive impact. If you’re a brilliant, highly ambitious, and low-ego individual, you’ll fit right in. Come and join us as we work on our next generation of models!
The Role
As a Senior AI Systems Engineer, you will own the software integration layer between our custom AI chip’s proprietary SDK and SGLang, a state-of-the-art serving framework for Large Language Models (LLMs) and Vision-Language Models. You will be responsible for ensuring that our silicon can seamlessly run SGLang inference workloads at peak performance, bypassing the traditional CUDA ecosystem entirely.
Responsibilities
- Framework Integration: Architect and develop the backend integration to make our custom AI chip a first-class citizen in SGLang.
- Custom Operator Development: Write custom C++ / PyTorch extensions that map SGLang’s primitive operations (e.g., RadixAttention, FlashAttention, matrix multiplications) to our custom chip’s proprietary software layer.
- Performance Optimization: Profile and optimize end-to-end LLM inference latency, throughput, and memory utilization (Paged Attention) on our hardware.
- Cross-Functional Collaboration: Work closely with our hardware architecture and compiler teams to provide feedback on our custom software stack and silicon design based on framework-level bottlenecks.
- Testing & Deployment: Build robust testing pipelines to validate model accuracy and performance parity against standard GPU baselines.
Qualifications
- BS, MS, or PhD in Computer Science, Computer Engineering, or a related field.
- Software engineering experience focusing on systems programming, ML infrastructure, or AI compilers.
- Expertise in Python: Deep understanding of memory management and concurrent programming.
- Experience with LLM Inference Engines: Hands-on experience modifying or extending frameworks like SGLang, vLLM, DeepSpeed-FastGen, or TensorRT-LLM.
- PyTorch Internals: Strong experience writing PyTorch C++ extensions and custom operators.
- Hardware Interfacing: Proven track record of integrating machine learning workloads with hardware accelerators (GPUs, TPUs, NPUs) using custom SDKs, APIs, or low-level drivers.
- Prior experience working on non-CUDA software ecosystems (e.g., AMD ROCm, AWS Neuron, Google XLA).
- Familiarity with AI compilers and intermediate representations (MLIR, Apache TVM, OpenAI Triton).
- Strong understanding of underlying LLM architectures (Transformers, MoE) and state-of-the-art attention algorithms (FlashAttention v2/v3).
- Previous experience at an AI silicon startup or working on custom accelerators (e.g., Google TPU, AWS Trainium).
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Member of Technical Staff – Data Scientist
We’re looking for data scientists to help build the next generation of post-training methods for frontier models at Microsoft AI. You’ll join a small, high-impact team working across all stages of post-training, with a focus on evaluation design, high-quality training data, and scalable data pipelines for state-of-the-art foundation models.
In this role, you’ll help turn raw model capability into reliable, aligned, and measurable performance improvements, directly shaping how frontier models behave in real-world deployments.
About the Role:
Microsoft AI is building the next generation of frontier models that power Copilot and other large-scale AI experiences. The Post-Training team is responsible for transforming powerful pretrained models into robust, aligned, and high-performing systems used by millions of people worldwide.
Our work focuses on improving general quality, instruction following, coding and math ability, tool use, agentic behaviors, personality, and other critical model capabilities. We operate across the full post-training lifecycle — from data generation and curation, to evaluation and diagnostics, to reward modeling and reinforcement learning.
We are a small, highly autonomous team that works closely with pre-training, product, and engineering partners to rapidly iterate on ideas, run large-scale experiments, and safely advance model capabilities. Each team member owns meaningful parts of the post-training pipeline and has direct access to the compute, data, and decision-making needed to move quickly from insight to production.
Microsoft Superintelligence Team
This role is part of Microsoft AI’s Superintelligence Team. The MAIST is a startup-like team inside Microsoft AI, created to push the boundaries of AI toward Humanist Superintelligence—ultra-capable systems that remain controllable, safety-aligned, and anchored to human values. Our mission is to create AI that amplifies human potential while ensuring humanity remains firmly in control. We aim to deliver breakthroughs that benefit society—advancing science, education, and global well-being.
We’re also fortunate to partner with incredible product teams, giving our models the chance to reach billions of users and create immense positive impact. If you’re a brilliant, highly ambitious, and low-ego individual, you’ll fit right in. Come and join us as we work on our next generation of models!
Responsibilities
- Design evaluations of advanced model capabilities and use them to drive rapid, high-signal iteration loops.
- Work with vendors to produce high-quality evaluation and training data.
- Build data pipelines to produce high-quality evaluation and training data.
- Build data flywheels to hill-climb on model weaknesses, using data from various surfaces where our models are deployed.
- Ensure optimal quality, quantity, and coverage of data across our post-training stages.
- Run post-training experiments and ablations to produce models that climb our evals.
- Embody our culture and values.
We’re Looking For People Who:
- Have deep experience with LLMs, either training them or applying them in production.
- Have developed production-scale data pipelines for synthesizing, curating, or processing large quantities of data.
- Can design, run, and interpret large-scale ML experiments with careful statistical and empirical reasoning.
- Possess strong generalist engineering and mathematical skills.
- Have clear written and verbal communication, and the ability to collaborate effectively with researchers, engineers, and other disciplines.
- Bonus skills: Demonstrated SOTA results in any area of large-scale training, inference, or evaluation.
Qualifications
Required skills
- Hands‑on experience with large language models, including training or applying them in production (not just prompting)
- Designing and running post‑training experiments (evals, ablations, preference tuning / RLHF‑style methods)
- Building and owning scalable data pipelines for training and evaluation data
- Strong Python skills for ML experimentation, data processing, and analysis
- Solid statistical, experimental, and general engineering fundamentals
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Principal Applied Scientist
This role is part of the Microsoft Search & Ads Network (MSAN) modeling team, focused on building large-scale machine learning systems for ads retrieval, ranking, user understanding and marketplace optimization across different surfaces. The team develops end-to-end models that predict user engagement and advertiser value – powering candidate generation, relevance scoring, and serving stack ranking that directly impact ad quality, delivery efficiency, and revenue. Responsibilities span the full modeling lifecycle, including training data and labeling strategy, feature and signal design, model development, and rigorous offline and online evaluation. Engineers and applied scientists work closely at the intersection of machine learning, economics, and large-scale systems to deliver high-performance real-time inference and robust experimentation in production.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Starting January 26, 2026, Microsoft AI (MAI) employees who live within a 50-mile commute of a designated Microsoft office in the U.S. or a 25-mile commute of a non-U.S., country-specific location are expected to work from the office at least four days per week. This expectation is subject to local law and may vary by jurisdiction.
Responsibilities
- Have a solid background in Machine Learning, Reinforcement Learning, Causal Inference, Data Science, Data Mining, or related field.
- Be passionate about artificial intelligence and optimization at web scale.
- Play a key role in driving algorithmic improvements to online and offline systems, develop and deliver robust and scalable solutions, make a direct impact on both the user and advertiser experience, and continually increase revenue for Bing ads.
Qualifications
Required Qualifications:
- Bachelor’s Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 6+ years related experience (e.g., statistics, predictive analytics, research)
- OR Master’s Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience (e.g., statistics, predictive analytics, research)
- OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research)
- OR equivalent experience.
Preferred Qualifications:
- Master’s Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 9+ years related experience (e.g., statistics, predictive analytics, research)
- OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 6+ years related experience (e.g., statistics, predictive analytics, research)
- OR equivalent experience.
- Research experience (publications) in the following areas: statistical machine learning, deep learning, data mining, causal inference, information retrieval, and Bayesian inference.
- 2+ years of experience in any of the following areas: ads retrieval and ranking system, statistical machine learning, deep learning, data mining, causal inference, information retrieval, game theory, mechanism design, optimization and Bayesian inference.
- Strong problem-solving and data analysis skills.
- Strong software design and development skills and experience.
#MicrosoftAI
Applied Sciences IC5 – The typical base pay range for this role across Canada is CAD $142,400 – CAD $257,500 per year.
Find additional pay information here:
https://careers.microsoft.com/v2/global/en/canada-pay-information.html
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Senior Software Engineer–Infra-Microsoft Copilot
As Microsoft continues to push the boundaries of AI, we are on the lookout for passionate individuals to work with us on the most interesting and challenging AI questions of our time. Our vision is bold and broad — to build systems that have true artificial intelligence across agents, applications, services, and infrastructure. It’s also inclusive: we aim to make AI accessible to all — consumers, businesses, developers — so that everyone can realize its benefits.
Our Platform Infrastructure team is responsible for building and scaling the backend platform at the core of Microsoft consumer products, the integrations with our AI models and the tools that our engineers use. We collaborate closely with cross-functional engineering, product management, and AI research, empowering all Microsoft Copilot teams to more effectively bring cutting-edge AI research to production.
We’re seeking experienced Platform Infrastructure Engineers who are passionate about AI, are deeply proficient in scaling backend technologies, and possess a mastery of templating to architect solutions that stand the test of time.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Starting January 26, 2026, Microsoft AI (MAI) employees who live within a 50-mile commute of a designated Microsoft office in the U.S. or a 25-mile commute of a non-U.S., country-specific location are expected to work from the office at least four days per week. This expectation is subject to local law and may vary by jurisdiction.
Responsibilities
- Design, develop, and maintain performant and secure AI Platform services that power Copilot.
- Work collaboratively with platform, infrastructure, application engineers, and AI researchers to build next generation AI products and services.
- Ship high-quality and maintainable code, and ensure the reliability, scalability, and performance of platform components.
- Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively.
- Enjoy working in a fast-paced, design-driven, product development cycle.
- Embody our Culture and Values.
Qualifications
Required Qualifications:
- Bachelor’s Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- 4+ years’ experience building scalable platforms on public cloud infrastructure like Azure, AWS, or GCP with extensive use of technologies like Docker, Kubernetes, nginx, RDBMS, key-value stores, etc.
- 4+ years’ experience in building and releasing production software at the platform level.
- Solid knowledge of APIs, data flows, systems, and services.
Preferred Qualifications:
- Master’s Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR Bachelor’s Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- Experience managing high scale, multi-region, production environments on Kubernetes in cloud environments
- Ability to identify, analyze, and resolve complex technical issues, ensuring optimal performance, scalability, and user experience.
- Dedication to writing clean, maintainable, and well-documented code with a focus on reliability, security and ease of use.
- Demonstrated interpersonal skills and ability to work closely with cross-functional teams, including product managers, and other engineers.
- Ability to clearly communicate complex technical concepts to both technical and non-technical stakeholders.
- Ability to work in a fast-paced environment, manage multiple priorities, and adapt to changing requirements and deadlines.
- Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in web development and AI.
- Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team.
#MicrosoftAI
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Senior Software Engineer–Infra-Microsoft Copilot
As Microsoft continues to push the boundaries of AI, we are on the lookout for passionate individuals to work with us on the most interesting and challenging AI questions of our time. Our vision is bold and broad — to build systems that have true artificial intelligence across agents, applications, services, and infrastructure. It’s also inclusive: we aim to make AI accessible to all — consumers, businesses, developers — so that everyone can realize its benefits.
Our Platform Infrastructure team is responsible for building and scaling the backend platform at the core of Microsoft consumer products, the integrations with our AI models and the tools that our engineers use. We collaborate closely with cross-functional engineering, product management, and AI research, empowering all Microsoft Copilot teams to more effectively bring cutting-edge AI research to production.
We’re seeking experienced Platform Infrastructure Engineers who are passionate about AI, are deeply proficient in scaling backend technologies, and possess a mastery of templating to architect solutions that stand the test of time.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Starting January 26, 2026, Microsoft AI (MAI) employees who live within a 50-mile commute of a designated Microsoft office in the U.S. or 25-mile commute of a non-U.S., country-specific location are expected to work from the office at least four days per week. This expectation is subject to local law and may vary by jurisdiction.
Responsibilities
- Design, develop, and maintain performant and secure AI Platform services that power Copilot.
- Work collaboratively with platform, infrastructure, application engineers, and AI researchers to build next generation AI products and services.
- Ship high-quality and maintainable code, and ensure the reliability, scalability, and performance of platform components.
- Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively.
- Enjoy working in a fast-paced, design-driven, product development cycle.
- Embody our Culture and Values.
Qualifications
Required Qualifications:
- Bachelor’s Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- 4+ years’ experience building scalable platforms on public cloud infrastructure like Azure, AWS, or GCP with extensive use of technologies like Docker, Kubernetes, nginx, RDBMS, key-value stores, etc.
- 4+ years’ experience in building and releasing production software at the platform level.
- Solid knowledge of APIs, data flows, systems, and services.
Preferred Qualifications:
- Master’s Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR Bachelor’s Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- Experience managing high scale, multi-region, production environments on Kubernetes in cloud environments
- Ability to identify, analyze, and resolve complex technical issues, ensuring optimal performance, scalability, and user experience.
- Dedication to writing clean, maintainable, and well-documented code with a focus on reliability, security and ease of use.
- Demonstrated interpersonal skills and ability to work closely with cross-functional teams, including product managers, and other engineers.
- Ability to clearly communicate complex technical concepts to both technical and non-technical stakeholders.
- Ability to work in a fast-paced environment, manage multiple priorities, and adapt to changing requirements and deadlines.
- Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in web development and AI.
- Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team.
#MicrosoftAI
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Principal Applied Science Manager
Job Title: Principal Applied Science Manager
Visit us at https://aka.ms/STCI
- Are you excited about working on web services that handle traffic at web scale across ~100 languages and ~200 regions?
- Are you interested in researching and shipping state-of-the-art DL-based NLP models?
- Are you passionate about tackling this fundamental problem at web scale which impacts millions of users?
If you answered yes to any of these questions, then Microsoft Bing’s RAI Defensives team could be the right fit for you. The team is focused on keeping Bing safe for our customers by detecting queries and content that require due diligence for the best user experience, all using state-of-the-art deep learning models. We are looking for a talented, energetic, creative, and passionate software engineering/applied science manager with experience in building and shipping high-quality software solutions for cutting-edge, web-scale problems.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Starting January 26, 2026, Microsoft AI (MAI) employees who live within a 50-mile commute of a designated Microsoft office in the U.S. or 25-mile commute of a non-U.S., country-specific location are expected to work from the office at least four days per week. This expectation is subject to local law and may vary by jurisdiction.
Responsibilities
- The Principal Applied Science Manager role involves building and managing a team of high-caliber software engineers and applied researchers.
- The manager will ensure product and development excellence, support the career development of the team, and establish a strong culture of trust, collaboration, and inclusion.
- Additionally, this role requires working closely with various organizations in Microsoft AI, including Copilot and Edge. A suitable candidate is expected to have experience delivering reliable and scalable distributed services, with a good understanding of data science fundamentals.
Qualifications
Required Qualifications:
- Bachelor’s Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 6+ years related experience (e.g., statistics, predictive analytics, research).
- OR Master’s Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience (e.g., statistics, predictive analytics, research).
- OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research).
- OR equivalent experience.
- 1+ year(s) of people management experience.
- 6+ years of software engineering or applied researcher experience or equivalent.
- 5+ years of experience leading high performing teams.
- 5+ years of technical leadership experience, planning, designing, implementing, and delivering large projects spanning multiple engineers as the primary owner or equivalent.
- A demonstrated track record of excellent communication and collaboration skills.
- Experience developing, debugging, and maintaining code.
- Experience leading a team to build, train, and run inference on large-scale DL models.
- Demonstrated organizational, problem solving and prioritization skills.
Preferred Qualifications:
- Master’s Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 9+ years related experience (e.g., statistics, predictive analytics, research).
- OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 6+ years related experience (e.g., statistics, predictive analytics, research).
- OR equivalent experience.
- 5+ years of people management experience.
- 5+ years of experience creating publications (e.g., patents, libraries, peer-reviewed academic papers).
- 2+ years of experience presenting at conferences or other events in the outside research/industry community as an invited speaker.
- 5+ years of experience conducting research as part of a research program (in academic or industry settings).
- 3+ years of experience developing and deploying live production systems, as part of a product team.
- 3+ years of experience developing and deploying products or systems at multiple points in the product cycle from ideation to shipping.
#MicrosoftAI
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Member of Technical Staff, AI Systems Engineer – Microsoft Superintelligence
We are building next-generation customized AI silicon designed to accelerate AI workloads with unprecedented efficiency. We are looking for an exceptional Systems Engineer to bridge the gap between our custom hardware and modern AI inference frameworks.
We build foundational AI infrastructure that enables large-scale training and inference across diverse workloads and rapidly evolving hardware generations. Our work directly shapes how AI systems are designed, deployed, and scaled today and into the future. Engineers on this team operate with end-to-end ownership, deep technical rigor, and a strong bias toward real-world impact.
The Microsoft Superintelligence team’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
This role is part of Microsoft AI’s Superintelligence Team. The MAIST is a startup-like team inside Microsoft AI, created to push the boundaries of AI toward Humanist Superintelligence: ultra-capable systems that remain controllable, safety-aligned, and anchored to human values. Our mission is to create AI that amplifies human potential while ensuring humanity remains firmly in control. We aim to deliver breakthroughs that benefit society by advancing science, education, and global well-being.
We’re also fortunate to partner with incredible product teams, giving our models the chance to reach billions of users and create immense positive impact. If you’re a brilliant, highly ambitious, and low-ego individual, you’ll fit right in. Come and join us as we work on our next generation of models!
The Role
As a Senior AI Systems Engineer, you will own the software integration layer between our custom AI chip’s proprietary SDK and SGLang, a state-of-the-art serving framework for Large Language Models (LLMs) and Vision-Language Models. You will be responsible for ensuring that our silicon can seamlessly run SGLang inference workloads at peak performance, bypassing the traditional CUDA ecosystem entirely.
Responsibilities
- Framework Integration: Architect and develop the backend integration to make our custom AI chip a first-class citizen in SGLang.
- Custom Operator Development: Write custom C++ / PyTorch extensions that map SGLang’s primitive operations (e.g., RadixAttention, FlashAttention, matrix multiplications) to our custom chip’s proprietary software layer.
- Performance Optimization: Profile and optimize end-to-end LLM inference latency, throughput, and memory utilization (Paged Attention) on our hardware.
- Cross-Functional Collaboration: Work closely with our hardware architecture and compiler teams to provide feedback on our custom software stack and silicon design based on framework-level bottlenecks.
- Testing & Deployment: Build robust testing pipelines to validate model accuracy and performance parity against standard GPU baselines.
Qualifications
- BS, MS, or PhD in Computer Science, Computer Engineering, or a related field.
- Software engineering experience focusing on systems programming, ML infrastructure, or AI compilers.
- Expertise in Python: Deep understanding of memory management and concurrent programming.
- Experience with LLM Inference Engines: Hands-on experience modifying or extending frameworks like SGLang, vLLM, DeepSpeed-FastGen, or TensorRT-LLM.
- PyTorch Internals: Strong experience writing PyTorch C++ extensions and custom operators.
- Hardware Interfacing: Proven track record of integrating machine learning workloads with hardware accelerators (GPUs, TPUs, NPUs) using custom SDKs, APIs, or low-level drivers.
- Prior experience working on non-CUDA software ecosystems (e.g., AMD ROCm, AWS Neuron, Google XLA).
- Familiarity with AI compilers and intermediate representations (MLIR, Apache TVM, OpenAI Triton).
- Strong understanding of underlying LLM architectures (Transformers, MoE) and state-of-the-art attention algorithms (FlashAttention v2/v3).
- Previous experience at an AI silicon startup or working on custom accelerators (e.g., Google TPU, AWS Trainium).
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Principal Software Engineering–Backend–Microsoft Copilot
As Microsoft continues to push the boundaries of AI, we are on the lookout for passionate individuals to work with us on the most interesting and challenging AI questions of our time. Our vision is bold and broad — to build systems that have true artificial intelligence across agents, applications, services, and infrastructure. It’s also inclusive: we aim to make AI accessible to all — consumers, businesses, developers — so that everyone can realize its benefits.
Our Platform team is responsible for building and scaling the backend platform at the core of Microsoft consumer products, the integrations with our AI models and the tools that our engineers use. We collaborate closely with cross-functional engineering, product management, and AI research, empowering all Microsoft Copilot teams to more effectively bring cutting-edge AI research to production.
We’re seeking experienced Platform Infrastructure Engineers who are passionate about AI, are deeply proficient in scaling backend technologies, and possess a mastery of templating to architect solutions that stand the test of time.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Starting January 26, 2026, Microsoft AI (MAI) employees who live within a 50-mile commute of a designated Microsoft office in the U.S. or 25-mile commute of a non-U.S., country-specific location are expected to work from the office at least four days per week. This expectation is subject to local law and may vary by jurisdiction.
Responsibilities
- Design, develop, and maintain performant and secure AI Platform services that power Copilot.
- Work collaboratively with platform, infrastructure, application engineers, and AI researchers to build next generation AI products and services.
- Ship high-quality and maintainable code, and ensure the reliability, scalability, and performance of platform components.
- Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively.
- Enjoy working in a fast-paced, design-driven, product development cycle.
- Embody our Culture and Values.
Qualifications
Required Qualifications:
- Bachelor’s Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- 4+ years’ experience building scalable platforms on public cloud infrastructure like Azure, AWS, or GCP with extensive use of technologies like Docker, Kubernetes, nginx, RDBMS, key-value stores, etc.
- 6+ years’ experience in building and releasing production software at the platform level.
- Solid knowledge of APIs, data flows, systems, and services.
Preferred Qualifications:
- Master’s Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR Bachelor’s Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- Solid experience in designing and building scalable backend services, APIs, and distributed systems in cloud environments.
- Solid understanding of system design principles, including data modeling, caching strategies, and service-to-service communication (e.g., REST, gRPC).
- Ability to troubleshoot and resolve complex backend issues across multiple services, with a focus on reliability, performance, and scalability.
- Proficiency in writing clean, maintainable, and testable code, with a solid emphasis on code quality, observability, and security best practices.
- Experience working with databases (SQL/NoSQL), and a deep understanding of data consistency, indexing, and query optimization.
- Demonstrated ability to collaborate effectively with cross-functional teams, including frontend engineers, product managers, and infrastructure teams.
- Solid communication skills, with the ability to explain backend architectures and technical trade-offs to both technical and non-technical stakeholders.
- Ability to operate in a fast-paced environment, handle ambiguity, and manage multiple priorities with a solid sense of ownership.
- Passion for backend technologies and distributed systems, with a continuous learning mindset toward new frameworks, architectures, and AI-driven backend capabilities.
- Proven track record of contributing to team growth through code reviews, technical mentorship, and knowledge sharing.
#MicrosoftAI
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.