Member of Technical Staff, Data Research Engineer – MAI Superintelligence Team
Member of Technical Staff, Data Research Engineer – MAI Superintelligence Team
- Location
- Job Number
- City
- Team
- Country
- Discipline
Responsibilities
- Create high-quality datasets for training and evaluation; run experiments on new datasets (data ablations) to assess their impact and determine the most effective data
- Develop and maintain scalable data pipelines for multimodal ingestion, pre-processing, filtering, and annotation
- Analyse real-world multimodal datasets to assess quality, diversity, relevance, and identify areas for improvement
- Build lightweight tools and workflows for dataset auditing, visualization, and versioning
- Collaborate with Safety, Ethics, and Governance teams to ensure datasets meet standards for quality, privacy, and responsible AI practices
Qualifications
- Bachelor’s Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or a related technical field AND technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)
- OR equivalent experience
- Experience in data analysis or data engineering
- Proficiency in statistics and exploratory data analysis methods
- Ability to communicate technical findings effectively to research and product teams
- Master’s Degree in Computer Science or related technical field AND technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)
- Familiarity with data processing frameworks such as Spark, Ray, Apache Beam
- Experience working with large-scale, real-world datasets that are unstructured or semi-structured
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Similar jobs
Sr Account Executive(Advertising)
Principal Software Engineer
Member of Technical Staff, AI Product, Android Engineer
Member of Technical Staff, Hardware Health – MAI Superintelligence Team
Member of Technical Staff, Hardware Health – MAI Superintelligence Team
- Location
- Job Number
- City
- Team
- Country
- Discipline
Microsoft AI operates one of the world’s most advanced AI training infrastructures, featuring multi-gigawatt clusters spanning tens of thousands of high-performance GPUs, ultra-low-latency NVLink/NVSwitch networks, and innovative liquid-cooling systems. Our team is seeking a Member of Technical Staff, Hardware Health, to ensure these systems deliver sustained reliability, performance, and availability across exascale-class deployments.
We work closely with research, hardware, datacenter, and platform engineering teams to develop predictive health models, failure detection frameworks, and autonomous remediation systems that keep our AI clusters operating at frontier scale.
Our newly formed organization, Microsoft AI, is dedicated to advancing Copilot and other consumer AI products and research. The team is responsible for Copilot, Bing, Edge, and generative AI research. Join us and help shape the future of personal computing.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees, we embrace a growth mindset, innovate to empower others, and collaborate to achieve shared goals. Every day, we build on our values of respect, integrity, and accountability to foster a culture of inclusion where everyone can thrive at work and beyond.
Microsoft Superintelligence Team
Microsoft Superintelligence team’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.This role is part of Microsoft AI’s Superintelligence Team. The MAIST is a startup-like team inside Microsoft AI, created to push the boundaries of AI toward Humanist Superintelligence—ultra-capable systems that remain controllable, safety-aligned, and anchored to human values. Our mission is to create AI that amplifies human potential while ensuring humanity remains firmly in control. We aim to deliver breakthroughs that benefit society—advancing science, education, and global well-being.We’re also fortunate to partner with incredible product teams giving our models the chance to reach billions of users and create immense positive impact. If you’re a brilliant, highly-ambitious and low ego individual, you’ll fit right in—come and join us as we work on our next generation of models!
Starting January 26, 2026, MAI employees are expected to work from a designated Microsoft office at least four days a week if they live within 50 miles (U.S.) or 25 miles (non-U.S., country-specific) of that location. This expectation is subject to local law and may vary by jurisdiction.
Responsibilities
Advanced ROCE transport design, congestion control, ECN/WRED/DCTCP tuning
Fabric architecture, topology planning, network modeling, and scaling strategy
Telemetry, observability, reliability engineering, and automated troubleshooting
Develop and tune the deployment of novel routing techniques to achieve reliability in large networks
Work with world class network designers like NVIDIA, Broadcom, and in-house silicon/network co-design teams
AI training + inference cluster bring-up, performance benchmarking, and root-cause analysis
Gather data and insights to develop the pretraining compute roadmap
Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
Enjoy working in a fast-paced, design-driven, product development cycle
Embody our Culture and Values
Qualifications
Required Minimum Qualifications
- Bachelor’s Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- Master’s Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor’s Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Similar jobs
Sr Account Executive(Advertising)
Principal Software Engineer
Member of Technical Staff, AI Product, Android Engineer
Member of Technical Staff, AI Networking – MAI Superintelligence Team
Member of Technical Staff, AI Networking – MAI Superintelligence Team
- Location
- Job Number
- City
- Team
- Country
- Discipline
Microsoft AI is hiring a Member of Technical Staff, AI Networking to design and scale the world’s most advanced high-performance networks powering Copilot and next-generation AI systems. Join the team building the fabric that connects frontier-class datacenters, enables multi-gigawatt AI supercomputers, and supports the training of the most sophisticated AI models on the planet.
In our efforts to build these models to develop novel responsible and efficient artificial general intelligence, large compute-capacity is required, and as an AI Networking Engineer, you’ll shape the end-to-end networking architecture, link-layer to fabric-wide systems for hyperscale AI training clusters. design, bring up, and scale the distributed Ethernet and InfiniBand fabrics that connect hundreds of thousands of GPUs across multi-megawatt data halls. You’ll benchmark, profile, debug and tune the training and inference of AI workloads running in the production clusters. You’ll engineer ultra-low-latency ROCE networks, design congestion-free transport mechanisms, optimize lossless fabrics at 10k–100k+ GPU scale, and partner deeply across Azure, Microsoft AI, and datacenter teams to turn cutting-edge ideas into running global infrastructure. If you want to build networking systems that push physics, silicon, and software to the limit and directly accelerate Microsoft’s frontier AI models, this is the most exciting seat in the industry.
Microsoft Superintelligence Team
Microsoft Superintelligence team’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.This role is part of Microsoft AI’s Superintelligence Team. The MAIST is a startup-like team inside Microsoft AI, created to push the boundaries of AI toward Humanist Superintelligence—ultra-capable systems that remain controllable, safety-aligned, and anchored to human values. Our mission is to create AI that amplifies human potential while ensuring humanity remains firmly in control. We aim to deliver breakthroughs that benefit society—advancing science, education, and global well-being.We’re also fortunate to partner with incredible product teams giving our models the chance to reach billions of users and create immense positive impact. If you’re a brilliant, highly-ambitious and low ego individual, you’ll fit right in—come and join us as we work on our next generation of models!
Starting January 26, 2026, MAI employees are expected to work from a designated Microsoft office at least four days a week if they live within 50 miles (U.S.) or 25 miles (non-U.S., country-specific) of that location. This expectation is subject to local law and may vary by jurisdiction.
Responsibilities
Advanced ROCE transport design, congestion control, ECN/WRED/DCTCP tuning
Fabric architecture, topology planning, network modeling, and scaling strategy
Telemetry, observability, reliability engineering, and automated troubleshooting
Develop and tune the deployment of novel routing techniques to achieve reliability in large networks
Work with world class network designers like NVIDIA, Broadcom, and in-house silicon/network co-design teams
AI training + inference cluster bring-up, performance benchmarking, and root-cause analysis
Gather data and insights to develop the pretraining compute roadmap
Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
Enjoy working in a fast-paced, design-driven, product development cycle
Embody our Culture and Values
Qualifications
Required Minimum Qualifications
- Bachelor’s Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- Master’s Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor’s Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Similar jobs
Sr Account Executive(Advertising)
Principal Software Engineer
Member of Technical Staff, AI Product, Android Engineer
Member of Technical Staff, Compute Orchestration & Scheduling – MAI Superintelligence Team
Member of Technical Staff, Compute Orchestration & Scheduling – MAI Superintelligence Team
- Location
- Job Number
- City
- Team
- Country
- Discipline
Microsoft AI is looking for a Member of Technical Staff, Compute Orchestration & Scheduling to help build the next wave of capabilities of our personalized AI assistant, Copilot. We’re looking for someone who will bring an abundance of positive energy, empathy, and kindness to the team every day, in addition to being highly effective. The right candidate enjoys building world-class consumer experiences and products in a fast-paced environment. You will actively contribute to the development of AI models that are powering our innovative products. You will wear multiple hats and work on engineering, research, and everything in between. Your contributions will span model architecture, data curation, training and inference infrastructures, evaluation protocols, alignment and reinforcement learning from human feedback (RLHF), and many other exciting topics at the cutting edge of AI.
Microsoft AI is building foundational models to develop novel responsible and efficient artificial general intelligence. The foundational models require large compute-capacity, and as a Member of Technical Staff, Compute Orchestration & Scheduling you would be responsible for designing and building our compute orchestration and scheduling layer on top of Kubernetes and Ray, working on everything from workload placement and scaling to reliability and developer experience. You’ll work closely with research and framework teams to turn their requirements into scalable abstractions, improve cluster efficiency, and ensure our compute platform is observable, and easy to operate in production. As a contributing member of the core group of engineers, you would also bring to the table best practices driving architectural changes and influence roadmap of relevant software and hardware components. Your work will directly impact the business goals of a wide range of users and facilitate the next wave of growth and innovation in AI.
Microsoft Superintelligence Team
Microsoft Superintelligence team’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.This role is part of Microsoft AI’s Superintelligence Team. The MAIST is a startup-like team inside Microsoft AI, created to push the boundaries of AI toward Humanist Superintelligence—ultra-capable systems that remain controllable, safety-aligned, and anchored to human values. Our mission is to create AI that amplifies human potential while ensuring humanity remains firmly in control. We aim to deliver breakthroughs that benefit society—advancing science, education, and global well-being.We’re also fortunate to partner with incredible product teams giving our models the chance to reach billions of users and create immense positive impact. If you’re a brilliant, highly-ambitious and low ego individual, you’ll fit right in—come and join us as we work on our next generation of models!
Starting January 26, 2026, MAI employees are expected to work from a designated Microsoft office at least four days a week if they live within 50 miles (U.S.) or 25 miles (non-U.S., country-specific) of that location. This expectation is subject to local law and may vary by jurisdiction.
Responsibilities
Develop and tune the pretraining scalable software for Nvidia GB200 72NVL CX8 and AMD MIxxx architectures
Benchmark GB200 and AMD MIxxx GPU clusters
Gather data and insights to develop the pretraining compute roadmap
Care deeply about conversational AI and its deployment
Actively contribute to the development of AI models that are powering our innovative products
Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
Enjoy working in a fast-paced, design-driven, product development cycle
Embody our Culture and Values
Qualifications
Required Minimum Qualifications
- Bachelor’s Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- Master’s Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor’s Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Similar jobs
Sr Account Executive(Advertising)
Principal Software Engineer
Member of Technical Staff, AI Product, Android Engineer
Member of Technical Staff, Software Engineer – MAI SuperIntelligence team
Member of Technical Staff, Software Engineer – MAI SuperIntelligence team
- Location
- Job Number
- City
- Team
- Country
- Discipline
Responsibilities
- Design and build core platform services for scalable training and evaluation, including cluster orchestration, job scheduling, data and compute pipelines, and artifact management.
- Standardize containerized workflows by maintaining Docker images, CI/CD, and runtime configurations; advocate for best practices in security, reproducibility, and cost efficiency.
- Implement end-to-end observability and operations through metrics, tracing, logging, dashboard development, monitoring, and automated alerts for model training and platform health (using Prometheus, Grafana, OpenTelemetry).
- Architect and operate services on Azure cloud platforms, managing infrastructure-as-code (Terraform/Helm), secrets, networking, and storage.
- Enhance developer experience by creating tools, CLIs, and portals that simplify job submission, metrics analysis, and experiment management for generalist software engineering and research teams.
- Enforce security and compliance policies for data access, container hardening, and supply-chain integrity, and partner with security and privacy teams to maintain robust practices in multi-tenant environments and secret management.
- Collaborate cross-functionally with data, model, and product teams to align infrastructure roadmaps with training needs, evaluation protocols, and Copilot product goals.
Qualifications
Required skills
- Strong software engineering background building reliable, scalable production systems (Python preferred)
- Hands‑on experience supporting large‑scale ML / LLM training, evaluation, or experimentation infrastructure
- Operating GPU‑heavy workloads in cloud environments using Docker and Kubernetes (scheduling, utilization, isolation)
- Designing and running data / compute pipelines and orchestration (e.g., Airflow, Argo) with object storage (Azure Blob / S3)
- Platform reliability and operability: observability, metrics, logging, tracing, alerting (Prometheus, Grafana, OpenTelemetry)
Desired skills
- Building secure, reproducible platforms using CI/CD, infrastructure‑as‑code (Terraform, Helm), container security, and secrets management
- Experience working closely with AI researchers in fast‑moving, experimental, frontier‑scale research environments and building internal tools (CLIs, portals, APIs) to boost productivity
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Similar jobs
Sr Account Executive(Advertising)
Principal Software Engineer
Member of Technical Staff, AI Product, Android Engineer
Member of Technical Staff – AI Pretraining – MAI Superintelligence Team
Member of Technical Staff – AI Pretraining – MAI Superintelligence Team
- Location
- Job Number
- City
- Team
- Country
- Discipline
- Have proven expertise in areas of interest, evidenced by an exceptional publication track record and significant technical leadership in high-impact projects
- Exhibit strong analytical skills, attention to detail, and a commitment to data-driven decision-making
- Have experience and/or in-depth understandings about large-scale distributed systems
- Demonstrate an ability to work collaboratively in a fast-paced, innovative environment
Responsibilities
- Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations
- Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack
- Collaborate closely with teams on infrastructure, data, post-training, and multimodality
- Embody our culture and values.
Qualifications
- · Bachelor’s Degree in Computer Science, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- Proven expertise in the area of pretraining
- Demonstrated experience in large-scale AI.
- Passionate about conversational AI and its deployment.
- Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers.
- Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI.
- Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team.
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Similar jobs
Sr Account Executive(Advertising)
Principal Software Engineer
Member of Technical Staff, AI Product, Android Engineer
Member of Technical Staff – AI Pretraining – MAI Superintelligence Team
Member of Technical Staff – AI Pretraining – MAI Superintelligence Team
- Location
- Job Number
- City
- Team
- Country
- Discipline
- Have proven expertise in areas of interest, evidenced by an exceptional publication track record and significant technical leadership in high-impact projects
- Exhibit strong analytical skills, attention to detail, and a commitment to data-driven decision-making
- Have experience and/or in-depth understandings about large-scale distributed systems
- Demonstrate an ability to work collaboratively in a fast-paced, innovative environment
Responsibilities
- Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations
- Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack
- Collaborate closely with teams on infrastructure, data, post-training, and multimodality
- Embody our culture and values.
Qualifications
- · Bachelor’s Degree in Computer Science, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- Proven expertise in the area of pretraining
- Demonstrated experience in large-scale AI.
- Passionate about conversational AI and its deployment.
- Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers.
- Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI.
- Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team.
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Similar jobs
Sr Account Executive(Advertising)
Principal Software Engineer
Member of Technical Staff, AI Product, Android Engineer
Member of Technical Staff -Member of Technical Staff – Pretraining Text Data
Member of Technical Staff -Member of Technical Staff – Pretraining Text Data
- Location
- Job Number
- City
- Team
- Country
- Discipline
We are seeking engineers and researchers to join our Pretraining Text Data team, where we are building the next generation of foundation large language models. If you are passionate about designing and curating high-quality datasets to power frontier AI models, this role is for you.
In this role, you’ll work at the intersection of data and innovation—collaborating with scientists, engineers, and annotators to curate, analyze, and evaluate diverse text datasets critical to model development. You will lead efforts to:
Develop novel data collection strategies
Improve dataset quality and integrity
Understand data-driven model behaviors
Train models to understand the impact of data and data mixes
Align datasets with ethical and societal values
This is a cross-disciplinary, high-impact role ideal for engineers and researchers who want to push the boundaries of what AI can learn from data.
Microsoft Superintelligence Team
Microsoft Superintelligence team’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
This role is part of Microsoft AI’s Superintelligence Team. The MAIST is a startup-like team inside Microsoft AI, created to push the boundaries of AI toward Humanist Superintelligence—ultra-capable systems that remain controllable, safety-aligned, and anchored to human values. Our mission is to create AI that amplifies human potential while ensuring humanity remains firmly in control. We aim to deliver breakthroughs that benefit society—advancing science, education, and global well-being.
We’re also fortunate to partner with incredible product teams giving our models the chance to reach billions of users and create immense positive impact. If you’re a brilliant, highly-ambitious and low ego individual, you’ll fit right in—come and join us as we work on our next generation of models!
Responsibilities
- Create high-quality datasets for training and evaluation; run experiments on new datasets (data ablations) to assess their impact and determine the most effective data.
- Develop and maintain scalable data pipelines for text data ingestion, preprocessing, filtering, and annotation.
- Analyze real-world text datasets to assess quality, diversity, relevance, and identify areas for improvement.
- Build lightweight tools and workflows for dataset auditing, visualization, and versioning.
- Collaborate with Safety, Ethics, and Governance teams to ensure datasets meet standards for quality, privacy, and responsible AI practices.
- Embody our culture and values.
Qualifications
Required Qualifications:
- Bachelor’s Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)
- OR equivalent experience.
Preferred Qualifications:
- Master’s Degree in in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)
- OR Bachelor’s Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND 12+ years technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)
- OR equivalent experience.
- 2+ years of experience in data analysis or data engineering, including work with large-scale datasets that are unstructured or semi-structured.
- Proficiency in statistics and exploratory data analysis methods.
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Similar jobs
Sr Account Executive(Advertising)
Principal Software Engineer
Member of Technical Staff, AI Product, Android Engineer
Member of Technical Staff -Member of Technical Staff – Pretraining Text Data
Member of Technical Staff -Member of Technical Staff – Pretraining Text Data
- Location
- Job Number
- City
- Team
- Country
- Discipline
We are seeking engineers and researchers to join our Pretraining Text Data team, where we are building the next generation of foundation large language models. If you are passionate about designing and curating high-quality datasets to power frontier AI models, this role is for you.
In this role, you’ll work at the intersection of data and innovation—collaborating with scientists, engineers, and annotators to curate, analyze, and evaluate diverse text datasets critical to model development. You will lead efforts to:
Develop novel data collection strategies
Improve dataset quality and integrity
Understand data-driven model behaviors
Train models to understand the impact of data and data mixes
Align datasets with ethical and societal values
This is a cross-disciplinary, high-impact role ideal for engineers and researchers who want to push the boundaries of what AI can learn from data.
Microsoft Superintelligence Team
Microsoft Superintelligence team’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
This role is part of Microsoft AI’s Superintelligence Team. The MAIST is a startup-like team inside Microsoft AI, created to push the boundaries of AI toward Humanist Superintelligence—ultra-capable systems that remain controllable, safety-aligned, and anchored to human values. Our mission is to create AI that amplifies human potential while ensuring humanity remains firmly in control. We aim to deliver breakthroughs that benefit society—advancing science, education, and global well-being.
We’re also fortunate to partner with incredible product teams giving our models the chance to reach billions of users and create immense positive impact. If you’re a brilliant, highly-ambitious and low ego individual, you’ll fit right in—come and join us as we work on our next generation of models!
Responsibilities
- Create high-quality datasets for training and evaluation; run experiments on new datasets (data ablations) to assess their impact and determine the most effective data.
- Develop and maintain scalable data pipelines for text data ingestion, preprocessing, filtering, and annotation.
- Analyze real-world text datasets to assess quality, diversity, relevance, and identify areas for improvement.
- Build lightweight tools and workflows for dataset auditing, visualization, and versioning.
- Collaborate with Safety, Ethics, and Governance teams to ensure datasets meet standards for quality, privacy, and responsible AI practices.
- Embody our culture and values.
Qualifications
Required Qualifications:
- Bachelor’s Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)
- OR equivalent experience.
Preferred Qualifications:
- Master’s Degree in in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)
- OR Bachelor’s Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND 12+ years technical engineering experience with coding in languages including, but not limited to, Python and common data libraries (Pandas, NumPy, etc.)
- OR equivalent experience.
- 2+ years of experience in data analysis or data engineering, including work with large-scale datasets that are unstructured or semi-structured.
- Proficiency in statistics and exploratory data analysis methods.
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Similar jobs
Sr Account Executive(Advertising)
Principal Software Engineer
Member of Technical Staff, AI Product, Android Engineer
Member of Technical Staff – Senior ML Engineer – MAI Super Intelligence Team
Member of Technical Staff – Senior ML Engineer – MAI Super Intelligence Team
- Location
- Job Number
- City
- Team
- Country
- Discipline
We are seeking a Senior Machine Learning Engineer to bridge the gap between advanced Vision-Language Model (VLM) research and high-performance production serving. Unlike standard data science and engineering roles, this position requires a dual competency: you must be capable of designing novel VLM architectures (including dataset curation and multilingual alignment) AND optimizing the inference stack (kernel optimization, distillation, and memory management) to run these models on specific hardware constraints (NVIDIA H100 and AMD MI300x).
The successful candidate will own the entire vertical slice: from reading the latest arXiv papers and improving training sets, to writing the C++/CUDA kernels that serve the final model in production.
Responsibilities
1. VLM Research & Architecture Design
Continuously evaluate and implement the latest research trends in Vision-Language Models, specifically focusing on Referring Expression Comprehension (REC), Document Understanding (Pix2Struct), and Visual Question Answering (VQA).Design and build massive-scale training and evaluation datasets, ensuring multilingual compatibility and broad visual understanding for European market requirements.Lead the model co-design process, creating architectures that are natively optimized for accelerator capabilities (compute-bound vs. memory-bound operations).
2. Advanced Inference Optimization & Serving
Architect high-throughput serving layers using SGLang and vLLM, optimizing for non-standard decoding strategies.
Implement scientific experiments to find the Pareto-optimal frontier between serving latency and generation quality.Execute Knowledge Distillation (KD), unstructured pruning, and quantization techniques to fit large-scale VLM architectures onto single-node GPU setups (specifically H100 or MI300x) without compromising model quality.
3. Systems Engineering & Kernel Development
Write and optimize custom kernels (CUDA/HIP) to accelerate serving latency, identifying bottlenecks at the operator level.
Manage the full pre-training and post-training tech stack, ensuring seamless integration between model weights and inference engines.Take ownership of landing the serving-efficient model in a production environment, ensuring reliability and scalability.
Qualifications
Mandatory Requirements (Must Have)
- Education: Master’s or PhD in Computer Science, Artificial Intelligence, or High-Performance Computing.
- Experience: Minimum 4+ years of experience in Machine Learning, with a mandatory split focus between Model Architecture and Systems Optimization.
- VLM Expertise: Proven experience building and shipping Vision-Language Models (e.g., architectures similar to CLIP, Flamingo, Pix2Struct). Must have experience creating custom evaluation sets for tasks like Document Understanding.
- Serving Stack Proficiency: Expert-level knowledge of SGLang and vLLM for optimized serving.
- Hardware Specifics: Demonstrable experience optimizing models for both NVIDIA (H100) and AMD (MI300x) accelerators.
- Optimization Techniques: Hands-on experience with Knowledge Distillation and Pruning to reduce model latency for target serving sizes.
- Production Engineering: A track record of taking complex multi-modal models from research code to a deployed, user-facing production product.
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Similar jobs
Sr Account Executive(Advertising)
Principal Software Engineer
Member of Technical Staff, AI Product, Android Engineer