New
Applied Scientist II
![]() | |
![]() United States, Texas, Irving | |
![]() 7000 State Highway 161 (Show on map) | |
![]() | |
OverviewTheMicrosoft Applied Sciences Groupincubates disruptive technologies for Microsoft's next-gen hardware products and is working on several exciting projects that will shape how computers and other devices perceive the user and the user's environment. Operating as a startup within the company, this team works closely with several research and product teams to bring compelling new experiences to the market. A lot of these experiences will be powered by large language models, natural language processing (NLP), speech, and computer vision - and as part of this team, you will have the unique opportunity to develop and implementing the ML algorithms and deep neural network (DNN) models that make magic happen!We are looking for an Applied Scientist II - Applied Science in the field of Large Language Models (LLMs) and Small Language Models (SLMs). This role emphasizes evaluation and data synthesis as the foundation for building future intelligent and agentic systems.You will design and operationalize robust evaluation metrics for multimodal embeddings, retrieval-augmented generation (RAG) pipelines, and agentic systems that combine reasoning, planning, and action. These metrics will not only measure performance but also guide the creation of new datasets, inform data synthesis and augmentation strategies, and identify opportunities for novel end-to-end solutions.Through this evaluation-driven approach, you will shape the direction of agentic architectures, new model designs, and applied research initiatives, ensuring that our systems continuously adapt, improve, and expand their capabilities. The ability to analyze multimodal data and interpret human and human-object interactions is central to Applied Science's mission of enabling seamless human-computer interaction.As part of this team, you will collaborate with a growing group of talented researchers already dedicated to this mission and leverage data and hardware resources available to only a select few. Naturally, the opportunity for you to push the state of the art in this field is huge. Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
ResponsibilitiesDesign and implement evaluation frameworks to measure the performance of multimodal embeddings, RAG pipelines, and agentic systems.Use evaluation insights to drive data synthesis, augmentation, and collection strategies that improve coverage and robustness.Build pipelines to test algorithms and models, analyze the results, and translate findings into actionable improvements.Develop metrics and benchmarks that inform system-level performance and guide the evolution of end-to-end human-computer interaction solutions.Collaborate with researchers and engineers to prototype and scale agentic systems that combine reasoning, planning, and action.Research and develop LLMs and SLMs with Python and other relevant programming languages. |