Principal Applied Scientist - Small Models in Reading
Energy Jobline is the largest and fastest-growing global energy job board and energy hub. We reach an audience of over 7 million energy professionals, advertise 400,000+ global energy and engineering jobs each month, and work with the leading energy companies worldwide.
We focus on the Oil & Gas, Renewables, Engineering, Power, and Nuclear markets as well as emerging technologies in EV, Battery, and Fusion. We are committed to ensuring that we offer the most exciting career opportunities from around the world for our jobseekers.
Job Description
Overview
Microsoft’s Applied Sciences Group is seeking a visionary and hands-on Principal Applied Scientist to lead research and development in small language models (SLMs), vision-language models (VLMs), and multimodal AI across vision and agent workloads. This role is ideal for candidates passionate about building real-world systems that unify visual and textual modalities to power next-generation user experiences across devices and platforms.
As a senior member of the team, you will drive innovation across model architecture, training, and deployment, especially for scalable autoregressive models that handle structured text and reasoning tasks. You will also play a key role in converting cutting-edge research into practical applications and experiences for users across the globe.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture where everyone can thrive at work and beyond.
Responsibilities
- Design and prototype unified token-based architectures that treat text and/or image data as sequences for coherent text or multimodal generation.
- Work on SLMs for tasks such as planning, image captioning, visual question answering, and structured text generation.
- Build scalable training pipelines for large-scale text datasets.
- Optimize deep neural networks for deployment on Neural Processing Units (NPUs), GPUs and cloud environments, maximizing efficiency and performance.
- Collaborate with cross-functional teams to integrate models into Microsoft products and services.
- Publish research in top-tier venues (NeurIPS, CVPR, ICCV, ICLR) and contribute to the scientific community.
- Mentor junior scientists and engineers, fostering a collaborative and innovative research environment.
Qualifications
Required Qualifications:
- Doctorate in Computer Vision, Machine Learning, or a related field with demonstrable experience in applied research or product development
- OR Master's degree in Computer Vision, Machine Learning, or a related field with demonstrable experience in applied research or product development
- OR Bachelor's degree in Computer Vision, Machine Learning, or a related field with demonstrable experience in applied research or product development
- Strong publication record in top-tier venues (CVPR, ICCV, ECCV, NeurIPS, ICLR, AAAI).
- Advanced Python or C++ (especially C++11 and newer) experience.
- Advanced experience in deep learning and its toolkits, in particular PyTorch or TensorFlow.
Other Requirements: The ability to meet Microsoft, customer and/or government security screening requirements is required for this role. These requirements include but are not limited to the following specialized security screenings:
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Preferred Qualifications:
- Demonstrated ability to translate research into real-world applications.
- Proficiency in Python and deep learning frameworks (e.g., PyTorch, TensorFlow, HuggingFace).
- Hands-on experience with generative models, especially diffusion and transformer-based synthesis.
- Experience building and training multimodal autoregressive models.
- Experience deploying models to production or on-device environments.
- Experience optimizing models for Neural Processing Units (NPUs) or other hardware accelerators.
- Knowledge of quantization, pruning, and efficient fine-tuning techniques.
- Experience with reinforcement learning from human feedback (RLHF) and a proven ability to design, prototype, and implement training pipelines for planning and reasoning.
- Strong collaborative skills across cross-functional teams.
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to ancestry, citizenship, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, physical or mental disability, political affiliation, protected veteran or military status, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
If you are interested in applying for this job, please press the Apply button and follow the application process. Energy Jobline wishes you the very best of luck in your next career move.