Data Scientist / Applied Scientist (Mid and Senior Levels) We are hiring multiple Data Scientists and Applied Scientists across LATAM to join the Microsoft 365 team. These are remote positions, allowing you to work from the comfort of your home. As part of our team, you will help shape one of Microsoft's fastest-growing cloud services. Your work will directly impact Copilot, enabling personalized, context-aware experiences that empower millions of users across Microsoft 365. You will work at the forefront of applied AI and machine learning, shaping the future of intelligent productivity tools. You will contribute to a culture of innovation, continuous learning, and experimentation. About The Team The Substrate Core Substrate team powers the infrastructure that underpins Microsoft 365's most critical services and Copilot. We are a high-impact, forward-looking team focused on building intelligent, scalable, and cost-efficient platforms that enable Microsoft to deliver world-class productivity experiences to billions of users. Your core mission will be to deliver exciting innovation in Microsoft Copilot. Microsoft Copilot is revolutionizing how people work and has created an unprecedented opportunity to advance the state-of-the-art in a way that benefits millions of people. More About Your Responsibilities We are seeking candidates with research skills and the desire to pursue the cutting edge in model development that pushes technological boundaries. We are looking for candidates with interest and experience in language model training, large language model evaluation, and quality assessment. - Develop and evaluate ML models using prepared datasets, customer feedback, and novel training/fine-tuning algorithms for language models. - Write production-quality code, apply debugging best practices, and stay current with industry trends. - Drive customer-centric solutions by aligning with business goals and managing stakeholder expectations. - Collaborate cross-functionally to define success metrics and improve AI quality at scale. - Lead research projects that yield new algorithms, tools, or insights solving open problems. - Analyze evaluation outputs to identify gaps in coverage, quality, and usability. - Design experiments, define metrics, and develop ML pipelines for encoder-decoder and cross-encoder models, semantic search, and user intent understanding. - Work with large-scale data while championing privacy and compliance. - Engage with customers and internal teams to identify pain points and drive impactful improvements. Qualifications Main qualifications: - Bachelor's degree in Statistics, Econometrics, Computer Science, Electrical/Computer Engineering, or related field. - 4+ years of experience in predictive analytics, statistics, or research. - Experience with synthetic data generation and data management for evaluation/training. - At least one year of experience publishing patents or peer-reviewed papers. - Deep motivation for user-centric AI and interest in human cognition, memory, and AI. Preferred Qualifications Main preferred qualifications: - Master's degree in Statistics, Econometrics, Computer Science, Electrical/Computer Engineering, or related field. - Experience with large-scale embedding models and transformer architectures. - Familiarity with reinforcement learning and distributed computing platforms (e.g., Heron, AML, Euclid). - Proficient analytical skills and experience with telemetry and performance metrics. - DevOps experience and cloud services knowledge (Azure preferred). - Agile development experience and a structured approach to software design.