Hi, I'm Fandi Yi (ʘ‿ʘ)╯

Senior ML EngineerLLM SpecialistBuilder

I am a builder and problem solver with over 6 years' Experience in Machine learning. Skilled in multimodal LLMs, agentic frameworks, and RAG, with end-to-end expertise from model design to deployment and optimization. At RBC, I lead a team to develop LLM-driven agentic chatbot that streamline enterprise operations and projected 30% reduction in operational time for entire enterprise change management team. Outside of work, I co-authored a MODELS 2023 paper on LLM taxonomy generation and contribute to Sherpa, an open-source LLM agentic framework.

6+
Years in ML
15+
Personal Projects For Fun
20+
Tech Stack
1
Publication
Introduction

About Me

👋Hello!

I am a builder and problem solver with over 6 years' Experience in Machine learning. Skilled in multimodal LLMs, agentic frameworks, and RAG, with end-to-end expertise from model design to deployment and optimization. At RBC, I lead a team to develop LLM-driven agentic chatbot that streamline enterprise operations and projected 30% reduction in operational time for entire enterprise change management team. Outside of work, I co-authored a MODELS 2023 paper on LLM taxonomy generation and contribute to Sherpa, an open-source LLM agentic framework.

Get in Touch

fandiyi2333@gmail.com
514-550-9528
LinkedIn Profile
7 Mabelle Ave, Toronto, ON M9A 0C9

Languages

EnglishFluent
ChineseNative
FrenchBeginner

Technical Skills

Languages & Tools

Python (8+ yrs)JavaSQLJavaScriptMATLABC

Frameworks

PyTorchTensorFlowHuggingFaceLangGraphLangChainLangFuse

ML Focus

LLMRAGReinforcement LearningModel QuantizationA/B Testing

Cloud & Infra

GCP (Vertex AI, BigQuery)AzureDockerOCPKafkaJenkinsCI/CD

Data Systems

ElasticSearchNeo4jMongoDBMySQL

Specialties

Multimodal AgentsConversational AIModel DeploymentPerformance Optimization
Career Journey

Experience & Education

💼Work & Education Experience

Senior Machine Learning Engineer

RBCIntelligent Ops Team
2024/10Current
📍 Toronto, Canada
  • Won RBC employee performance award in 2025.
  • Lead a 5 members team developed and deployed the RAG based LLM chatbot in RBC by leveraging FastMCP, LangGraph, Elasticsearch (VectorDB) and self-hosted LangFuse (monitoring), which consumed by entire RBC change management team, projected 30% reduction in operational time by the end of 2026.
  • Deployed and Optimized 4 enterprise-level models. Designed Models' architecture to optimize their performance on OCP: Successfully reduced alert triage model's 80% memory consumption by refactoring script.
  • Delivered a real-time incident alerting enrichment system using Java Spring Boot, Python FastAPI and Kafka. Consumed by two business units in RBC to help them reduce incident processing time by 30%.

Machine Learning Research Assistant (Part-time)

McGill University
2022/122023/05
📍 Remote
  • Worked with Prof. Emine Sarigollu's team to explore customers' interaction for circular economy (CE) topic on Twitter.
  • Scraped 500k+ tweets and built NLP pipelines for emotion, sentiment and topic analytics using fine-tuned BERT-based classifiers using PyTorch with CUDA accelerators.

NLP & ML Engineering

CIBCData & Analytics Team
2022/012024/10
📍 Toronto, Canada
  • Developed entity linking API using scikit-learn and Spacy for daily contract analytics, which are used for matching entities in contract documents with CIBC internal suppliers' name.
  • Created a hierarchical structure for contracts' parent-child relationship, helping identify contracts' property.
  • Enabled large-scale contract documents search (>100k files) by building an automated document metadata extraction pipeline to clean, extract and update data from unstructured text files by using Python and SQLite.

ML Engineering Intern

Alibaba GroupAlibaba Brain Team
2021/102021/12
📍 Hangzhou, China
  • Worked in the Alibaba Brain team to analyze and develop a product for Objectives Key Results (OKR) project management tool.
  • Partnered with product managers and developers from other teams to Define the indicators measuring the synergy effect among different business units using the OKR data (10 million+ records) from the AlibabaCloud database.
  • Applied Neo4j to build a graph database for OKR data to visualize collaboration among departments.

Data Scientist Intern

CIBCData & Analytics Team
2021/052021/09
📍 Toronto, Canada
  • Developed an end-to-end generative system to automate supplier profile slides generation process by querying Microsoft Access databases with VBA.
  • Created a customized named entity recognition (NER) model to extract key information from contract documents.

Data Scientist Coop

Allianz SE InsuranceCapstone Project
2020/112021/05
📍 Montreal, Canada
  • Built a semi-supervised model to predict the intention of Canadian small business buying insurance products, and built an LSTM model to predict the Google Trend for insurance products to provide insights for their market team.
  • Deployed real-time ML application for social media analytics with Google Cloud Function and Google Data Studio.

🎓Education

Master of Management in Analytics

McGill University
2020/082021/12
GPA: 3.89/4.00
Entrance Scholarship
Relevant Courses:
Deep LearningDatabase Distrib. Syst.NLPLarge Language ModelA/B Testing

Bachelor of Electrical Engineering

McGill University
2016/092020/05
GPA: 3.35/4.00, Final Year: 3.93/4.00
Relevant Courses:
Algorithm DesignComputer VisionApplied Machine LearningNumerical Methods
Portfolio

Featured Projects

A collection of projects spanning ML research, open-source contributions, and production systems.

📄
2023/022023/09arXiv

MODELS Conference 2023

Published on MODELS conference: Prompting or Fine-tuning? A Comparative Study of Large Language Models for Taxonomy Construction as a co-first author.

Compared different LLMs' prediction performance with prompting and fine-tuning methods for various taxonomy datasets.

PythonPyTorchNLPFine-tuning
🤖
2024/05CurrentGitHub

LLM Framework – Sherpa

Joined as one of the contributors in Sherpa developer community.

Built search refinement and chain-of-action tools for this agentic LLM framework.

PythonLangChainLLMsAgents
📱
2025/10CurrentGitHub

Dream Journal App

Practicing my vibe coding skill by publishing a dream journal application to apple app store from scratch.

Vibe coding frontend and backend with Cursor IDE, set up database and edge functions in Supabase, Build and submit app with Expo Application Services.

React NativeExpoSupabaseTypeScript
📈
2023/01PresentDiscord

Stock Signal Bot

Created a taxonomy dataset specific to computing classification system domain.

Deployed real-time Discord bot by using Heroku for stock signal alerts using MACD/RSI analytics.

PythonDiscord.pyHerokuMACD/RSI
🏠
2021/022021/04GitHub

Advanced AI Analytics for Airbnb Hosts

Built an application powered by a polynomial regression model to help hosts to adjust their prices.

Applied AutoML with ML Flow on Databricks to choose the best performance ML model and hyperparameter tuning.

Applied Docker to containerize ML models and the application orchestrated with Kubernetes.

PythonDockerKubernetesMLflow
🔧
2019/092020/05GitHub

Continuous Testing And Validation of Jamscript

Worked in Prof. M.Maheswaran's lab to test and validate a programming language for Edge-Oriented mobile IoT.

Developed a continuous integration pipeline for an open-source programming language: Jamscript (a polyglot language that combines C and JavaScript) with Travis CI.

CJavaScriptTravis CIIoT
👤
2019/092019/12GitHub

Face Recognition and Tagging

Developed a face recognition system from scratch in a team of 5 using Python Sklearn.

Compared the face recognition performance on PCA and bag-of-words methods.

PythonSklearnPCAOpenCV
🚀
2018/102020/04McGill University

McGill Rocket Club

Aero-Structure and Propulsion Sub-Team Member.

Participated in designing different parts of the rocket model; Won Spaceport America Cup 2018 champion.

CADPropulsionAero-Structure
Beta Version

Talk to Digital Me

Ask questions about my skills, projects, and experience. ฅ(^•ﻌ•^ฅ)
Please Note: This is a simulated feature, not a real model. I made it for fun.

fandi-inference-engine v2.0
Model loaded
$Enter your query:
Try:
Let's Connect

Get In Touch

Have a project in mind or want to discuss opportunities? I'd love to hear from you.

📍
Location
7 Mabelle Ave, Toronto, ON M9A 0C9

Send a Message

FY
Fandi.
© 2025 Fandi Yi. All rights reserved.

Built withNext.jsTailwind CSSFramer Motion