Hi, I'm Fandi Yi (ʘ‿ʘ)╯

Senior ML Engineer•LLM Specialist•Builder

I am a builder and problem solver with over 6 years' Experience in Machine learning. Skilled in multimodal LLMs, agentic frameworks, and RAG, with end-to-end expertise from model design to deployment and optimization. At RBC, I lead a team to develop LLM-driven agentic chatbot that streamline enterprise operations and projected 30% reduction in operational time for entire enterprise change management team. Outside of work, I co-authored a MODELS 2023 paper on LLM taxonomy generation and contribute to Sherpa, an open-source LLM agentic framework.

Project & Research Work & Education Experience Digital Me Get In Touch

Years in ML

15+

Personal Projects For Fun

20+

Tech Stack

Publication

Scroll to explore

Introduction

About Me

👋Hello!

Get in Touch

fandiyi2333@gmail.com

514-550-9528

LinkedIn Profile

7 Mabelle Ave, Toronto, ON M9A 0C9

Languages

EnglishFluent

ChineseNative

FrenchBeginner

Technical Skills

Languages & Tools

Python (8+ yrs)JavaSQLJavaScriptMATLABC

Frameworks

PyTorchTensorFlowHuggingFaceLangGraphLangChainLangFuse

ML Focus

LLMRAGReinforcement LearningModel QuantizationA/B Testing

Cloud & Infra

GCP (Vertex AI, BigQuery)AzureDockerOCPKafkaJenkinsCI/CD

Data Systems

ElasticSearchNeo4jMongoDBMySQL

Specialties

Multimodal AgentsConversational AIModel DeploymentPerformance Optimization

Career Journey

Experience & Education

💼Work & Education Experience

Senior Machine Learning Engineer

RBC•Intelligent Ops Team

2024/10 — Current

📍 Toronto, Canada

▹Won RBC employee performance award in 2025.
▹Lead a 5 members team developed and deployed the RAG based LLM chatbot in RBC by leveraging FastMCP, LangGraph, Elasticsearch (VectorDB) and self-hosted LangFuse (monitoring), which consumed by entire RBC change management team, projected 30% reduction in operational time by the end of 2026.
▹Deployed and Optimized 4 enterprise-level models. Designed Models' architecture to optimize their performance on OCP: Successfully reduced alert triage model's 80% memory consumption by refactoring script.
▹Delivered a real-time incident alerting enrichment system using Java Spring Boot, Python FastAPI and Kafka. Consumed by two business units in RBC to help them reduce incident processing time by 30%.

Machine Learning Research Assistant (Part-time)

McGill University

2022/12 — 2023/05

📍 Remote

▹Worked with Prof. Emine Sarigollu's team to explore customers' interaction for circular economy (CE) topic on Twitter.
▹Scraped 500k+ tweets and built NLP pipelines for emotion, sentiment and topic analytics using fine-tuned BERT-based classifiers using PyTorch with CUDA accelerators.

NLP & ML Engineering

CIBC•Data & Analytics Team

2022/01 — 2024/10

📍 Toronto, Canada

▹Developed entity linking API using scikit-learn and Spacy for daily contract analytics, which are used for matching entities in contract documents with CIBC internal suppliers' name.
▹Created a hierarchical structure for contracts' parent-child relationship, helping identify contracts' property.
▹Enabled large-scale contract documents search (>100k files) by building an automated document metadata extraction pipeline to clean, extract and update data from unstructured text files by using Python and SQLite.

ML Engineering Intern

Alibaba Group•Alibaba Brain Team

2021/10 — 2021/12

📍 Hangzhou, China

▹Worked in the Alibaba Brain team to analyze and develop a product for Objectives Key Results (OKR) project management tool.
▹Partnered with product managers and developers from other teams to Define the indicators measuring the synergy effect among different business units using the OKR data (10 million+ records) from the AlibabaCloud database.
▹Applied Neo4j to build a graph database for OKR data to visualize collaboration among departments.

Data Scientist Intern

CIBC•Data & Analytics Team

2021/05 — 2021/09

📍 Toronto, Canada

▹Developed an end-to-end generative system to automate supplier profile slides generation process by querying Microsoft Access databases with VBA.
▹Created a customized named entity recognition (NER) model to extract key information from contract documents.

Data Scientist Coop

Allianz SE Insurance•Capstone Project

2020/11 — 2021/05

📍 Montreal, Canada

▹Built a semi-supervised model to predict the intention of Canadian small business buying insurance products, and built an LSTM model to predict the Google Trend for insurance products to provide insights for their market team.
▹Deployed real-time ML application for social media analytics with Google Cloud Function and Google Data Studio.

🎓Education

Master of Management in Analytics

McGill University

2020/08 — 2021/12

GPA: 3.89/4.00

⭐ Entrance Scholarship

Relevant Courses:

Deep LearningDatabase Distrib. Syst.NLPLarge Language ModelA/B Testing

Bachelor of Electrical Engineering

McGill University

2016/09 — 2020/05

GPA: 3.35/4.00, Final Year: 3.93/4.00

Relevant Courses:

Algorithm DesignComputer VisionApplied Machine LearningNumerical Methods

Portfolio

Featured Projects

A collection of projects spanning ML research, open-source contributions, and production systems.

📄

2023/02 → 2023/09arXiv

MODELS Conference 2023

Published on MODELS conference: Prompting or Fine-tuning? A Comparative Study of Large Language Models for Taxonomy Construction as a co-first author.

Compared different LLMs' prediction performance with prompting and fine-tuning methods for various taxonomy datasets.

PythonPyTorchNLPFine-tuning

🤖

2024/05 → CurrentGitHub

LLM Framework – Sherpa

Joined as one of the contributors in Sherpa developer community.

Built search refinement and chain-of-action tools for this agentic LLM framework.

PythonLangChainLLMsAgents

📱

2025/10 → CurrentGitHub

Dream Journal App

Practicing my vibe coding skill by publishing a dream journal application to apple app store from scratch.

Vibe coding frontend and backend with Cursor IDE, set up database and edge functions in Supabase, Build and submit app with Expo Application Services.

React NativeExpoSupabaseTypeScript

📈

2023/01 → PresentDiscord

Stock Signal Bot

Created a taxonomy dataset specific to computing classification system domain.

Deployed real-time Discord bot by using Heroku for stock signal alerts using MACD/RSI analytics.

PythonDiscord.pyHerokuMACD/RSI

🏠

2021/02 → 2021/04GitHub

Advanced AI Analytics for Airbnb Hosts

Built an application powered by a polynomial regression model to help hosts to adjust their prices.

Applied AutoML with ML Flow on Databricks to choose the best performance ML model and hyperparameter tuning.

Applied Docker to containerize ML models and the application orchestrated with Kubernetes.

PythonDockerKubernetesMLflow

🔧

2019/09 → 2020/05GitHub

Continuous Testing And Validation of Jamscript

Worked in Prof. M.Maheswaran's lab to test and validate a programming language for Edge-Oriented mobile IoT.

Developed a continuous integration pipeline for an open-source programming language: Jamscript (a polyglot language that combines C and JavaScript) with Travis CI.

CJavaScriptTravis CIIoT

👤

2019/09 → 2019/12GitHub

Face Recognition and Tagging

Developed a face recognition system from scratch in a team of 5 using Python Sklearn.

Compared the face recognition performance on PCA and bag-of-words methods.

PythonSklearnPCAOpenCV

🚀

2018/10 → 2020/04McGill University

McGill Rocket Club

Aero-Structure and Propulsion Sub-Team Member.

Participated in designing different parts of the rocket model; Won Spaceport America Cup 2018 champion.

CADPropulsionAero-Structure

View All on GitHub

Beta Version

Talk to Digital Me

Ask questions about my skills, projects, and experience. ฅ(^•ﻌ•^ฅ)
Please Note: This is a simulated feature, not a real model. I made it for fun.

fandi-inference-engine v2.0

Model loaded

$Enter your query:

Try:

Let's Connect

Get In Touch

Have a project in mind or want to discuss opportunities? I'd love to hear from you.

Quick Contact

fandiyi2333@gmail.com

Connect with me

GitHub

View my code

📍

Location

7 Mabelle Ave, Toronto, ON M9A 0C9

Hi, I'm Fandi Yi (ʘ‿ʘ)╯

About Me

👋Hello!

Get in Touch

Languages

Technical Skills

Languages & Tools

Frameworks

ML Focus

Cloud & Infra

Data Systems

Specialties

Experience & Education

💼Work & Education Experience

Senior Machine Learning Engineer

Machine Learning Research Assistant (Part-time)

NLP & ML Engineering

ML Engineering Intern

Data Scientist Intern

Data Scientist Coop

🎓Education

Master of Management in Analytics

Bachelor of Electrical Engineering

Featured Projects

MODELS Conference 2023

LLM Framework – Sherpa

Dream Journal App

Stock Signal Bot

Advanced AI Analytics for Airbnb Hosts

Continuous Testing And Validation of Jamscript

Face Recognition and Tagging

McGill Rocket Club

Talk to Digital Me

Get In Touch

Quick Contact

Send a Message