My Projects
Here's a collection of projects I've worked on:
- Agentic Context Engineering vs. Parameter finetuning | Dec 2025 | Report — proposed a retrieval-driven “learning-at-inference” framework that embeds the GSM8K training set into a semantic graph and, on failures, retrieves the nearest solved example to generate and accumulate reusable heuristics in a persistent playbook. Implemented the agent loop on Qwen2.5-1.5B-Instruct and benchmarked against 0-/3-shot prompting and parameter-efficient training baselines (QLoRA + GRPO).
- Decision Transformer Modification | Dec 2025 | Report — implemented and extended the Decision Transformer for offline RL by swapping the CNN image encoder for a Vision Transformer to test generalization in image-based environments. Replaced the MLP state encoder with a GNN to evaluate relational state representations and compared both variants against the baseline.
- Soundscape Cartography | Dec 2024 — an interactive dashboard that applies clustering analysis to Spotify audio features, offering an alternative to traditional genre classifications by enabling users to compare songs based on characteristics such as danceability and speechiness.
- LLM Cantonese Response Comparison (with Winnie Yeung) | Nov 2024 — a Streamlit application that explores the gap between general-purpose large language models and specialized fine-tuned models in processing Cantonese content. Presented at PyCon Hong Kong 2024.