Portfolio

Compute-Skipping Policies for Diffusion LLMs (dLLM-v2) (2026)

Two compute-skipping policies for diffusion LLMs (Fast-dLLM v2) that reuse stable hidden states across denoising steps to cut FLOPs: a layer-level cache-reuse policy and a stability-aware token-level policy that recomputes only the least-similar tokens.
Report (PDF)
Compute-skipping reuse across transformer blocks in dLLM-v2

WireTap: Breaking Server SGX via DRAM Bus Interposition (2025)

ACM CCS ‘25 (AR: 13.9%) — Distinguished Paper Award (1.39%)
wiretap.fail
WireTap logo

Efficient Attention Calculation (2024)

This project explores efficient transformer attention. I implemented a PQ‑tree‑based token subsampler to reduce complexity from O(L2) to O(L log L), and reproduced some of previously proposed attention approximation/subsampling methods.
Efficient attention (PQ-tree guided) diagram

Artwork Capability Assessment of NLP Models (2024)

We produce the first English Louvre Dataset and investigate how effectively NLP models understand art-related text by curating and annotating a cultural-heritage corpus (Van Dyke’s painting textbook, translated Louvre records, etc.), defining a BRAT schema (entities, relations, events), and fine‑tuning DistilBERT for NER. Results show strong performance and qualitative reasoning on museum-style queries.
Paper (PDF)
Cultural-heritage NLP project

Airborne Insights: Privacy Risks of Leaked Sensor Data in Smart Buildings (2023)

From time-series air measurements in classroom environments, this project extracts features and predicts occupancy and related information using machine learning techniques.Airborne Insights diagram

SGX-Step Contributor (2023)

SGX-Step is a practical attack framework for precise enclave execution control — I was fortunate enough to maintain the codebase in 2023 for newer Linux kernel patches under the supervision of Prof. Jo Van Bulck.
SGX-Step project


Some projects I worked on include: