CNN-Transformer on raw handwriting + Qwen3-ASR + fine-tuned UmBERTo on Italian speech. Late fusion over two binary diagnostic tasks (HC vs. Mild-AD and HC vs. MCI).
macro-F1 0.88 · sensitivity 0.91 (HC vs. Mild-AD)
PyTorchHuggingFaceQwen3-ASRUmBERTo
Re-implemented all 5 models from Pan et al. (INTERSPEECH 2021), replacing Kaldi ASR with Wav2Vec2 and WhisperX. KenLM-guided hypothesis sweeping reconstructs lattice uncertainty without raw lattice access.
85.92% accuracy · original paper 84.51%
Wav2Vec2WhisperXBERTADReSSo-2021
Systematic layer-level sweep across four SSL models (Wav2Vec2, WavLM, HuBERT, Whisper-small) to identify the most informative representations for depression detection on DAIC-WOZ and E-DAIC-WOZ.
PyTorchWav2Vec2WavLMHuBERTWhisper
Modular Python ASR library with a unified Transcriber API and pluggable driver registry. Word-level timestamps via forced alignment, multi-file batch transcription, pip-installable with optional per-backend dependencies.
PythonPyTorchQwen3-ASR
Fine-tuned BERTweet-base on TweetEval for irony detection and 4-class emotion classification. LLM-based data augmentation with DeepSeek-V3 for minority classes.
Irony F1 0.767 · Emotion F1 0.815
BERTweetHuggingFaceDeepSeek-V3TweetEval
DETR-based detection with Hungarian assignment. Cost matrix combines IoU and cosine similarity on RoI-aligned CNN features.
HOTA 25.8 · MOTA 24.4 on MOT17 test set
PyTorchDETROpenCVMOT17
Custom 8-bit SPI counter peripheral across three abstraction levels: RTL in SystemVerilog with testbench, FPGA synthesis (Yosys/nextpnr on Tang Nano 9K), and a bit-banging bare-metal C driver on Raspberry Pi 3B+. Extended with an ARM32 single-cycle processor simulation.
SystemVerilogCARM AssemblyTang Nano 9K
Interactive equirectangular 360° image and video navigator: rectilinear projection from spherical coordinates, keyboard-driven FOV/latitude/longitude control, zoom, and screenshot capture. Tkinter GUI for file selection and initial parameters.
PythonOpenCVTkinter
Pedestrian detection using HOG feature extraction and a trained SVM classifier on the WiderPerson dataset. Includes feature extraction pipeline, preprocessing comparison, and sliding-window inference.
PythonOpenCVscikit-learnHOGWiderPerson
Full-stack e-commerce app: Kotlin/Material Design 3 Android frontend (Retrofit2 + OkHttp) consuming a Django REST API backed by MySQL. Features product browsing, cart, wishlist, checkout with saved addresses and cards, order history, and reviews.
KotlinAndroidDjangoMySQLDocker