Ly Vireak Dara — Computer Vision Researcher

👋 I am an M.S. candidate in Artificial Intelligence at Xidian University, Xi'an, China (graduating July 2026). My research focuses on open-vocabulary object detection and multi-object tracking robustness under semantic distribution shift — the gap between what vision-language models are trained to recognize and what they encounter at deployment.

I am the first author of EnYOLO-World, which improves YOLO-World with PGI and FiLM-PAN (+3.1 AP, 52.2 AP on COCO, 78 FPS, under review at ELCVIA), and lead researcher on VocabDriftMOT, the first benchmark for vocabulary-drift failure modes in open-vocabulary multi-object tracking. Beyond research, I have 4 years of production software engineering experience and have shipped systems using PyTorch, Docker, GCP Cloud Run, FastAPI, and ChromaDB.

Experience

🎓 M.S. Candidate · Xidian University · 2023 – 2026

💻 Software Engineer · KOSIGN Korea · 2020 – 2023

🌐 Software Engineer · Web Essentials · 2019 – 2020

🏛️ B.A. CS&E · RUPP Cambodia · 2015 – 2019

Featured Projects

EnYOLO-World · First Author · ELCVIA

2025

Enhanced open-vocabulary object detection combining YOLO-World with PGI (Programmable Gradient Information) backbone and FiLM-Augmented PAN. Achieves +3.1 AP over YOLO-World baseline, 52.2 AP on COCO, 78 FPS. Under review at ELCVIA (Scopus).

Object Detection · YOLO-World · PyTorch · COCO · LVIS

📄 Paper (Under Review)
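
The FiLM idea named in the description can be illustrated with a minimal sketch, assuming FiLM refers to feature-wise linear modulation: each feature channel is scaled and shifted by values predicted from a conditioning vector. This is not the EnYOLO-World implementation. The 2-dimensional "text embedding", the tiny `linear` helper, and the hand-picked weights are all hypothetical stand-ins chosen to keep the example self-contained.

```python
from typing import List


def linear(vec: List[float], weights: List[List[float]], bias: List[float]) -> List[float]:
    """Tiny dense layer: maps a conditioning vector to one value per channel."""
    return [sum(w * v for w, v in zip(row, vec)) + b for row, b in zip(weights, bias)]


def film(features: List[List[float]], gamma: List[float], beta: List[float]) -> List[List[float]]:
    """Feature-wise linear modulation: y[c][i] = gamma[c] * x[c][i] + beta[c]."""
    if not (len(features) == len(gamma) == len(beta)):
        raise ValueError("one gamma and one beta are needed per channel")
    return [[g * v + b for v in channel] for channel, g, b in zip(features, gamma, beta)]


# Hypothetical conditioning vector (stand-in for a text embedding)
text_embedding = [1.0, 2.0]

# Toy feature map: 3 channels, 2 spatial positions each
features = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]

# Illustrative, hand-picked projection weights (a real model would learn these)
gamma = linear(text_embedding, [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]], [0.0, 0.0, 0.0])
beta = linear(text_embedding, [[0.0, 0.0], [1.0, 0.0], [0.0, -1.0]], [0.0, 0.0, 0.0])

modulated = film(features, gamma, beta)
print(modulated)  # [[1.0, 2.0], [7.0, 9.0], [5.5, 7.0]]
```

In a PyTorch model the same operation would typically be a broadcasted multiply-add over the channel dimension, with `gamma` and `beta` produced by learned layers.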

VocabDriftMOT · GitHub Live

2026

First benchmark for mid-stream vocabulary switching in open-vocabulary multi-object tracking. 510 annotated transition events across 21 MOT17 sequences and 3 detector variants. Introduces IDF1-AT and SJS metrics. Identifies 40× performance gap between synonym and hypernym transitions.

OV-MOT · Benchmark · PyTorch · ByteTrack · CLIP

📦 GitHub
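
For context on the metrics, here is a small sketch of the standard IDF1 score that identity-based tracking metrics build on. The definitions of IDF1-AT and SJS are not given on this page, so the sketch covers only plain IDF1. The counts below are made-up toy numbers, not benchmark results.

```python
def idf1(idtp: int, idfp: int, idfn: int) -> float:
    """Standard IDF1: 2*IDTP / (2*IDTP + IDFP + IDFN).

    idtp, idfp, idfn are identity true positives, false positives,
    and false negatives after the optimal identity matching.
    """
    denom = 2 * idtp + idfp + idfn
    return 2 * idtp / denom if denom else 0.0


# Toy counts for two hypothetical windows of a sequence (illustrative only)
before = idf1(idtp=80, idfp=10, idfn=30)  # 160 / 200
after = idf1(idtp=20, idfp=30, idfn=30)   # 40 / 100

print(f"before={before:.2f} after={after:.2f} drop={before - after:.2f}")
```

Comparing the score over windows on either side of an event is one simple way to quantify how much identity consistency degrades, though the benchmark's own metrics may be defined differently.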

Multimodal Visual RAG System · Live

2026

Production image search system using natural language queries. CLIP embeddings + ChromaDB vector store + cross-encoder reranking + LLaMA 3 answer generation. Qwen2.5-VL-72B auto-describes uploaded images. Deployed on GCP Cloud Run.

RAG · CLIP · ChromaDB · FastAPI · Docker · GCP

📦 GitHub 🌐 Demo
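
The retrieve-then-rerank pattern described above can be sketched in plain Python. This is a toy illustration under stated assumptions, not the deployed system: the 2-dimensional vectors stand in for CLIP embeddings, the dictionary stands in for a ChromaDB collection, and `word_overlap` is a hypothetical placeholder for a cross-encoder relevance score.

```python
import math
from typing import Callable, Dict, List


def cosine(a: List[float], b: List[float]) -> float:
    """Cosine similarity between two vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def retrieve(query_vec: List[float], index: Dict[str, List[float]], k: int) -> List[str]:
    """Stage 1: fast nearest-neighbour search over stored embeddings."""
    ranked = sorted(index, key=lambda doc_id: cosine(query_vec, index[doc_id]), reverse=True)
    return ranked[:k]


def rerank(query: str, candidates: List[str], captions: Dict[str, str],
           score_fn: Callable[[str, str], float]) -> List[str]:
    """Stage 2: re-order the short candidate list with a slower, finer scorer."""
    return sorted(candidates, key=lambda d: score_fn(query, captions[d]), reverse=True)


def word_overlap(query: str, caption: str) -> float:
    """Placeholder scorer: fraction of query words found in the caption."""
    q, c = set(query.lower().split()), set(caption.lower().split())
    return len(q & c) / len(q) if q else 0.0


# Toy index and captions (hypothetical data)
index = {"img_a": [1.0, 0.0], "img_b": [0.8, 0.6], "img_c": [0.0, 1.0]}
captions = {
    "img_a": "a dog on grass",
    "img_b": "a red car on a street",
    "img_c": "a bowl of fruit",
}

query, query_vec = "red car", [0.9, 0.1]
candidates = retrieve(query_vec, index, k=2)
ranked = rerank(query, candidates, captions, word_overlap)
print(candidates, ranked)  # ['img_a', 'img_b'] ['img_b', 'img_a']
```

The example is arranged so the reranker overturns the first-stage order, which is the main reason to pay for a second, more expensive scoring pass over only the top few candidates.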

OpenDet-Bench · Live

2026

Production MLOps platform for open-vocabulary detection and tracking. YOLOv8 + ByteTrack + ONNX (1.54× CPU speedup) + MLflow + Docker + GCP Cloud Run. Real-time video analytics with FastAPI, Redis, WebSocket, Prometheus/Grafana monitoring.

MLOps · YOLOv8 · ONNX · MLflow · GCP · Grafana

📦 GitHub 🌐 Demo

Publications

ELCVIA · Under Review

EnYOLO-World: Enhanced Open-Vocabulary Detection with PGI and FiLM-PAN

Electronic Letters on Computer Vision and Image Analysis (ELCVIA) · Under Review · Scopus Indexed

Ly Vireak Dara et al.

First Author

IJCNN 2025

Fine-Grained Meta-Learning with Semantic Augmentation for SAR Change Detection

IJCNN 2025 — 2025 International Joint Conference on Neural Networks · IEEE · pp. 1–8

R Wang, C Liu, L Sun, L Wang, VD Ly, C Jiao

Co-Author

JARS 2025

Fine-grained Meta-learning with Semantic Augmentation for Synthetic Aperture Radar Change Detection

Journal of Applied Remote Sensing · Vol. 19(4), 046502 · Oct 2025 · SPIE

R Wang, C Liu, L Wang, L Sun, V Dara Ly, C Jiao

Co-Author

IGARSS 2025

Non-Rigid Registration Based on Rotation-Invariant Feature Flow

IEEE IGARSS 2025 — IEEE International Geoscience and Remote Sensing Symposium · 2025

X Huang, R Wang, L Sun, I Islam, LV Dara, C Jiao

Co-Author

IGARSS 2025

An Edge-Enhanced Siamese Network for 3D Change Detection

IEEE IGARSS 2025 — IEEE International Geoscience and Remote Sensing Symposium · 2025

J Chen, R Wang, FU Maheri, LV Dara, C Jiao, W Li

Co-Author

Research Interests

Open-Vocabulary Object Detection · Multi-Object Tracking · Vision-Language Models · Semantic Distribution Shift · YOLO Architectures · Scene Understanding

Academic Services

Reviewer

IEEE IGARSS 2025
ELCVIA Journal

Skills & Tools

Python, JavaScript, Java, TypeScript, SQL
PyTorch, TensorFlow, Scikit-learn, HuggingFace
YOLO (v5–v11), CLIP, ByteTrack, ONNX
OpenCV, TorchVision, Object Detection
Docker, GCP Cloud Run, MLflow, FastAPI
Redis, Prometheus, Grafana, Git
ChromaDB, FAISS, LangChain, Cross-Encoder
Pandas, NumPy, PostgreSQL, MongoDB

Currently Seeking

PhD in Computer Vision / VLMs
ML Engineer (Remote / EU / US)
Available from July 2026

Contact

lyvireakdara@gmail.com
github.com/Vireakdara
Phnom Penh, Cambodia