👋 I am an M.S. candidate in Artificial Intelligence at Xidian University, Xi'an, China (graduating July 2026). My research focuses on open-vocabulary object detection and multi-object tracking, with an emphasis on robustness under semantic distribution shift: the gap between what vision-language models are trained to recognize and what they encounter at deployment.

I am the first author of EnYOLO-World (under review at ELCVIA), which extends YOLO-World with PGI and FiLM-PAN (+3.1 AP over the YOLO-World baseline, 52.2 AP on COCO, 78 FPS). I am also the lead researcher on VocabDriftMOT, the first benchmark for vocabulary-drift failure modes in open-vocabulary multi-object tracking. Beyond research, I have 4 years of production software engineering experience and have shipped systems using PyTorch, Docker, GCP Cloud Run, FastAPI, and ChromaDB.

Experience

M.S. Candidate · Xidian University · 2023 – 2026
Software Engineer · KOSIGN Korea · 2020 – 2023
Software Engineer · Web Essentials · 2019 – 2020
B.A. CS&E · RUPP Cambodia · 2015 – 2019

Featured Projects

EnYOLO-World · First Author · ELCVIA · 2025
Enhanced open-vocabulary object detector that combines YOLO-World with a PGI (Programmable Gradient Information) backbone and a FiLM-augmented PAN. Achieves +3.1 AP over the YOLO-World baseline, with 52.2 AP on COCO at 78 FPS. Under review at ELCVIA (Scopus).
Object Detection · YOLO-World · PyTorch · COCO · LVIS

VocabDriftMOT · GitHub · Live · 2026
First benchmark for mid-stream vocabulary switching in open-vocabulary multi-object tracking. 510 annotated transition events across 21 MOT17 sequences and 3 detector variants. Introduces the IDF1-AT and SJS metrics. Identifies a 40× performance gap between synonym and hypernym transitions.
OV-MOT · Benchmark · PyTorch · ByteTrack · CLIP

Multimodal Visual RAG System · Live · 2026
Production image search system using natural language queries. CLIP embeddings + ChromaDB vector store + cross-encoder reranking + LLaMA 3 answer generation. Qwen2.5-VL-72B auto-describes uploaded images. Deployed on GCP Cloud Run.
RAG · CLIP · ChromaDB · FastAPI · Docker · GCP

OpenDet-Bench · Live · 2026
Production MLOps platform for open-vocabulary detection and tracking. YOLOv8 + ByteTrack + ONNX (1.54× CPU speedup) + MLflow + Docker + GCP Cloud Run. Real-time video analytics with FastAPI, Redis, WebSocket, Prometheus/Grafana monitoring.
MLOps · YOLOv8 · ONNX · MLflow · GCP · Grafana

Publications

ELCVIA · Under Review
EnYOLO-World: Enhanced Open-Vocabulary Detection with PGI and FiLM-PAN
Electronic Letters on Computer Vision and Image Analysis (ELCVIA) · Under Review · Scopus Indexed
Ly Vireak Dara et al.
First Author

IJCNN · 2025
Fine-Grained Meta-Learning with Semantic Augmentation for SAR Change Detection
IJCNN 2025 (2025 International Joint Conference on Neural Networks) · IEEE · pp. 1–8
R Wang, C Liu, L Sun, L Wang, VD Ly, C Jiao
Co-Author

JARS · 2025
Fine-grained Meta-learning with Semantic Augmentation for Synthetic Aperture Radar Change Detection
Journal of Applied Remote Sensing · Vol. 19(4), 046502 · Oct 2025 · SPIE
R Wang, C Liu, L Wang, L Sun, V Dara Ly, C Jiao
Co-Author

IGARSS · 2025
Non-Rigid Registration Based on Rotation-Invariant Feature Flow
IEEE IGARSS 2025 (IEEE International Geoscience and Remote Sensing Symposium) · 2025
X Huang, R Wang, L Sun, I Islam, LV Dara, C Jiao
Co-Author

IGARSS · 2025
An Edge-Enhanced Siamese Network for 3D Change Detection
IEEE IGARSS 2025 (IEEE International Geoscience and Remote Sensing Symposium) · 2025
J Chen, R Wang, FU Maheri, LV Dara, C Jiao, W Li
Co-Author

Research Interests

Open-Vocabulary Object Detection · Multi-Object Tracking · Vision-Language Models · Semantic Distribution Shift · YOLO Architectures · Scene Understanding

Academic Services

Reviewer

  • IEEE IGARSS 2025
  • ELCVIA Journal

Skills & Tools

  • Python, JavaScript, Java, TypeScript, SQL
  • PyTorch, TensorFlow, Scikit-learn, HuggingFace
  • YOLO (v5–v11), CLIP, ByteTrack, ONNX
  • OpenCV, TorchVision, Object Detection
  • Docker, GCP Cloud Run, MLflow, FastAPI
  • Redis, Prometheus, Grafana, Git
  • ChromaDB, FAISS, LangChain, Cross-Encoder
  • Pandas, NumPy, PostgreSQL, MongoDB

Currently Seeking

  • PhD in Computer Vision / VLMs
  • ML Engineer (Remote / EU / US)
  • Available from July 2026

Contact

  • lyvireakdara@gmail.com
  • github.com/Vireakdara
  • Phnom Penh, Cambodia