Publications
Publication Metrics: 20 peer-reviewed publications; 223 citations; H-index 6.
Hybrid Retrieval-Augmented Generation for Robust Multilingual Document Question Answering
arXiv preprint · 2025
Cross-Lingual Learning for Low-Resource Khmer Scene Text Detection and Recognition
International Conference on Document Analysis and Recognition (ICDAR) · 2025
Visual Text Generation in Khmer Language: Challenges and Trends with Diffusion Models
International Conference on Document Analysis and Recognition (ICDAR) · 2025
WildKhmerST: A Comprehensive Benchmark Dataset for Khmer Scene Text Detection and Recognition
International Conference on Document Analysis and Recognition (ICDAR) · 2025
Confidence-based Knowledge Distillation to Reduce Training Costs and Carbon Footprint for Low-Resource Neural Machine Translation
Applied Sciences · 2025
Fusion of GNN and GBDT Models for Graph and Node Classification
International Workshop on Graph-Based Representations in Pattern Recognition (GbRPR) · 2025
IDTrust: Deep Identity Document Quality Detection with Bandpass Filtering
Winter Conference on Applications of Computer Vision (WACV) Workshops · 2025
DocSum: Domain-Adaptive Pre-training for Document Abstractive Summarization
Winter Conference on Applications of Computer Vision (WACV) · 2025
GlobalDoc: A Cross-Modal Vision-Language Framework for Real-World Document Image Retrieval and Classification
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) · 2025
KhmerST: A Low-Resource Khmer Scene Text Detection and Recognition Benchmark
Asian Conference on Computer Vision (ACCV) · 2024
Multimodal Adaptive Inference for Document Image Classification with Anytime Early Exiting
International Conference on Document Analysis and Recognition (ICDAR) · 2024
LLMChain: Blockchain-based Reputation System for Sharing and Evaluating Large Language Models
IEEE 48th Annual Computers, Software, and Applications Conference (COMPSAC) · 2024
State-of-the-Art Khmer Text Recognition Using Deep Learning Models
ASEAN Conference on Emerging Technologies · 2024
VLCDoC: Vision-Language Contrastive Pre-training Model for Cross-Modal Document Classification
Pattern Recognition · 2023
Multimodal Document Understanding with Unified Vision and Language Cross-Modal Learning
PhD Thesis, Universite de La Rochelle · 2022
EAML: Ensemble Self-Attention-Based Mutual Learning Network for Document Image Classification
International Journal on Document Analysis and Recognition (IJDAR) · 2021
Cross-modal Deep Networks for Document Image Classification
IEEE International Conference on Image Processing (ICIP) · 2020
Visual and Textual Deep Feature Fusion for Document Image Classification
IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) · 2020
Face Detection in Camera Captured Images of Identity Documents Under Challenging Conditions
International Conference on Document Analysis and Recognition Workshops (ICDARW) · 2019
