Sitemap

A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.

Pages

Posts

Future Blog Post

less than 1 minute read

Published:

This post will show up by default. To disable scheduling of future posts, edit config.yml and set future: false.

Blog Post number 4

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 3

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 2

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 1

less than 1 minute read

Published:

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

portfolio

publications

Visual and Textual Deep Feature Fusion for Document Image Classification

Published in IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2020

Fusion of visual and textual deep features for document image classification.

Recommended citation: M. Farhan, N. I. Kajla, M. D. A. Awan, M. M. Luqman, M. Coustaty, S. Bakkali. "Visual and Textual Deep Feature Fusion for Document Image Classification." CVPRW 2020, pp. 562-563. https://openaccess.thecvf.com/content_CVPRW_2020/papers/w34/Bakkali_Visual_and_Textual_Deep_Feature_Fusion_for_Document_Image_Classification_CVPRW_2020_paper.pdf

LLMChain: Blockchain-based Reputation System for Sharing and Evaluating Large Language Models

Published in IEEE 48th Annual Computers, Software, and Applications Conference (COMPSAC), 2024

A blockchain-based reputation mechanism for sharing and evaluating large language models.

Recommended citation: M. A. Bouchiha, Q. Telnoff, S. Bakkali, R. Champagnat, M. Rabah, M. Coustaty, Y. Ghamri-Doudane. "LLMChain: Blockchain-based Reputation System for Sharing and Evaluating Large Language Models." COMPSAC 2024, pp. 439-448. https://arxiv.org/pdf/2404.13236

KhmerST: A Low-Resource Khmer Scene Text Detection and Recognition Benchmark

Published in Asian Conference on Computer Vision (ACCV), 2024

A benchmark for Khmer scene-text detection and recognition.

Recommended citation: V. Nom, S. Bakkali, M. M. Luqman, M. Coustaty, J-M. Ogier. "KhmerST: A Low-Resource Khmer Scene Text Detection and Recognition Benchmark." ACCV 2024, pp. 1777-1792. https://openaccess.thecvf.com/content/ACCV2024/papers/Nom_KhmerST_A_Low-Resource_Khmer_Scene_Text_Detection_and_Recognition_Benchmark_ACCV_2024_paper.pdf

GlobalDoc: A Cross-Modal Vision-Language Framework for Real-World Document Image Retrieval and Classification

Published in IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025

A cross-modal framework for robust document image retrieval in real-world settings.

Recommended citation: S. Bakkali, S. Biswas, Z. Ming, M. Coustaty, M. Rusinol, O. Ramos Terrades, J. Llados. "GlobalDoc: A Cross-Modal Vision-Language Framework for Real-World Document Image Retrieval and Classification." WACV 2025, pp. 1436-1446. https://openaccess.thecvf.com/content/WACV2025/papers/Bakkali_GlobalDoc_A_Cross-Modal_Vision-Language_Framework_for_Real-World_Document_Image_Retrieval_WACV_2025_paper.pdf

DocSum: Domain-Adaptive Pre-training for Document Abstractive Summarization

Published in Winter Conference on Applications of Computer Vision (WACV), 2025

Domain-adaptive pre-training for document-level abstractive summarization.

Recommended citation: P. P. M. Chau, S. Bakkali, A. Doucet. "DocSum: Domain-Adaptive Pre-training for Document Abstractive Summarization." WACV 2025, pp. 1303-1312. https://openaccess.thecvf.com/content/WACV2025W/VISIONDOCS/papers/Chau_DocSum_Domain-Adaptive_Pre-training_for_Document_Abstractive_Summarization_WACVW_2025_paper.pdf

IDTrust: Deep Identity Document Quality Detection with Bandpass Filtering

Published in Winter Conference on Applications of Computer Vision (WACV) Workshops, 2025

Quality assessment for identity documents using bandpass filtering.

Recommended citation: M. Al-Ghadi, J. Voerman, S. Bakkali, M. Coustaty, O. Lessard, N. Sidere. "IDTrust: Deep Identity Document Quality Detection with Bandpass Filtering." WACV Workshops 2025, pp. 716-723. https://openaccess.thecvf.com/content/WACV2025W/AI4MFDD/papers/Al-Ghadi_IDTrust_Deep_Identity_Document_Quality_Detection_with_Bandpass_Filtering_WACVW_2025_paper.pdf

Fusion of GNN and GBDT Models for Graph and Node Classification

Published in International Workshop on Graph-Based Representations in Pattern Recognition (GbRPR), 2025

Hybrid graph learning by combining GNNs with gradient-boosted decision trees.

Recommended citation: M. Farhan, N. I. Kajla, M. D. A. Awan, M. M. Luqman, M. Coustaty, S. Bakkali. "Fusion of GNN and GBDT Models for Graph and Node Classification." GbRPR 2025, pp. 167-178. https://hal.science/hal-05111226v1/file/Fusion%20of%20GNN%20and%20GBDT%20Models%20%28Published%20Work%29.pdf

Confidence-based Knowledge Distillation to Reduce Training Costs and Carbon Footprint for Low-Resource Neural Machine Translation

Published in Applied Sciences, 2025

This paper proposes confidence-based distillation strategies for low-resource neural machine translation.

Recommended citation: M. Zafar, P. J. Wall, S. Bakkali, et al. "Confidence-based Knowledge Distillation to Reduce Training Costs and Carbon Footprint for Low-Resource Neural Machine Translation." Applied Sciences 15(14):8091, 2025. https://www.mdpi.com/2076-3417/15/14/8091

WildKhmerST: A Comprehensive Benchmark Dataset for Khmer Scene Text Detection and Recognition

Published in International Conference on Document Analysis and Recognition (ICDAR), 2025

A large-scale benchmark for Khmer scene-text detection and recognition.

Recommended citation: S. Keo, V. Nom, S. Bakkali, M. M. Luqman, M. Rusinol, M. Coustaty, J-M. Ogier. "WildKhmerST: A Comprehensive Benchmark Dataset for Khmer Scene Text Detection and Recognition." ICDAR 2025, pp. 351-368. https://hal.science/hal-05120511/document

talks

teaching