Multimodal Document Understanding with Unified Vision and Language Cross-Modal Learning

Published in PhD Thesis, Universite de La Rochelle, 2022

Recommended citation: S. Bakkali. "Multimodal Document Understanding with Unified Vision and Language Cross-Modal Learning." PhD Thesis, Universite de La Rochelle, 2022. [Paper]

PhD thesis on unified vision-language learning for multimodal document understanding.