Xavier Juanola Molet

I’m Xavier, and my academic journey began with a Bachelor’s degree in Physics from the Universitat de Barcelona (UB) in 2018, followed by a Master’s in Artificial Intelligence from Universitat Pompeu Fabra (UPF) in 2019. I further expanded my expertise by pursuing advanced studies with a Master’s in High Energy Physics, Astrophysics, and Cosmology at Universitat Autonoma de Barcelona.

For the past three and a half years, I’ve worked as a Data Scientist in industry, applying my skills in real-world scenarios. Currently, I’m in the third year of my PhD at UPF, focusing on Audio-Visual Sound Source Localization within the Intelligent Multimodal Vision Analysis (IMVA) group under Professor Gloria Haro’s supervision. My research integrates multimodal deep learning techniques—leveraging both video and audio—to push the boundaries of computer vision. An enriching stint at New York University also broadened my perspective and deepened my research experience.

My work is driven by a passion for deep learning and computer vision, and I’m excited to continue contributing to the advancement of artificial intelligence through innovative research and practical applications.

Publications