Juan A. Rodriguez

Artificial Intelligence Researcher

prof_pic.jpg

ServiceNow Research

Mila, Quebec AI Institute

École de Technologie Superieure, University of Quebec

Montreal, Quebec, Canada

Hi, I am Juan Rodriguez, you can also call me Joan (in Catalan).

I am a Researcher at ServiceNow Research and a PhD student at Mila and École de Technologie Superieure, University of Quebec. I am based in Montreal, Canada, but I am from Barcelona, Spain. I am advised by Prof. Marco Pedersoli, Prof. Chris Pal, and Dr. David Vazquez

My research interests are in the intersection of Computer Vision and Natural Language Processing, with a focus on multimodal generative models. I am interested in learning how to leverage information from different modalities to generate more accurate and controlable outputs in all modalities. Recently I have been working with large language and vision models using transformers and diffusion. I have also been exploring the paradigm of generating code as an alternative for generating images (e.g. scalable vector graphics).

I obtained a M.Sc. in Computer Vision from Universitat Autònoma de Barcelona (UAB) with honors for my master thesis Text to Scientific Figure Generation performed at ServiceNow Research and advised by Dr. David Vazquez and Dr. Pau Rodríguez. Previously, I obtained a B.Sc. in Telecommunication Networks Engineering from Universitat Pompeu Fabra (UPF) and carried out my bachelor thesis on Handwritten Text Recognition advised by Prof. Xavier Binefa. I did research internships at UPF advised by Prof. Xavier Binefa and Prof. Miquel Oliver, and at CVC-UAB advised by Prof. Joost van de Weijer.

Publications

  1. ocr-vqgan.png
    OCR-VQGAN: Taming text-within-image generation
    Juan A. Rodriguez, David Vazquez, Issam Laradji, Marco Pedersoli, and Pau Rodriguez
    In WACV 2023 (Oral), 2023
  2. figgen.png
    FigGen: Text to Scientific Figure Generation
    Juan A. Rodriguez, David Vazquez, Issam Laradji, Marco Pedersoli, and Pau Rodriguez
    In ICLR 2023 (Tiny paper track), 2023
  3. affective.png
    Affective State-Based Framework for e-Learning Systems
    Juan A. Rodriguez, Joaquim Comas, and Xavier Binefa
    In CCIA 2021 (Oral), 2021
  4. starvector.png
    StarVector: Generating Scalable Vector Graphics Code from Images
    Juan A. Rodriguez, Shubham Agarwal, Issam H. Laradji, Pau Rodriguez, David Vazquez, Christopher Pal, and Marco Pedersoli
    In preprint, 2023