Cross-Modal Contrastive Learning for Text-to-Image Generation

Understanding Contextual Facial Expressions Across the Globe

KELM: Integrating Knowledge Graphs with Language Model Pre-training Corpora

Project Guideline: Enabling Those with Low Vision to Run Independently

Learning to Manipulate Deformable Objects

ALIGN: Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision

Accelerating Eye Movement Research for Wellness and Accessibility

Crisscrossed Captions: Semantic Similarity for Images and Text

Introducing FELIX: Flexible Text Editing Through Tagging and Insertion

Do Wide and Deep Networks Learn the Same Things?

Google at ICLR 2021