Visual Genome

Visual Genome is a dataset, a knowledge base, and an ongoing effort to connect structured image concepts to language.

  • 108,077 Images
  • 5.4 Million Region Descriptions
  • 1.7 Million Visual Question Answers
  • 3.8 Million Object Instances
  • 2.8 Million Attributes
  • 2.3 Million Relationships
  • Everything Mapped to WordNet Synsets
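
To make the listed annotation types concrete, here is a minimal sketch of how one annotated image might be modeled in memory. The class names, field names, and sample values are all hypothetical and chosen for illustration only; they are not the schema of the Visual Genome files or the Visual Genome API.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class SceneObject:
    """One object instance (hypothetical structure, not the official schema)."""
    name: str
    synset: Optional[str] = None          # WordNet synset label; sample values below are illustrative
    attributes: List[str] = field(default_factory=list)


@dataclass
class Relationship:
    """A subject-predicate-object link between two object instances."""
    subject: SceneObject
    predicate: str
    obj: SceneObject


@dataclass
class AnnotatedImage:
    """Groups the annotation types from the list above for a single image."""
    image_id: int
    region_descriptions: List[str] = field(default_factory=list)
    objects: List[SceneObject] = field(default_factory=list)
    relationships: List[Relationship] = field(default_factory=list)
    qa_pairs: List[Tuple[str, str]] = field(default_factory=list)


def triples(image: AnnotatedImage) -> List[Tuple[str, str, str]]:
    """Flatten an image's relationships into (subject, predicate, object) names."""
    return [(r.subject.name, r.predicate, r.obj.name) for r in image.relationships]


# Made-up sample data, not taken from the dataset.
dog = SceneObject("dog", "dog.n.01", ["brown"])
frisbee = SceneObject("frisbee", "frisbee.n.01", ["red"])
image = AnnotatedImage(
    image_id=1,
    region_descriptions=["a brown dog catching a red frisbee"],
    objects=[dog, frisbee],
    relationships=[Relationship(dog, "catching", frisbee)],
    qa_pairs=[("What is the dog catching?", "A frisbee.")],
)

print(triples(image))
```

Keeping relationships as links between object instances, rather than as plain strings, is one way to let attributes and synsets stay attached to the objects each relationship refers to.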

Company Type: Nonprofit

Region: US & Canada

Industry Category: Visual Imaging

Research: Connecting Language and Vision Using Crowdsourced Dense Image Annotations

Toolkit: Visual Genome API
