Readers/Structured Data

The Structured Content team, in the Wikimedia Product department at the Wikimedia Foundation, focuses on enabling growth and consumption of visual (and other non-textual) knowledge content on Wikipedia.

We started our current line of work in Fiscal Year 2023-2024, as part of last year’s Annual Plan, and we will continue to provide support for Fiscal Year 2024-2025 as part of objective and key result WE2.3 (“Guide contributors to add images and references that comply with project guidelines and increase trust in content, for example, by flagging potential issues during their upload/addition”).

The team previously focused on building features that use and allow creation of structured data associated with MediaWiki pages (see also Structured Data Across Wikimedia), as well as on improving media quality on Wikimedia Commons. The latter included improving the upload process (such as UploadWizard improvements), detecting potentially problematic uploads, and improving media metadata.

Mission of the team

The mission of the team is to enable growth and consumption of visual (and other non-textual) knowledge content on Wikipedia. This means focusing on growing the number of articles with visual content and visual references, as well as on increasing interaction with articles that have visual content.

Imagine a future where every article on Wikipedia, whenever appropriate, has high quality images that help drive consumption of knowledge. Moreover, finding images for your purpose, both on Wikimedia Commons and Wikipedia, is a seamless and easy activity. Visual content is available across all topics and languages, is of high quality, has all the necessary metadata, and is easily accessible to users for consumption, contribution, and moderation.

Two important enablers to grow visual content and visual knowledge on Wikipedia are:

  • Identifying visual knowledge gaps (e.g. articles without images/visual references, or missing images on Commons)
  • Creating the right tools to seamlessly discover relevant visual content to grow its coverage on Wikipedia

Team’s product portfolio

The Structured Content team's product portfolio encompasses various tools for structuring and discovering visual content:

Structured Data Across Wikimedia

A project designed to help structure content on wikitext pages in a way that will be machine-recognizable and -relatable, to make reading, editing, and searching easier and more accessible across projects and on the Internet.

MediaSearch

A new search back- and front-end for finding files which leverages categories, structured data and wikitext from Wikimedia Commons and Wikidata to find its results.
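As a rough illustration of searching Commons files by both free text and structured data, the sketch below builds a request URL for the general MediaWiki search API on Commons. It is not MediaSearch's own implementation, and it is an assumption that such a query approximates what MediaSearch does; in particular it does not reproduce MediaSearch's ranking. The function name `build_file_search_url` and its parameters are hypothetical; `haswbstatement` is a search keyword for matching structured data statements, and P180 is the "depicts" property.

```python
from urllib.parse import urlencode

COMMONS_API = "https://commons.wikimedia.org/w/api.php"


def build_file_search_url(text, depicts_item=None, limit=10):
    """Build a search URL for files on Commons (hypothetical helper).

    text:         free-text query matched against file pages
    depicts_item: optional Wikidata item ID (e.g. "Q146") that the file
                  must depict according to its structured data
    """
    query = text
    if depicts_item:
        # Restrict results to files with a "depicts" (P180) statement
        query += f" haswbstatement:P180={depicts_item}"
    params = {
        "action": "query",
        "list": "search",
        "srsearch": query,
        "srnamespace": 6,  # namespace 6 is the File namespace
        "srlimit": limit,
        "format": "json",
    }
    return f"{COMMONS_API}?{urlencode(params)}"


url = build_file_search_url("cat", depicts_item="Q146")
print(url)
```

The sketch only constructs the URL and does not send the request, so it can be inspected without network access.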

Image Suggestions

Image Suggestions combines the results of the Image Suggestions algorithm and MediaSearch to suggest potential image matches for unillustrated articles.

Structured Data on Commons

Structured data on Commons is multilingual information about a media file that can be understood by humans, with enough consistency that it can also be uniformly processed by machines.
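To make the idea of "multilingual and machine-processable" concrete, here is a minimal sketch of what structured data for one file might look like as a data structure. The class `MediaFileData`, its fields, and the example file are hypothetical and greatly simplified; they are not the actual storage format used on Commons. The sketch assumes captions keyed by language code and "depicts" values given as Wikidata item IDs (Q146 is the item for the house cat).

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class MediaFileData:
    """Hypothetical, simplified structured data for a single media file."""

    file_title: str
    # Language code -> human-readable caption
    captions: Dict[str, str] = field(default_factory=dict)
    # Wikidata item IDs for what the file depicts
    depicts: List[str] = field(default_factory=list)

    def caption_for(self, lang: str, fallback: str = "en") -> Optional[str]:
        """Return the caption in `lang`, else in `fallback`, else None."""
        return self.captions.get(lang, self.captions.get(fallback))


cat_photo = MediaFileData(
    file_title="File:Example cat.jpg",  # hypothetical file name
    captions={
        "en": "A cat sitting on a wall",
        "it": "Un gatto seduto su un muro",
    },
    depicts=["Q146"],
)

# Humans read the caption in their own language...
print(cat_photo.caption_for("it"))
# ...while machines can filter on language-independent item IDs.
print("Q146" in cat_photo.depicts)
```

Because the "depicts" values are identifiers rather than words, the same file can be found regardless of the language the reader searches in.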

UploadWizard improvements

Improving the current user experience of UploadWizard to reduce the number of bad uploads and, in the longer term, to reduce the burden on moderators and improve the quality of content.

Commons uploads detection tools

We developed two tools that automatically detect potentially problematic content when it is uploaded to Commons, in order to facilitate its evaluation by the community.

How to contact us

If you have questions or need clarification, please contact us on Wikimedia Commons.