The Multilingual Picture Dataset (Multipic)

The Multilingual Picture Dataset (Multipic)

This project aims to develop a picture database normed in several languages and dialects, including minority languages. Multipic has currently been normed in 34 languages or language varieties, with more on the way, as detailed in the table below:

Multipic languages and language varieties (forthcoming norms in italics)
Albanian Finnish Portuguese
Alemannic French European
Arabic (Lebanese) Metropolitan Brazilian
Armenian Québecois Romanian
Basque Galician Russian
Catalan German Scots
Chinese Greek Scottish Gaelic
Cantonese Standard Serbian
Mandarin Cypriot Silesian
Cornish Hawaiian Slovak
Croatian Hebrew Spanish
Czech Hungarian Mexican
Dutch Indonesian Peninsular
Standard Italian Rioplatense
Flemish Japanese Thai
Kashubian Turkish
English Korean Standard
American Lombard Cypriot
Australian Malay Ukrainian
British Norwegian Vietnamese
Malay Polish Welsh
Farsi

More details can be found in the related publications:

First set of norms: https://www.nature.com/articles/s41597-022-01552-7 

Cantonese: https://link.springer.com/article/10.3758/s13428-024-02362-y

Galician: https://www.frontiersin.org/journals/psychology/articles/10.3389/fpsyg.2025.1551000/full

The norms can be accessed here: https://figshare.com/articles/dataset/Untitled_Item/19328939

If you are interested in norming Multipic in a new language, feel free to contact Christos Pliatsikas or Jon Andoni Dunabeitia