The Multilingual Picture Dataset (Multipic)

This project aims to develop a picture database normed in several languages and dialects, including minority languages. Multipic has so far been normed in 33 languages or language varieties, with more on the way, as listed below:

Languages with available Multipic norms:
Arabic (Lebanese)
Basque
Catalan
Chinese (Cantonese, Mandarin)
Czech
Dutch (Standard, Flemish)
English (American, Australian, British, Malay)
Finnish
French (Metropolitan, Québécois)
German
Greek (Standard, Cypriot)
Hebrew
Hungarian
Italian
Korean
Malay
Norwegian
Polish
Portuguese
Russian
Serbian
Slovak
Spanish (Peninsular, Rioplatense)
Turkish
Welsh

Forthcoming:
Albanian
Brazilian Portuguese
Farsi
Galician
Japanese
Latvian
Maltese
Maltese English
Scottish Gaelic
Slovenian
Ukrainian
Vietnamese

More details can be found in the related publication: https://www.nature.com/articles/s41597-022-01552-7

If you are interested in norming Multipic in a new language, feel free to contact Christos Pliatsikas or Jon Andoni Duñabeitia.