The Multilingual Picture Dataset (Multipic)

This project aims to develop a picture database normed in several languages and dialects, including minority languages. Multipic has so far been normed in 33 languages or language varieties, with more on the way, as detailed in the table below:

| Languages with available Multipic norms | Forthcoming |
| --- | --- |
| Arabic (Lebanese) | Albanian |
| Basque | Brazilian Portuguese |
| Catalan | Galician |
| Chinese (Cantonese, Mandarin) | Japanese |
| Czech | Latvian |
| Dutch (Standard, Flemish) | Maltese |
| English (American, Australian, British, Malay) | Maltese English |
| Finnish | Scottish Gaelic |
| French (Metropolitan, Québécois) | Slovenian |
| German | Vietnamese |
| Greek (Standard, Cypriot) | |
| Hebrew | |
| Hungarian | |
| Italian | |
| Korean | |
| Malay | |
| Norwegian | |
| Polish | |
| Portuguese | |
| Russian | |
| Serbian | |
| Slovak | |
| Spanish (Peninsular, Rioplatense) | |
| Turkish | |
| Welsh | |

 

More details can be found in the related publication: https://www.nature.com/articles/s41597-022-01552-7

If you are interested in norming Multipic in a new language, feel free to contact Christos Pliatsikas or Jon Andoni Dunabeitia.