Navigation Aids

 
 
 
 
 
Click here to IM, text, or chat
Languages in the Penn Libraries Collections
FindIt:

Sidebar

Main Content

Languages in the Penn Library Collections

What are the languages Penn Library books and other materials use?

Franklin, the Penn Library's online catalog, employs language codes compliant with the ISO 639-2 and ANSI Z39.53 standards managed by the Library Congress. Although Franklin users may limit search results to specific languages, it is not possible for public users to search directly by language code.

This table, compiled on 19 February 2004, counts active titles -- individual bibliographic records -- in Franklin. The counts have been cleaned: for instance, titles identified as "Miscellaneous languages" have been examined and placed under recognizable (and in some cases, non-standard) language names.

Pct of TotalLanguage NameTitles
62.1%English1,820,606
10.1%German296,258
6.1%French177,507
3.1%Spanish90,957
2.5%Italian73,934
2.1%Arabic60,944
1.5%Chinese44,320
1.4%Russian39,588
1.3%Hebrew37,583
1.3%Latin36,650
1.1%Japanese32,826
0.8%Hindi23,715
0.5%Bengali15,981
0.5%Urdu15,600
0.5%Dutch13,782
0.4%Tamil11,039
0.3%Persian9,479
0.3%Portuguese8,834
0.3%Lithuanian8,805
0.3%Swedish8,340
0.3%Turkish8,130
0.2%Polish6,845
0.2%Sanskrit6,608
0.2%Marathi6,434
0.2%Gujarati5,756
0.2%Telugu5,177
0.2%Yiddish4,437
0.1%Danish4,355
0.1%Malayalam4,235
0.1%Tibetan3,824
0.1%Greek, Modern (1453- )3,267
0.1%Nepali2,771
0.1%Sinhalese2,636
0.1%Ukrainian2,414
0.1%Greek, Ancient (to 1453)2,389
0.1%Korean2,351
0.1%Czech2,326
0.1%Panjabi1,637
0.1%Armenian1,507
0.0%Catalan1,419
0.0%Norwegian1,360
0.0%Frisian1,066
0.0%Hungarian1,038
0.0%Serbian941
0.0%Finnish887
0.0%Romanian887
0.0%Rajasthani880
0.0%Turkish, Ottoman874
0.0%Croatian873
0.0%Romance (Other)724
0.0%Latvian638
0.0%Maithili632
0.0%Mongolian616
0.0%Bulgarian595
0.0%Newari584
0.0%Sindhi517
0.0%Welsh469
0.0%Pushto434
0.0%Yoruba423
0.0%French, Middle (ca. 1400-1600)421
0.0%Irish393
0.0%Icelandic357
0.0%English, Middle (1100-1500)323
0.0%French, Old (ca. 842-1400)321
0.0%Indic (Other)311
0.0%Pali307
0.0%Slovak303
0.0%Prakrit languages295
0.0%Braj284
0.0%Konkani279
0.0%Bhojpuri261
0.0%Kannada250
0.0%Syriac233
0.0%Slovenian215
0.0%German, Middle High (ca. 1050-1500)189
0.0%Raeto-Romance169
0.0%Swahili164
0.0%Kazakh161
0.0%Estonian148
0.0%Khasi148
0.0%Afrikaans143
0.0%Belarusian142
0.0%Baluchi139
0.0%Church Slavic139
0.0%Galician139
0.0%Kashmiri120
0.0%Lahnda119
0.0%Mayan languages119
0.0%Assamese118
0.0%Macedonian110
0.0%Azerbaijani106
0.0%English, Old (ca. 450-1100)106
0.0%Provençal (to 1500)104
0.0%Occitan (post-1500)98
0.0%Sino-Tibetan (Other)97
0.0%Tagalog96
0.0%Amharic94
0.0%Aramaic92
0.0%Akkadian87
0.0%Somali86
0.0%Marwari84
0.0%Ladino81
0.0%Indonesian76
0.0%Scots75
0.0%Coptic74
0.0%Shona74
0.0%Niger-Kordofanian (Other)73
0.0%Georgian72
0.0%Awadhi70
0.0%Thai65
0.0%Pahlavi64
0.0%Malagasy63
0.0%Central American Indian (Other)62
0.0%Dogri60
0.0%Tigrinya56
0.0%Basque55
0.0%Romani54
0.0%Oriya53
0.0%Magahi52
0.0%Sorbian languages51
0.0%Scottish Gaelic47
0.0%Austronesian (Other)46
0.0%Dravidian (Other)46
0.0%Nahuatl45
0.0%Uzbek45
0.0%Lushai44
0.0%Egyptian43
0.0%Kurdish39
0.0%Manipuri39
0.0%Slavic (Other)39
0.0%Esperanto38
0.0%Germanic (Other)38
0.0%Creoles and Pidgins, French-based (Other)37
0.0%Cree36
0.0%Algonquian (Other)35
0.0%Dakota35
0.0%Judeo-Arabic34
0.0%North American Indian (Other)34
0.0%Samaritan Aramaic34
0.0%Albanian33
0.0%Sotho33
0.0%Ethiopic32
0.0%Himachali32
0.0%Quechua32
0.0%Malay31
0.0%Hausa30
0.0%Avestan28
0.0%Eskimo languages27
0.0%Dutch, Middle (ca. 1050-1350)26
0.0%Vietnamese25
0.0%Breton24
0.0%Ojibwa24
0.0%Athapascan (Other)23
0.0%Gothic23
0.0%Zulu23
0.0%Berber (Other)22
0.0%Sumerian22
0.0%Altaic (Other)20
0.0%Ganda20
0.0%Javanese20
0.0%Munda (Other)20
0.0%Mandingo19
0.0%South American Indian (Other)19
0.0%Finno-Ugrian (Other)18
0.0%Ndebele (Zimbabwe)17
0.0%Papuan (Other)17
0.0%Tajik17
0.0%Burmese16
0.0%Kinyarwanda16
0.0%Hawaiian15
0.0%Oromo15
0.0%Sami15
0.0%German, Old High (ca. 750-1050)14
0.0%Samoan14
0.0%Turkmen14
0.0%Wolof14
0.0%Hiligaynon13
0.0%Ndonga13
0.0%Nilo-Saharan (Other)13
0.0%Tatar13
0.0%Zapotec13
0.0%Ainu12
0.0%Lao12
0.0%Mohawk12
0.0%Navajo12
0.0%Serbo-Croatian [script not known]12
0.0%Kurukh11
0.0%Twi11
0.0%Uighur11
0.0%Afroasiatic (Other)10
0.0%Bambara10
0.0%Bihari10
0.0%Indo-European (Other)10
0.0%Iranian (Other)10
0.0%Micmac10
0.0%Bantu (Other)9
0.0%Chechen9
0.0%Creoles and Pidgins, Portuguese-based (Other)9
0.0%Iroquoian (Other)9
0.0%Judeo-Persian9
0.0%Khmer9
0.0%Sundanese9
0.0%Burushaski8
0.0%Cherokee8
0.0%Creek8
0.0%Creoles and Pidgins (Other)8
0.0%Delaware8
0.0%Fula8
0.0%Moldavian8
0.0%Mooré8
0.0%Shan8
0.0%Ugaritic8
0.0%Balinese7
0.0%Bemba7
0.0%Chagatai7
0.0%Dyula7
0.0%7
0.0%Apache languages6
0.0%Aymara6
0.0%Caucasian (Other)6
0.0%Guarani6
0.0%Mapuche6
0.0%Palauan6
0.0%Tigré6
0.0%Celtic (Other)5
0.0%Choctaw5
0.0%Igbo5
0.0%Khoisan (Other)5
0.0%Luo (Kenya and Tanzania)5
0.0%Papiamento5
0.0%Santali5
0.0%Sogdian5
0.0%Tswana5
0.0%Xhosa5
0.0%Bashkir4
0.0%Bikol4
0.0%Chuvash4
0.0%Faroese4
0.0%Gilbertese4
0.0%Iloko4
0.0%Kongo4
0.0%Kuanyama4
0.0%Kyrgyz4
0.0%Maltese4
0.0%Niuean4
0.0%Nubian languages4
0.0%Nyanja4
0.0%Old Persian (ca. 600-400 B.C.)4
0.0%Otomian languages4
0.0%Rarotongan4
0.0%Rundi4
0.0%Semitic (Other)4
0.0%Arawak3
0.0%Australian languages3
0.0%Avaric3
0.0%Carib3
0.0%Chinook jargon3
0.0%Creoles and Pidgins, English-based (Other)3
0.0%Cushitic (Other)3
0.0%Duala3
0.0%Ewe3
0.0%Grebo3
0.0%Kara-Kalpak3
0.0%Kawi3
0.0%Maori3
0.0%Mongo-Nkundu3
0.0%Ponape3
0.0%Siksika3
0.0%Songhai3
0.0%Tuvinian3
0.0%Banda2
0.0%Bosnian2
0.0%Elamite2
0.0%Fang2
0.0%Fijian2
0.0%Garhwali2
0.0%Herero2
0.0%Kabyle2
0.0%Kusaie2
0.0%Luba-Katanga2
0.0%Mon-Khmer (Other)2
0.0%Philippine (Other)2
0.0%Salishan languages2
0.0%Siouan (Other)2
0.0%Swazi2
0.0%Tahitian2
0.0%Zuni2
0.0%Achinese1
0.0%Afar1
0.0%Akan1
0.0%Aljamía1
0.0%Arapaho1
0.0%Artificial (Other)1
0.0%Basa1
0.0%Batak1
0.0%Bislama1
0.0%Bugis1
0.0%Caddo1
0.0%Cornish1
0.0%Dayak1
0.0%Dzongkha1
0.0%Gayo1
0.0%Gondi1
0.0%Hmong1
0.0%Hupa1
0.0%Iberian1
0.0%Inuktitut1
0.0%Kalâtdlisut1
0.0%Karen1
0.0%Kikuyu1
0.0%Kru1
0.0%Lozi1
0.0%Madurese1
0.0%Manchu1
0.0%Manx1
0.0%Mende1
0.0%Minangkabau1
0.0%Mojo1
0.0%Mpongwe1
0.0%Nauru1
0.0%Nzima1
0.0%Ossetic1
0.0%Pampanga1
0.0%Pangasinan1
0.0%Sango (Ubangi Creole)1
0.0%Sardinian1
0.0%Selkup1
0.0%Serer1
0.0%Tai1
0.0%Terena1
0.0%Tonga (Nyasa)1
0.0%Tsimshian1
0.0%Tsonga1
0.0%Venda1
0.0%Wakashan languages1
0.0%Walamo1
0.0%Washo1
(100.0%)
97.4%Total (Single-language titles)2,931,066
0.1%Multiple languages2,717
2.5%Undetermined, Code Missing, N/A76,638
100.0%Total (inc. Multiple languages, etc.)3,010,421

The fine print

"Is this everything?"
No. Although this table reports languages of books, journals, videos, sound recordings, and electronic resources in Franklin, you should use the "Percent of Total" column as a guide to the Penn Library collections, rather than the "Titles counted" column. Active or unsuppressed bibliographic records in Franklin may have been missed if they lacked a language code. Items using two or more languages may have been coded for the prominent language or relegated to "Multiple languages", depending upon cataloging practice. And, of course, a "single-language" journal may have an article or two in another language!

"Where's my language?"
This table uses MARC 21 language codes, as maintained by the Library of Congress for the bibliographic description of information resources. Sparsely-published languages may be grouped into generic categories, such as "Bantu (other)" or "Eskimo languages" [a superseded name replaced by several other names but used here as a fossil code]. An interesting discussion of the language coding is provided at Ethnologue's web page, "Mapping between ISO 639 Language Codes and the Languages Identified in the Ethnologue". For more information on MARC 21 language codes, see "MARC code list for languages" (Library of Congress web).

Mistakes have been made.
This table uses MARC 21 language codes appearing in the MARC bibliographic record format's field 008/35-37 "Fixed-Length Data Elements / Language". Fossil codes and errors have been corrected through examination of individual Franklin records. Several non-standard language names appear: for instance, Ainu and Burashaski, treated by MARC as "Miscellaneous languages", were added to highlight their presence; identifications for Serbian and Croatian were based upon coding or bibliographic notes, but Serbo-Croatian was added where Roman or Cyrillic script was not indicated.

*