Sidebar
Main Content
Languages in the Penn Library Collections
What are the languages Penn Library books and other materials use?
Franklin, the Penn Library's online catalog, employs language codes compliant with the ISO 639-2 and ANSI Z39.53 standards managed by the Library Congress. Although Franklin users may limit search results to specific languages, it is not possible for public users to search directly by language code.
This table, compiled on 19 February 2004, counts active titles -- individual bibliographic records -- in Franklin. The counts have been cleaned: for instance, titles identified as "Miscellaneous languages" have been examined and placed under recognizable (and in some cases, non-standard) language names.
| Pct of Total | Language Name | Titles |
|---|---|---|
| 62.1% | English | 1,820,606 |
| 10.1% | German | 296,258 |
| 6.1% | French | 177,507 |
| 3.1% | Spanish | 90,957 |
| 2.5% | Italian | 73,934 |
| 2.1% | Arabic | 60,944 |
| 1.5% | Chinese | 44,320 |
| 1.4% | Russian | 39,588 |
| 1.3% | Hebrew | 37,583 |
| 1.3% | Latin | 36,650 |
| 1.1% | Japanese | 32,826 |
| 0.8% | Hindi | 23,715 |
| 0.5% | Bengali | 15,981 |
| 0.5% | Urdu | 15,600 |
| 0.5% | Dutch | 13,782 |
| 0.4% | Tamil | 11,039 |
| 0.3% | Persian | 9,479 |
| 0.3% | Portuguese | 8,834 |
| 0.3% | Lithuanian | 8,805 |
| 0.3% | Swedish | 8,340 |
| 0.3% | Turkish | 8,130 |
| 0.2% | Polish | 6,845 |
| 0.2% | Sanskrit | 6,608 |
| 0.2% | Marathi | 6,434 |
| 0.2% | Gujarati | 5,756 |
| 0.2% | Telugu | 5,177 |
| 0.2% | Yiddish | 4,437 |
| 0.1% | Danish | 4,355 |
| 0.1% | Malayalam | 4,235 |
| 0.1% | Tibetan | 3,824 |
| 0.1% | Greek, Modern (1453- ) | 3,267 |
| 0.1% | Nepali | 2,771 |
| 0.1% | Sinhalese | 2,636 |
| 0.1% | Ukrainian | 2,414 |
| 0.1% | Greek, Ancient (to 1453) | 2,389 |
| 0.1% | Korean | 2,351 |
| 0.1% | Czech | 2,326 |
| 0.1% | Panjabi | 1,637 |
| 0.1% | Armenian | 1,507 |
| 0.0% | Catalan | 1,419 |
| 0.0% | Norwegian | 1,360 |
| 0.0% | Frisian | 1,066 |
| 0.0% | Hungarian | 1,038 |
| 0.0% | Serbian | 941 |
| 0.0% | Finnish | 887 |
| 0.0% | Romanian | 887 |
| 0.0% | Rajasthani | 880 |
| 0.0% | Turkish, Ottoman | 874 |
| 0.0% | Croatian | 873 |
| 0.0% | Romance (Other) | 724 |
| 0.0% | Latvian | 638 |
| 0.0% | Maithili | 632 |
| 0.0% | Mongolian | 616 |
| 0.0% | Bulgarian | 595 |
| 0.0% | Newari | 584 |
| 0.0% | Sindhi | 517 |
| 0.0% | Welsh | 469 |
| 0.0% | Pushto | 434 |
| 0.0% | Yoruba | 423 |
| 0.0% | French, Middle (ca. 1400-1600) | 421 |
| 0.0% | Irish | 393 |
| 0.0% | Icelandic | 357 |
| 0.0% | English, Middle (1100-1500) | 323 |
| 0.0% | French, Old (ca. 842-1400) | 321 |
| 0.0% | Indic (Other) | 311 |
| 0.0% | Pali | 307 |
| 0.0% | Slovak | 303 |
| 0.0% | Prakrit languages | 295 |
| 0.0% | Braj | 284 |
| 0.0% | Konkani | 279 |
| 0.0% | Bhojpuri | 261 |
| 0.0% | Kannada | 250 |
| 0.0% | Syriac | 233 |
| 0.0% | Slovenian | 215 |
| 0.0% | German, Middle High (ca. 1050-1500) | 189 |
| 0.0% | Raeto-Romance | 169 |
| 0.0% | Swahili | 164 |
| 0.0% | Kazakh | 161 |
| 0.0% | Estonian | 148 |
| 0.0% | Khasi | 148 |
| 0.0% | Afrikaans | 143 |
| 0.0% | Belarusian | 142 |
| 0.0% | Baluchi | 139 |
| 0.0% | Church Slavic | 139 |
| 0.0% | Galician | 139 |
| 0.0% | Kashmiri | 120 |
| 0.0% | Lahnda | 119 |
| 0.0% | Mayan languages | 119 |
| 0.0% | Assamese | 118 |
| 0.0% | Macedonian | 110 |
| 0.0% | Azerbaijani | 106 |
| 0.0% | English, Old (ca. 450-1100) | 106 |
| 0.0% | Provençal (to 1500) | 104 |
| 0.0% | Occitan (post-1500) | 98 |
| 0.0% | Sino-Tibetan (Other) | 97 |
| 0.0% | Tagalog | 96 |
| 0.0% | Amharic | 94 |
| 0.0% | Aramaic | 92 |
| 0.0% | Akkadian | 87 |
| 0.0% | Somali | 86 |
| 0.0% | Marwari | 84 |
| 0.0% | Ladino | 81 |
| 0.0% | Indonesian | 76 |
| 0.0% | Scots | 75 |
| 0.0% | Coptic | 74 |
| 0.0% | Shona | 74 |
| 0.0% | Niger-Kordofanian (Other) | 73 |
| 0.0% | Georgian | 72 |
| 0.0% | Awadhi | 70 |
| 0.0% | Thai | 65 |
| 0.0% | Pahlavi | 64 |
| 0.0% | Malagasy | 63 |
| 0.0% | Central American Indian (Other) | 62 |
| 0.0% | Dogri | 60 |
| 0.0% | Tigrinya | 56 |
| 0.0% | Basque | 55 |
| 0.0% | Romani | 54 |
| 0.0% | Oriya | 53 |
| 0.0% | Magahi | 52 |
| 0.0% | Sorbian languages | 51 |
| 0.0% | Scottish Gaelic | 47 |
| 0.0% | Austronesian (Other) | 46 |
| 0.0% | Dravidian (Other) | 46 |
| 0.0% | Nahuatl | 45 |
| 0.0% | Uzbek | 45 |
| 0.0% | Lushai | 44 |
| 0.0% | Egyptian | 43 |
| 0.0% | Kurdish | 39 |
| 0.0% | Manipuri | 39 |
| 0.0% | Slavic (Other) | 39 |
| 0.0% | Esperanto | 38 |
| 0.0% | Germanic (Other) | 38 |
| 0.0% | Creoles and Pidgins, French-based (Other) | 37 |
| 0.0% | Cree | 36 |
| 0.0% | Algonquian (Other) | 35 |
| 0.0% | Dakota | 35 |
| 0.0% | Judeo-Arabic | 34 |
| 0.0% | North American Indian (Other) | 34 |
| 0.0% | Samaritan Aramaic | 34 |
| 0.0% | Albanian | 33 |
| 0.0% | Sotho | 33 |
| 0.0% | Ethiopic | 32 |
| 0.0% | Himachali | 32 |
| 0.0% | Quechua | 32 |
| 0.0% | Malay | 31 |
| 0.0% | Hausa | 30 |
| 0.0% | Avestan | 28 |
| 0.0% | Eskimo languages | 27 |
| 0.0% | Dutch, Middle (ca. 1050-1350) | 26 |
| 0.0% | Vietnamese | 25 |
| 0.0% | Breton | 24 |
| 0.0% | Ojibwa | 24 |
| 0.0% | Athapascan (Other) | 23 |
| 0.0% | Gothic | 23 |
| 0.0% | Zulu | 23 |
| 0.0% | Berber (Other) | 22 |
| 0.0% | Sumerian | 22 |
| 0.0% | Altaic (Other) | 20 |
| 0.0% | Ganda | 20 |
| 0.0% | Javanese | 20 |
| 0.0% | Munda (Other) | 20 |
| 0.0% | Mandingo | 19 |
| 0.0% | South American Indian (Other) | 19 |
| 0.0% | Finno-Ugrian (Other) | 18 |
| 0.0% | Ndebele (Zimbabwe) | 17 |
| 0.0% | Papuan (Other) | 17 |
| 0.0% | Tajik | 17 |
| 0.0% | Burmese | 16 |
| 0.0% | Kinyarwanda | 16 |
| 0.0% | Hawaiian | 15 |
| 0.0% | Oromo | 15 |
| 0.0% | Sami | 15 |
| 0.0% | German, Old High (ca. 750-1050) | 14 |
| 0.0% | Samoan | 14 |
| 0.0% | Turkmen | 14 |
| 0.0% | Wolof | 14 |
| 0.0% | Hiligaynon | 13 |
| 0.0% | Ndonga | 13 |
| 0.0% | Nilo-Saharan (Other) | 13 |
| 0.0% | Tatar | 13 |
| 0.0% | Zapotec | 13 |
| 0.0% | Ainu | 12 |
| 0.0% | Lao | 12 |
| 0.0% | Mohawk | 12 |
| 0.0% | Navajo | 12 |
| 0.0% | Serbo-Croatian [script not known] | 12 |
| 0.0% | Kurukh | 11 |
| 0.0% | Twi | 11 |
| 0.0% | Uighur | 11 |
| 0.0% | Afroasiatic (Other) | 10 |
| 0.0% | Bambara | 10 |
| 0.0% | Bihari | 10 |
| 0.0% | Indo-European (Other) | 10 |
| 0.0% | Iranian (Other) | 10 |
| 0.0% | Micmac | 10 |
| 0.0% | Bantu (Other) | 9 |
| 0.0% | Chechen | 9 |
| 0.0% | Creoles and Pidgins, Portuguese-based (Other) | 9 |
| 0.0% | Iroquoian (Other) | 9 |
| 0.0% | Judeo-Persian | 9 |
| 0.0% | Khmer | 9 |
| 0.0% | Sundanese | 9 |
| 0.0% | Burushaski | 8 |
| 0.0% | Cherokee | 8 |
| 0.0% | Creek | 8 |
| 0.0% | Creoles and Pidgins (Other) | 8 |
| 0.0% | Delaware | 8 |
| 0.0% | Fula | 8 |
| 0.0% | Moldavian | 8 |
| 0.0% | Mooré | 8 |
| 0.0% | Shan | 8 |
| 0.0% | Ugaritic | 8 |
| 0.0% | Balinese | 7 |
| 0.0% | Bemba | 7 |
| 0.0% | Chagatai | 7 |
| 0.0% | Dyula | 7 |
| 0.0% | Gã | 7 |
| 0.0% | Apache languages | 6 |
| 0.0% | Aymara | 6 |
| 0.0% | Caucasian (Other) | 6 |
| 0.0% | Guarani | 6 |
| 0.0% | Mapuche | 6 |
| 0.0% | Palauan | 6 |
| 0.0% | Tigré | 6 |
| 0.0% | Celtic (Other) | 5 |
| 0.0% | Choctaw | 5 |
| 0.0% | Igbo | 5 |
| 0.0% | Khoisan (Other) | 5 |
| 0.0% | Luo (Kenya and Tanzania) | 5 |
| 0.0% | Papiamento | 5 |
| 0.0% | Santali | 5 |
| 0.0% | Sogdian | 5 |
| 0.0% | Tswana | 5 |
| 0.0% | Xhosa | 5 |
| 0.0% | Bashkir | 4 |
| 0.0% | Bikol | 4 |
| 0.0% | Chuvash | 4 |
| 0.0% | Faroese | 4 |
| 0.0% | Gilbertese | 4 |
| 0.0% | Iloko | 4 |
| 0.0% | Kongo | 4 |
| 0.0% | Kuanyama | 4 |
| 0.0% | Kyrgyz | 4 |
| 0.0% | Maltese | 4 |
| 0.0% | Niuean | 4 |
| 0.0% | Nubian languages | 4 |
| 0.0% | Nyanja | 4 |
| 0.0% | Old Persian (ca. 600-400 B.C.) | 4 |
| 0.0% | Otomian languages | 4 |
| 0.0% | Rarotongan | 4 |
| 0.0% | Rundi | 4 |
| 0.0% | Semitic (Other) | 4 |
| 0.0% | Arawak | 3 |
| 0.0% | Australian languages | 3 |
| 0.0% | Avaric | 3 |
| 0.0% | Carib | 3 |
| 0.0% | Chinook jargon | 3 |
| 0.0% | Creoles and Pidgins, English-based (Other) | 3 |
| 0.0% | Cushitic (Other) | 3 |
| 0.0% | Duala | 3 |
| 0.0% | Ewe | 3 |
| 0.0% | Grebo | 3 |
| 0.0% | Kara-Kalpak | 3 |
| 0.0% | Kawi | 3 |
| 0.0% | Maori | 3 |
| 0.0% | Mongo-Nkundu | 3 |
| 0.0% | Ponape | 3 |
| 0.0% | Siksika | 3 |
| 0.0% | Songhai | 3 |
| 0.0% | Tuvinian | 3 |
| 0.0% | Banda | 2 |
| 0.0% | Bosnian | 2 |
| 0.0% | Elamite | 2 |
| 0.0% | Fang | 2 |
| 0.0% | Fijian | 2 |
| 0.0% | Garhwali | 2 |
| 0.0% | Herero | 2 |
| 0.0% | Kabyle | 2 |
| 0.0% | Kusaie | 2 |
| 0.0% | Luba-Katanga | 2 |
| 0.0% | Mon-Khmer (Other) | 2 |
| 0.0% | Philippine (Other) | 2 |
| 0.0% | Salishan languages | 2 |
| 0.0% | Siouan (Other) | 2 |
| 0.0% | Swazi | 2 |
| 0.0% | Tahitian | 2 |
| 0.0% | Zuni | 2 |
| 0.0% | Achinese | 1 |
| 0.0% | Afar | 1 |
| 0.0% | Akan | 1 |
| 0.0% | Aljamía | 1 |
| 0.0% | Arapaho | 1 |
| 0.0% | Artificial (Other) | 1 |
| 0.0% | Basa | 1 |
| 0.0% | Batak | 1 |
| 0.0% | Bislama | 1 |
| 0.0% | Bugis | 1 |
| 0.0% | Caddo | 1 |
| 0.0% | Cornish | 1 |
| 0.0% | Dayak | 1 |
| 0.0% | Dzongkha | 1 |
| 0.0% | Gayo | 1 |
| 0.0% | Gondi | 1 |
| 0.0% | Hmong | 1 |
| 0.0% | Hupa | 1 |
| 0.0% | Iberian | 1 |
| 0.0% | Inuktitut | 1 |
| 0.0% | Kalâtdlisut | 1 |
| 0.0% | Karen | 1 |
| 0.0% | Kikuyu | 1 |
| 0.0% | Kru | 1 |
| 0.0% | Lozi | 1 |
| 0.0% | Madurese | 1 |
| 0.0% | Manchu | 1 |
| 0.0% | Manx | 1 |
| 0.0% | Mende | 1 |
| 0.0% | Minangkabau | 1 |
| 0.0% | Mojo | 1 |
| 0.0% | Mpongwe | 1 |
| 0.0% | Nauru | 1 |
| 0.0% | Nzima | 1 |
| 0.0% | Ossetic | 1 |
| 0.0% | Pampanga | 1 |
| 0.0% | Pangasinan | 1 |
| 0.0% | Sango (Ubangi Creole) | 1 |
| 0.0% | Sardinian | 1 |
| 0.0% | Selkup | 1 |
| 0.0% | Serer | 1 |
| 0.0% | Tai | 1 |
| 0.0% | Terena | 1 |
| 0.0% | Tonga (Nyasa) | 1 |
| 0.0% | Tsimshian | 1 |
| 0.0% | Tsonga | 1 |
| 0.0% | Venda | 1 |
| 0.0% | Wakashan languages | 1 |
| 0.0% | Walamo | 1 |
| 0.0% | Washo | 1 |
| (100.0%) | ||
| 97.4% | Total (Single-language titles) | 2,931,066 |
| 0.1% | Multiple languages | 2,717 |
| 2.5% | Undetermined, Code Missing, N/A | 76,638 |
| 100.0% | Total (inc. Multiple languages, etc.) | 3,010,421 |
The fine print
"Is this everything?"
No. Although this table reports languages of books, journals, videos, sound recordings, and electronic resources in Franklin, you should use the "Percent of Total" column as a guide to the Penn Library collections, rather than the "Titles counted" column. Active or unsuppressed bibliographic records in Franklin may have been missed if they lacked a language code. Items using two or more languages may have been coded for the prominent language or relegated to "Multiple languages", depending upon cataloging practice. And, of course, a "single-language" journal may have an article or two in another language!
"Where's my language?"
This table uses MARC 21 language codes, as maintained by the Library of Congress for the bibliographic description of information resources. Sparsely-published languages may be grouped into generic categories, such as "Bantu (other)" or "Eskimo languages" [a superseded name replaced by several other names but used here as a fossil code]. An interesting discussion of the language coding is provided at Ethnologue's web page, "Mapping between ISO 639 Language Codes and the Languages Identified in the Ethnologue". For more information on MARC 21 language codes, see "MARC code list for languages" (Library of Congress web).
Mistakes have been made.
This table uses MARC 21 language codes appearing in the MARC bibliographic record format's field 008/35-37 "Fixed-Length Data Elements / Language". Fossil codes and errors have been corrected through examination of individual Franklin records. Several non-standard language names appear: for instance, Ainu and Burashaski, treated by MARC as "Miscellaneous languages", were added to highlight their presence; identifications for Serbian and Croatian were based upon coding or bibliographic notes, but Serbo-Croatian was added where Roman or Cyrillic script was not indicated.



