Bibliographic Control of Chinese Material in the United Kingdom*
Sandra Gilkes
uczcsmg@ucl.ac.uk
Libraries that cater for material in Chinese need to be able to provide access to their collections for those learning the language and those who can read and understand the language already. For this to be effective, the Chinese language libraries in the United Kingdom need to assess who their readers are and what their requirements might be. For the most part, the majority of Chinese material users in U.K. libraries are those studying Chinese language and literature and who, therefore, have some level of competence in the Chinese language. Given the outline of the need to provide access as mentioned, the main consideration for a library to cater for Chinese is whether to adopt a romanisation scheme that the majority of users can use and recognise so that even if there is a limited amount of ability to read Chinese (because readers are learning the language), use of the romanisation scheme can at least help locate the required item. At this point, Pinyin seems to be the romanisation scheme that is being more widely brought into practice in libraries. For Chinese, there are several schemes that can be used for the activity of romanisation, in itself a form of script conversion. However, since Pinyin was created in China and is increasingly being used outside of China as well. In 1997, the Library of Congress decided to switch all its records from Wade-Giles to Pinyin romanisation. Therefore, it would seem appropriate that those libraries that need to use a romanisation scheme should, for reasons of standardisation and facilitating access, opt of Pinyin.
For material in non-roman scripts such as Chinese, bibliographic control, defined as the way in which a library organises access to its collection, includes the activity of romanisation. According to Spalding, romanisation is "the representation of roman letters of names and words originally written in some other writing system"[1]. There are several romanisation schemes for Chinese. Since it is suggested that this is the scheme that is to use in libraries, Pinyin is explained in greater detail: It is a romanisation scheme based on the 26 roman letters of the alphabet except v, four modified roman letters and the four diagraphs zh, ch, sh, and ng as well as the letter u with umlaut to represent y. The Pinyin system also employs four diacritical marks for the tones of the Chinese pronunciation. In this instance, Chinese refers to the standard pronunciation of Mandarin or Putonghua, and the four tones of pronunciation of Chinese - high, high rising, falling and rising, falling - all of which are indicated with diacritical marks. When using Pinyin, words are romanised into a single linguistic unit. There are intrinsic advantages to the use of Pinyin over other schemes, and these are given below:
Examples of how Pinyin is expressed are included below in the visual and phonetic comparison between Wade-Giles and Pinyin romanisation schemes and in appendix 1.
Pinyin | Wade-Giles | 中文 |
---|---|---|
Bingju Fenxi | Ping Chu Fen Hsi | 病句分析 |
Putonghua Zhengyin | Pu Tung Hua Cheng Yin Shou Tse | 普通话正音 |
Beijing | Peking[2] | 北京 |
Given the developments in automation and the subsequent attempts at using the vernacular Chinese language for creating catalogue records and providing records for readers, it is interesting to consider what the two libraries in the United Kingdom that have the most significant Chinese collections do in order to ensure bibliographic control of their collections. Having chosen to romanise using the Pinyin scheme, the libraries must, then, find a way to create cataloguing records and enable public access to the catalogue. In the United Kingdom, those libraries that have the largest and most significant collection of Chinese literature are the British Library and the Library of the School of Oriental and African Studies.
1. The British Library:
The use of technology in libraries and for those libraries that have material in Chinese has increased significantly in the recent years. There are two trends to this development:
1. The development of computers and the Internet for the Chinese language.
2. The use of computers in libraries for cataloguing Chinese language material.
Automation for libraries with the Chinese collections is now at the stage of being able to input in the vernacular Chinese characters, and this extra facility has been the reason for a reconsideration of whether to use a romanisation scheme at all. In the United Kingdom, Allegro C and Innopac are the most widely used computer systems for this. They enable cataloguers to create records using the vernacular script; assuming that the readers want to be able to use a catalogue in the vernacular Chinese script, this is most ideal. However, given that as stated, there are valid reasons for the use of the romanisation scheme Pinyin. The best solution would seem to catalogue and have access to the catalogue by use of both the vernacular Chinese script and Pinyin. (Hereby, I draw the reader's attention to a forthcoming publication on this precise subject: "Information Processing for CJKV" by Ken Lunde to be released by O'Reilly Associates, Inc. in October 1998 in the United States of America.)
The automation facilities for Chinese characters, i.e. non-roman, are also connected to machine readable data character coding and the standards that exist for this. Some of the standards used for this exchange of bibliographic information in Chinese are listed:
A significant amount of library catalogues are now also accessible via the Internet. It is interesting to note, therefore, that the medium of the Internet must, as a consequence, also be able to cater for material in different scripts. The means to do this is by using the appropriate software for transcribing into and out of the Chinese script. Without the appropriate software, the vernacular Chinese text appears, as no doubt the majority of Internet users have experienced, illegible. In order to read the web-pages, the codes such as:
Given that the Internet is increasing in significance for libraries as they seek to provide on-line access to their collections, an overview of the current facilities of the Internet is relevant. Examples of some of the software that can be used for Chinese are given below:
Using the Internet for Chinese can best be achieved by the use of those Internet Service Providers (ISP) that offer Chinese facilities, as listed:
Cinet-L news items 5/7/98 and 5/14/98 indicate that Yahoo Chinese has also been created. This is a 10,000 site index created in both simplified and traditional Chinese, with search results display in both styles. (According to Digital, 30% of documents on the web are in languages other than English.) In addition, Netscape has also voiced its intention to launch a Chinese-language guide to global computer networks, given that industry analysts believe the number of Internet users in China will increase to 6 million by the year 2000. When using the Internet, notices that the GB or Big 5 Code or EACC must be used or relevant software must be provided, appear.
An analysis of the subject of romanisation for Chinese in libraries should come to a conclusion as to whether there should be romanisation at all, and whether this is the most appropriate way for bibliographic control of Chinese collections in libraries in the United Kingdom. The arguments in favour of romanisation in libraries, which should consider the position of the cataloguer and the library users at the same time, are very different to those arguments in favour for the use of romanisation per se. For example, in the case of geographical place names as considered in the works of Aurousseau. (The Rendering of Geographical Place Name -1957). However, once the library has opted to choose Pinyin, it should be noted that the means to use this for Chinese can be adapted to the on-line catalogue such as Allegro C and Innopac, and that this can be used for input and creation of cataloguing records in the vernacular Chinese. It is suggested that this is the best means for the attainment of bibliographic control for libraries. Given the increasing use of computer in libraries, it would seem sensible to suggest that those libraries establishing CJK collections are able to provide both the Chinese vernacular and the romanised form in Pinyin. It should also be noted that whatever the scheme chosen and whether cataloguing is carried out using a romanisation scheme or not, a library that holds material in non-roman scripts such as Chinese needs also to provide the appropriate library personnel, who, regardless of nationality, should be able to instruct users in the use of the OPAC for Chinese and the romanisation scheme being used in the library.
Postscript:This is a brief overview of current practice in libraries in the United Kingdom. It is interesting to consider whether it is in fact appropriate for a U.K. library to provide catalogue records in a different language, albeit one that uses roman letters, but increasingly, in the vernacular Chinese script. Should any British library, funded by the U.K. population, actually provide access via a different language, (although this is also done for other languages such as French or German,) let alone different scripts, even if the majority of its readers, despite learning the Chinese language, are actually of English or other national origin. Why not consider cataloguing Chinese material in English by means of transcribing into the English phonetic alphabet? At this point, the ratio of Chinese nationalities and other groups would need to be assessed as would the percentage of those library users reside in the U.K. to consider where the largest user group comes from and what their preferences are. By contrast, the National Library of China catalogues English material by sinisation into Pinyin. In this instance, when the majority of readers are of Chinese origin, although there are others of differing nationality, and although those seeking English-language material, are learning the English language, access to those books is provided by using a vernacular Chinese Pinyin catalogue record. The National Library of China does not, for example, catalogue its English material in English to provide access to English books, even if its library readers, who are mainly Chinese, might be learning English which to them is a foreign language. On the other hand, remembering that this National library is in China, it seems appropriate for it to catalogue using Pinyin and not English. It is important to remember that bibliographic control, despite the implications, should not be established in order to complicate access but in order to provide facilitated access to readers.
Appendices:
1. Chinese Romanisation Table for Pinyin- University of UCLA
http://www.testlibrary.ucla.edu/libraries/eastasian/chitable.html
Note: this table is included on the Internet by the library for the use of library users who wish to acces the catalogue and thereby the CJK collection of the material in the library. It indicates the different romanisation schemes that the library offers -Wade-Giles and Pinyin as well as Bopamofa the Taiwanese romanisation scheme. It is interesting to note that this is actually provided on the Internet, without readers actually having to adopt/use another software package in order to make the Chinese script legible.
2. List of URL's used in the research:
3. Notes:
[1] Spalding, Summer C. (1977). "Romanisation re-examined". In Library Resources and Technical Services, 77, p.3.