《胡壮麟语言学教程修订版课堂笔记和讲义精选(共4页).doc》由会员分享,可在线阅读,更多相关《胡壮麟语言学教程修订版课堂笔记和讲义精选(共4页).doc(4页珍藏版)》请在taowenge.com淘文阁网|工程机械CAD图纸|机械工程制图|CAD装配图下载|SolidWorks_CaTia_CAD_UG_PROE_设计图分享下载上搜索。
1、精选优质文档-倾情为你奉上Chapter 10 Language and ComputerWhat is computational linguistics?Computational linguistics is a branch of applied linguistics, dealing with computer processing of human language.(1)It includes the analysis of language data so as to establish the order in which learners acquire various
2、grammatical rules or the frequency of occurrence of some particular item.(2)It includes electronic production of artificial speech and the automatic recognition of human speech.(3)It includes research on automatic translation between natural languages.(4)It also includes text processing and communic
3、ation between people and computers.10.1Computer-assisted language learning (CALL)10.1.1CAL / CAI vs. CALLCAI (Computer-assisted instruction) means the use of a computer in a teaching program. CAL (Computer-assisted learning) refers to the use of a computer in teaching and learning and in order to he
4、lp achieve educational objectives. CAI aims at seeing educational problems on the part of the teacher, whereas CAL emphasizes the use of a computer in both teaching and learning. CALL (Computer-assisted language learning) means the use of a computer in the teaching or learning of a second or foreign
5、 language. If CAI or CAL deals with teaching and learning problems in general, CALL deals with language teaching and learning in particular.10.1.2Phases of CALL development (4 periods)1.During this period, computers were large mainframe machines kept in research institutions.2.Small computers appear
6、ed and cost cheaper than before, which made a generation of programs possible.3.The learning was not so much supplied by the language of the text itself as by the cognitive problem-solving techniques and the interaction between students in the group.4.Instead of writing specific programs for languag
7、e teaching, word-processing has adapted to language teaching by enabling students to compose and try our their writings in a non-permanent form.10.1.3Technology1.Customizing, template, and authoring programs.2.Computer networks.3.Compact disk technology4.Digitized sound.10.2Machine translation (MT)1
8、0.2.1History of development1.The independent work by MT researchers2.Towards good quality output3.The development of translate tools10.2.2Research methods1.Linguistic approach2.The practical approaches(1)The transfer approach(2)The inter-lingual approach(3)Knowledge-based approach10.2.3MT Quality10.
9、2.4MT and the Internet10.2.5Spoken language translation10.2.6MT and human translationAt the beginning of the new century, it is apparent that MT and human translation can and will co-exist in relative harmony. Those skills which the human translators can contribute will always be in demand.(1) When
10、translation has to be of “publishable” quality, both human translation and MT have their roles. MT plays an important part in large scale and rapid translation of boring technical documentation, highly repetitive software localization manuals, and many other situations where the costs of human trans
11、lation are much higher than the ones of MT. By contrast, the human translators are and will remain unrivalled for non-repetitive linguistically sophisticated texts (e.g. in literature and law), and even for one-off texts in specific highly-specialized technical subjects.(2) For the translation of te
12、xts where the quality of output is much less important, MT is often an ideal solution.(3) For the one-to-one interchange of information, there will probably always be a role for the human translators. But for the translation of personal letters, MT systems are likely to be increasingly used; and, fo
13、r electronic mail and for the extraction of information from web pages and computer-based information services, MT is the only feasible solution.(4) As for spoken translation, there must surely always be a market for the human translators. But MT systems are opening up new areas where human translat
14、ion has never featured: the production of draft versions for authors writing in a foreign language, who need assistance in the translation of information from databases; and no doubt, more such new applications will appear in the future as the global communication networks expand and as the realisti
15、c usuality of MT becomes familiar to a wider public.10.3Corpus linguistics10.3.1Definition1.Corpus (pl. corpora): A collection of linguistic data, either compiled as written texts or as a transcription of recorded speech. The main purpose of a corpus is to verify a hypothesis about language for exam
16、ple, to determine how the usage of a particular sound, word, or syntactic construction varies.2.Corpus linguistics: Corpus linguistics deals with the principles and practice of using corpora in language study. A computer corpus is a large body of machine-readable texts.10.3.2Criticisms and the reviv
17、al of corpus linguistics10.3.3Concordance10.3.4Text encoding and annotation1.It should be possible to remove the annotation from an annotated corpus in order to revert to the raw corpus.2.It should be possible to extract the annotation by themselves from the text.3.The annotation scheme should be ba
18、sed on guidelines which are available to the end user.4.It should be made clear how and by whom the annotation was carried out.5.The end user should be made aware that the corpus annotation is not infallible, but simply a potentially useful tool.6.Annotation schemes should be based as far as possibl
19、e on widely agreed and theory-neutral principles.7.No annotation scheme has a priori right to be considered as a standard.10.3.5The roles of corpus data1.Speech research2.Lexical studies3.Semantics4.Sociolinguistics5.Psycholinguistics10.4Information retrieval (IR)10.4.1Scope definedData retrieval vs
20、. information retrievalData retrievalInformation retrievalMatchingExact matchPartial or best matchInferenceDeductionInductionData retrievalInformation retrievalModelDeterministicProbabilisticClassificationMonotheticPolytheticQuery languageArtificialNaturalQuery specificationCompleteIncompleteItems wantedMatchingRelevantError responseSensitiveInsensitive10.4.2An information retrieval system10.4.3Three main areas of research1.Content analysis2.Information structure3.Evaluation10.5Mail and news专心-专注-专业