语料库研究基本方法.pptx

上传人:莉*** 文档编号:87289060 上传时间:2023-04-16 格式:PPTX 页数:26 大小:191.56KB
返回 下载 相关 举报
语料库研究基本方法.pptx_第1页
第1页 / 共26页
语料库研究基本方法.pptx_第2页
第2页 / 共26页
点击查看更多>>
资源描述

《语料库研究基本方法.pptx》由会员分享,可在线阅读,更多相关《语料库研究基本方法.pptx(26页珍藏版)》请在taowenge.com淘文阁网|工程机械CAD图纸|机械工程制图|CAD装配图下载|SolidWorks_CaTia_CAD_UG_PROE_设计图分享下载上搜索。

1、主要内容语料库语言学的性质语料库语言学的性质几个常用术语几个常用术语语料库研究的基本方法语料库研究的基本方法第1页/共26页语料库语言学的性质1理性主义与经验主义理性主义与经验主义Rationalism:I think therefore I am.Empiricism:My mind is a blank slate.Seeing is believing.第2页/共26页语料库语言学的性质1the Wax Argument:He considers a piece of wax;his senses inform him that it has certain characteristic

2、s,such as shape,texture,size,color,smell,and so forth.When he brings the wax towards a flame,these characteristics change completely.However,it seems that it is still the same thing:it is still a piece of wax,even though the data of the senses inform him that all of its characteristics are different

3、.第3页/共26页语料库语言学的性质the Wax Argument:Therefore,in order to properly grasp the nature of the wax,he cannot use the senses.He must use his mind.Descartes concludes:“And so something which I thought I was seeing with my eyes is in fact grasped solely by the faculty of judgment which is in my mind.1第4页/共2

4、6页语料库语言学的性质Empiricism:Empiricism emphasizes those aspects of scientific knowledge that are closely related to evidence,especially as discovered in experiments.It is a fundamental part of the scientific method that all hypotheses and theories must be tested against observations of the natural world,r

5、ather than resting solely on reasoning and intuition.1第5页/共26页语料库语言学的性质Science is considered to be methodologically empirical in nature.Corpus linguistics is empirical in nature.1第6页/共26页语料库语言学的性质语言研究中的数据类型语言研究中的数据类型内省数据(内省数据(introspective data):rationalism实验数据(实验数据(experimental data):empiricism真实数据

6、(真实数据(anthentic data):empricism1第7页/共26页语料库语言学的性质语料库语言学提倡真实数据语料库语言学提倡真实数据我们不排斥其他数据类型我们不排斥其他数据类型1第8页/共26页语料库语言学的性质即便在语料库语言学阵营之中即便在语料库语言学阵营之中Corpus-driven:minimum theory-reliance.Exclusive reliance on corpus data for all theoriesCorpus-based:Reliance on corpus data for hypothesis-testingCorpus-referen

7、ced/informed:Occasionally resorting to corpus data for illustrations 1第9页/共26页语料库语言学的性质我们坚决反对不顾语言事实的任何论断我们坚决反对不顾语言事实的任何论断No introspection can claim credence without verification through real language data(Teubert 2005).1第10页/共26页几个常用术语2CorpusCorpus linguistics第11页/共26页几个常用术语Token,type,lemmaThe littl

8、e boy looked at the other boys.2第12页/共26页几个常用术语Collocation is defined as a sequence of words which co-occur more often than would be expected by chance.a big smoker a strong smoker a hard smoker a heavy smoker a furious smoker 2第13页/共26页几个常用术语It is quite possible,in fact,to describe a woman as hands

9、ome.However,this implies that she is not beautiful at all in the traditional sense of female beauty,but rather that she is mature in age,has large features and a certain strength of character.Similarly,a man could be described as beautiful,but this would usually imply that he had feminine features.2

10、第14页/共26页几个常用术语Colligation is defined as a sequence of grammatical categories which co-occur more often than would be expected by chance.2第15页/共26页几个常用术语Semantic prosody is instantiated when a word such as CAUSE co-occurs regularly with words that share a given meaning or meanings,and then acquires

11、some of the meaning(s)of those words as a result.This acquired meaning is known as semantic prosody.(Stewart 2010)2第16页/共26页语料库研究的基本方法3Corpus-based approach:a hypothesis-testing approachCorpus-driven approach:with as“few preconceived ideas”as possible,“keeping the amount of theory-reliance to a mini

12、mum in order not to hinder the process of discovering new phenomena”(Rmer 2005)第17页/共26页语料库研究的基本方法Both approaches almost always involve a comparion of some kind.3第18页/共26页语料库研究的基本方法Sizes of corpora in comparison(Rayson 2003)Small bigEqual sizes3第19页/共26页语料库研究的基本方法Types of comparisonAcross genresAcro

13、ss usersAcross different timesAcross(varieties of)language(s)3第20页/共26页语料库研究的基本方法Corpus comparability3第21页/共26页语料库研究的基本方法Linguistic features in corpus comparisonLexicalLexico-grammaticalSyntacticDiscoursal3第22页/共26页语料库研究的基本方法Statistic tests in corpus comparisonSimple:Relationship(correlation,etc)Dif

14、ference(chi-square,loglikelihood,etc.)Complicated:regression analysis,factor analysis,cluster analysis,correspondence analysis3第23页/共26页语料库研究的基本方法语语料料库库研究问题研究问题研究设计研究设计软件软件统计检验统计检验结结论论?参参照照语语料料库库对比对比结果:结果:词汇词汇短语短语搭配搭配语义韵语义韵类联接类联接句式句式等等数据呈现数据呈现数据分析、解释与讨论数据分析、解释与讨论3第24页/共26页内容55Thank you.第25页/共26页感谢您的观看!第26页/共26页

展开阅读全文
相关资源
相关搜索

当前位置:首页 > 应用文书 > PPT文档

本站为文档C TO C交易模式,本站只提供存储空间、用户上传的文档直接被用户下载,本站只是中间服务平台,本站所有文档下载所得的收益归上传人(含作者)所有。本站仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。若文档所含内容侵犯了您的版权或隐私,请立即通知淘文阁网,我们立即给予删除!客服QQ:136780468 微信:18945177775 电话:18904686070

工信部备案号:黑ICP备15003705号© 2020-2023 www.taowenge.com 淘文阁