当前位置:
文档之家› 计算机语言学语料库语言学机器翻译电子教案
计算机语言学语料库语言学机器翻译电子教案
0. Warm-up Questions
In what ways can computer facilitate our language learning?
To what extent do you rely on computer in your English learning?
How to improve the output quality of machine translation?
2. CALL
2.3 Types of CALL programs
➢ Davies & Higgins (1985): Gapmaster, Mazes, etc. ➢ Jones & Fortescue (1987): Matchmaster, Wordstore, etc. ➢ Higgins (1993): Customizing, Computer networks, etc
➢ Corpus: a collection of linguistic data, either compiled as written texts or as transcription of recorded speech.
➢ Corpus linguistics deals with the principles and practice of using corpora in language study.
English Linguistics: An Introduction
Chapter 10 Language and Computer
Chapter 10 Language and Computer
0. Warm-up Questions 1. Computational Linguistics 2. CALL 3. Machine Translation 4. Corpus Linguistics
3. Machine Translation
3.1 Introduction
➢ Definition: the use of machine (usually computers) to translate text (or speech) from one natural L to another.
2.4 Advantages and Problems
➢ Advantages ✓ Motivation, adaptive, authenticity, critical thinking ➢ Problems (Limitations of the technology) ✓ ability (human-like interaction), availability (cost), etc.
3. Machine Translation
3.3 Research methods
➢ Rule-based: Transfer- & dictionary-based, interlingual ➢ Knowledge-based: semantic, pragmatic, real-world ➢ Corpus-based: statistical, example-based
In terms of function, there are four common types of corpora:
➢ General corpora: broadly homogeneous ➢ Specialized corpora: for specific purposes ➢ Sample corpora: genre-based ➢ Monitor corpora: gigantic, ever moving store
4.2 Features of the corpus
➢ Representativeness ➢ Finite size ➢ Machine-readable form ➢ A standard reference
4. Corpus Linguistics
4.3 Types of the corpus (p273)
1.2 Related subjects
➢ Programmed instruction 编序教学法、程式化教学 ➢ Speech synthesis 言语合成 ➢ Automatic recognition of human speech ➢ Automatic translation of natural languages ➢ Communication between people and computers ➢ Text processing, etc
What is the impact of the Internet on machine translation?
1. Computational Linguistics
1.பைடு நூலகம் Definition (p226)
➢ A branch of applied linguistics, dealing with computer processing of human language.
➢ Types: Unassisted MT and Assisted MT; T2T MT, S2S MT, S2T MT, T2S MT
3.2 History of development
➢ 1950s: independent work by MT researchers ➢ 1960s: hope for good quality ➢ Since 1970s: computer-based tools
3.4 Advantages and Problems
➢ Advantages: cost-effective, time-saving ➢ Problems: output quality hard to ensure (reasons?)
4. Corpus Linguistics
4.1 Definition (p238)