Applications of Computational Linguistics
21 December 2001

I-CALL and Second Language

Acquisition (SLA)
 
 
 
 
 

Contents



 
 
 
 



Thomas B. Leisibach

Blotzheimerstrasse 42

CH-4055 Basel

EPi: www.ep-i.ch / leisibach@ep-i.ch

tomtec: www.tomtec.ch / webmaster@tomtec.ch





 
 
 
 

The topic which I Iike to present to you this afternoon deals with so-called "intelligent" language tutor-systems. Therefore we will have to check out the terms intelligent on one hand and language-tutor system on the other.

First of all itís the text which all of you have read I suppose:

I-CALL and second language acquisition by Nina Garrett. At the moment, sheís a director at the Centre for Language Study at Yale University in New Haven/Connecticut. The text we have read was written in 1995, 6 years ago then. In Computer Science it is quite important to know that because things are changing very fast.
 
 

Teacher participation in I-CALL Development

In the beginning of her paper, sheís investigating a very important issue, namely the participation of teachers in the process of computer-assisted language learning-systems. Language teachers have been little involved in the development of the systems for the most part what is quite astonishing I think. Why that? She says that foreign languages is a problematic field that for complicated reasons has not till now been very open to innovation in the teaching of language itself. That is because most departments of foreign languages consider themselves to be departments of literature and they regard only literary theory as being of intellectual importance to the discipline. Language teaching is still seen as the service end of the field and most language teachers are still trained primarily in methodology, not in linguistics or even applied linguistics.
 
 

Common assumptions about technology and Language Learning

Adaptation to individual preferences

Learner-Centeredness The issue of learner-centeredness is closely related to the idea of individualization. In contrast, in a teacher-centered classroom the teacher controls all language interactions; almost all dialogue is between the teacher and one student at a time. In a learner-centered classroom the students talk to each other as much as to the teacher and they have a fair degree of autonomy in participating in classroom activities. With teacher-centered material design everything the student does is scripted and anticipated by the teacher in the person of the lesson designer. In learner-centered materials the student can make many more choices about what to work on.

Intelligent language-acquisition systems vary along the learner-centeredness perspective which is entirely appropriate as Nina Garrett states.

But it may be premature to assume that a high degree of learner-centeredness necessarily benefits language learning. If learners themselves do not understand their own styles and strategies youíre not doing them good by turning over control of the learning activities to them.
 
 

The value of help and feedback ICALLís most salient feature is the ability to generate highly specific feedback whether in form of error analysis or of responses to learner initiatives. By now itís not very much known about the extent to which help and feedback actually contribute to the language development of the average learner. There is evidence that many, perhaps even most, learners ignore the availability of help materials even in tasks where it can be seen clearly that they need them. Furthermore itís not known when learners actually do use them whether that use results in learning (that means in change of their knowledge state) or whether it merely helps them over a momentary difficulty.

Therefore one of the best known aspects in I-CALL is the development of sophisticated parsers and the effort to tailor their output to provide linguistically precise feedback of grammatical structure for the benefit of learners.
 
 

Pedagogical and theoretical Issues

Nina Garrett points out that approaches to language teaching and learning can be grouped into two broad categories:

This difference exists also in non-technology based approaches to language teaching. She states these two positions have become more extreme and have hardened into dogma. So the difference has become the basis for a dangerous polarization of the field.

Current Bases for CALL and I-CALL

The theory produced in the 1970s and the 1980s is basis for most language material development today. This position held the following important points:


 
 

Out of these points two major assumptions arise:

Several authors asserts that SL-Learning by others should be structured to be as much as possible like non-classroom first language learning by very young children. This is not possible. Itís obvious that adult learners simply cannot set aside their adult cognitive processing and ignore all that they know about language and language use. The influences of their schooling need to be considered too. Itís recognizable that the learning process is enormously variable. Individual learners vary even more in the learning styles and strategies they employ in classroom than they did as babies. So Nina Garrett claims that the development of ICALL systems for SLA should not be shaped to resemble the FLA closely. Theories of language, language processing and language acquisition: the need for a new synthesis for ICALL

In an next part of the paper Nina Garrett comes back to the distinction between language as linguistic system and language as a communicative behaviour. And there she states that the choice of goal dictates the choice of method. But it has to be understood that identification of a goal is not a theory of how the goal is to be reached. The most important basis for theories of language and theories of language acquisition is a theory of language processing. It has to be understood how people associate meaning with linguistic form, and how the association is stored, retrieved and deployed in communication that means how language knowledge is organised and used by the mind.

Processing is primary she states and the nature of language systems must be understood relative to it.

Former study of processability and learnability does not address the actual activities of processing and learning of real communicative language in real time by real people. It addresses only the logical problem of language processing and acquisition. So N.G. pleads that we should move our efforts into direction of psycholinguistics rather than on a linguistic or sociolinguistic theory. Then she contrasts the language-as-system and the language-as-communicative-behaviour concerning the kind of feedback each provides to the learners. Systems based on grammar parsing return detailed specifications of the linguistic problem whereas the language-as-communicative behaviour systems return a message about some logical problem with the communicative act.

But each of the systems can give learners only partial understanding of their production or comprehension but no insight of what thinking underlies that surface. Parsing for instance is basically an analysis of language form, that means what the error is, not an analysis of language processing, that means, why the learner made the error. No matter what oneís theory of language is:

describing an error is not the same as explaining why the learner made it. Neither linguistic feedback nor communicative feedback can provide the kind of psycholinguistic information the learner should have, information about how meaning and form are connected. I-CALL certainly needs to develop ways of recognizing what level of feedback a learner error requires. When is it an error of form, when is it an error of meaning?

At the end of the paper Miss Garrett is thinking about the partnership between teacher and technology. The advantage of the computer is the ability to record, tabulate and organize data on the learning history of individual learners that goes beyond the human beingsí and it can diagnose individual learner problems more accurately than even the most attentive teacher. But there are language learning activities that absolutely require direct interaction with the teacher and always will, spontaneous oral communication for instance.

The organization and supervision, the whole process of language learning will always require extensive teacher involvement and although computers will not replace teachers, teachers who use computers well will replace those who do not.

ALICE-chan

ALICE-chan (Ac) is a language training environment for Japanese that uses NLP as a basis both for assisting instructors in preparing exercises and for evaluating student responses.

Thatís why I-CALL systems must be able to respond to input noisier than that used by other NLP programs. ALICE's approach to language teaching was that it is a process that involves a combination of exposure, explanation, and practice. In NLP applications, a parser analyses a sentence according to the lexical items and rules provided in the Target Language (TL) grammar.

Project Goals and Design Principles

The primary function of ALICE is to be a tool for research in SLA. Four design principles underly ALICE:

Exercises

In one of ALICEí exercises students must answer a question about a list of historical events. The students answer must contain a positive or negative sentence using the adverb moo or the adverb mada.
 
 

1008 until 1616: Questions

Studentís answer in the box (A), has made two errors

ALICE can deliver many exercices like the type shown here, each exercise can be implemented as MC, fill-in-the-blank or full sentence response. Exercise Authoring The first step in authoring an exercise is to enter text into the ALICE text editor. The text of the exercise should contain the background information to the question and the correct answer to question number 1.

Next step is to identify the words and sentences that will be blanked out for students to fill in. In this figure here the answer for question 1 has been selected for blanking out. The system sends the identified words and sentences to the NLP programs. The NLP programs analyze the selected material and display the analysis as a feature structure at the bottom of the screen (which can be see here).

The feature structure represents the words in the Japanese sentence, their meaning in English gloss, the grammatical features such as tense or aspect.

The feature structure is stored as part of the exercise and compared to a feature structure of the studentís answers during error detection.

® Thatís why the matching of feature structures proofs to be more flexible than matching the sentences themselves because feature structure abstracts away from the surface form of the sentence. ALICE can therefore accept sentences that have the same grammatical features and semantical roles even if they use a different word order or different but equivalent inflectional marking. This increases the range of studentís responses that can be accepted as correct in order to allow for natural variation in the wording of the sentences.
 
 
 
 

NLP An NLP system is designed to accept any combination of romaji (Roman characters) and Japanese characters as input and can return a mixture of Japanese and English in its output.

There is a lexicon which contains information that allows the system to recognize words in all of their morphological variance and to identify syntactic and semantic feature of each word.

Each lexical entry consists of two main parts:

The keys indicate possible orthographical realizations of a word. The second part of a lexical entry contains a list of feature-value pairs (S stands for "sense"). Its value is a short English gloss containing the meaning of the word. The feature M is a ramaji spelling of the citation form of the morpheme.

Morphological analysis of Japanese is complicated by the fact that there are no spaces between words in written Japanese. Correct morphological analysis depends on correct segmentation that means dividing the sentence into different words. Morphological analysis is guided by the special features L and R.

Processes involved are highly complex and not objective of this presentation.

In addition there is another analysis of syntactic structure. The goal of syntactic analysis is to identify the predicate of each clause, the predicateís syntactic and morphological features and a grammatical function for every other element of the clause. There are three stages:

FOLIE Figure 5.8 "Analysis of Student Input"
 
 

To come to an end of this study the question arises what the advantages and disadvantages of ALICE-chan are:

® Advantages for authors include automation of exercise creation and feedback. The students offers better explanation of errors and more chances for communication. However NLP offers also many potential pitfalls:

For instance concerning automation of authoring: there is a high-level of automation achieved. The author only needs to type one correct answer for each exercise item. The NLP system analyses that sentence for structure, grammatical relations and morphological features and stores the analysis as a feature structure as I said before.

That structure characterizes a class of correct answers having similar features. Unfortunately, full automation is not possible for all sentence types because of the problem of ambiguity. Sentences may have multiple meanings, that must be represented by different feature structures. When the NLP programs do not have enough information to resolve the ambiguity they must resort to interactive disambiguation dialogs which requires a bit of extra work from the author. Another problem is dealing with error detection and feedback. The authors do not claim that ALICE-chanís feedback is pedagogical optimal. It contains many technical terms which may be slightly confusing. But ALICE can find the location of errors and can explain them in terms of linguistic relations.

Ambiguity is one of the most pervasive problems in NLP. Humans resolve ambiguity naturally using background knowledge to determine the interpretations that are appropriate in particular contexts. One solution to ambiguity can be the interaction with the user. ALICE provides a disambiguator, that is, a dialog which asks the user if it has detected ambiguities. Another possible disadvantage of NLP is that they take longer to develop than simple CALL systems because of the complexity of NLP programs and the size of grammars and lexicons. On the other hand NLP-based systems are quite portable due to the separation of data and programs. The ALICE parcer for example does not contain any specific knowledge of Japanese, instead, the parser only knows how to apply rules to sentences. If it is given Japanese rules, it will apply Japanese rules, if it is given Spanish rules, it will apply Spanish rules. The same parser can therefore be used for any language.

The authors of ALICE plan to extend the NLP coverage in the near future to several new languages including Korean, Spanish, German and English.
 
 




Focus on Grammar

The older CALL systems and also most of the newer, commercial CALL systems rely on simple techniques such as multiple choice questions where the input by students is severely limited.







Typical CALL activities today make heavy use of the computer's capabilities of storing large amounts of data, e.g. written language, but increasingly now also spoken language and pictures or video. In this way, students can get ample input of the foreign language. In order to check their progress and provide an opportunity for language production, however, some output is necessary as well. This normally takes the form of answers to multiple choice questions or fill in the blank texts with a highly constrained choice of words or phrases

The grammatical transformation of sentences or short answers to given questions are a further type of output students can be asked to produce. All of this kind of language output is relatively easy to check, using simple pattern-matching techniques, but cannot be called creative or very close to real life situations outside school. It does, however, have the advantage that feedback by the machine can be produced very fast, but unfortunately the informational content of feedback which can be achieved with such a technique is extremely limited. Such feedback is necessarily binary, either right or wrong, and can only be varied on a stylistic level.

Error-specific and individualised feedback (The German Tutor)

This paper treats error-specific and individualised feedback in a web-based language tutoring system.

Immediate and individualised learner feedback has long recognised as a significant advantage of CALL over more traditional language instruction. Sophisticated error analysis is crucial for a meaningful SL environment. A number of studies in the recent years have investigated metalinguistic feedback vs. traditional feedback in different CALL environments. It was found that NLP-based intelligent feedback which explains the source of an error is more effective than traditional feedback. Several studies found that metalinguistic feedback is very effective to adult second language learners. This paper here focuses on learner-computer interaction during the error correction process.

In particular, learnersí responses to metalinguistic feedback from an ILTS are examined.

In this study, answers to the following three questions are pursued:

The German Tutor (GT)

The GT is an ILTS that forms the grammar component of a web-based introductory course for German. It contains a grammar and a parser which analyses sentences from the student and detects grammatical and other errors. The goal of the German Tutor is to provide meaningful and interactive vocabulary and grammar practice for learners of German.

In the GT students can choose from a variety of different exercise types (dictation, form sentences and so on).

The pedagogical goal behind an ILTS is to provide error-specific feedback. For example, if a student chooses an incorrect article in German the error might be due to incorrect inflection for gender, number, or case.

Meaningful tasks and interactivity require intelligence on the part of the computer program. The German Tutor emulates two significant aspects of a student-teacher interaction: