191x Filetype PDF File size 0.56 MB Source: www.cse.iitb.ac.in
Hindi Wordnet for Language Teaching: Experiences and Lessons Learnt
1 2 1
HanumantRedkar , RajitaShukla , SandhyaSingh ,
1 1 1
Jaya Saraswati , Laxmi Kashyap , Diptesh Kanojia ,
1 1 1
Preethi Jyothi , Malhar Kulkarni and Pushpak Bhattacharyya
1
Indian Institute of Technology Bombay, Mumbai, India.
2
Bennett University, Greater Noida, India.
{hanumantredkar, rajita.shukla38, sandhya.singh}@gmail.com,
{jayasaraswati, laxmi.kashyap, dipteshkanojia}@gmail.com,
{preethijb, malharku and pushpakbh}@gmail.com
Abstract ance of one entity entails the appearance of the
other in the mind. Thus, it was found that subjects
Thispaperreportstheworkrelatedtomak-
respond quicker than normal to the word ‘nurse’ if
1
ing Hindi Wordnet available as a digital
it follows a highly associated word such as ‘doctor’
resource for language learning and teach-
(Church and Hanks, 1990). This property of the
ing, and the experiences and lessons that
mentallexiconisstructurally built in the Wordnets
were learnt during the process. The lan-
and manifests itself in the lexical and semantic re-
guage data of the Hindi Wordnet has been
lations which are encoded in it. Thus, a Wordnet
suitably modified and enhanced to make
is a ready resource of vocabulary of a language,
it into a language learning aid. This aid
whichcapturesassociativelearninginitsstructure.
is based on modern pedagogical axioms
Conventionalsourcesofvocabularylearning,such
and is aligned to the learning objectives
as the dictionaries and thesauri, do not have these
of the syllabi of the school education in
relations due to the very nature of their composi-
India. To make it into a comprehen-
tion. This is the motivation to present Hindi Word-
sive language tool, grammatical informa-
net as a tool for vocabulary learning and teaching.
tion has also been encoded, as far as these
Thesecondmotivationisthefactthateducation
can be marked on the lexical items. The
is undergoing rapid digitalization. Innovative in-
delivery of information is multi-layered,
struction techniques, which can cater to the tenets
multi-sensory and is available across mul-
of anywhere, anytime, any size learning, flipped
tiple digital platforms. The front end has
classroomapproachandblendedlearningenviron-
been designed to offer an eye-catching
ments, are the need of the hour. Such technology
user-friendly interface which is suitable
based learning solutions can help in better learner
for learners starting from age six onward.
engagement in classrooms and can also be used
Preliminary testing of the tool has been
ininformalteaching–learningenvironments,asthe
done and it has been modified as per the
delivery of knowledge is in a multi-sensory mode
feedbacks that were received. Above all,
andisavailableacrossvariousdigitalplatforms. In
the entire exercise has offered gainful in-
the form of a digital resource, the language learn-
sights into learning based on associative
ing aid that is presented here also seeks to redress
networks and how knowledge based on
the long standing problem of the burden of the
such networks can be made available to
school bag of young learners in India. This peda-
modernlearners.
gogical application of Hindi Wordnet will address
all the above issues and provide a resource which
1 Introduction
will cater to the various gap areas of learning.
A Wordnet is a large digital lexical database of
The rest of the paper is organized as follows:
a language in which information is organised
Section 2 provides the related work. Section 3 dis-
around cognitive synonym sets or synsets (Fell-
cusses the need for association capturing in lan-
baum, 1998). The underlying basis of such orga-
guage learning, which provides the basis for using
nization are the word association studies in psy-
Wordnet as a language learning resource. Section
cholinguistics, which proved that our mental lexi-
4 presents the digital aid developed for Hindi lan-
con is structured on associations, i.e. an appear-
guage teaching and learning. Section 5 discusses
1
http://www.cfilt.iitb.ac.in/wordnet/webhwn/index.php the process, experiences and lessons learnt while
developing this aid. Section 6 briefs the field test 3 LanguageLearningthrough
and user feedback followed by conclusion and fu- Association Capturing
ture work in section 7.
3.1 Needforassociation capturing
Association capturing lies at the core of language
2 Related Work
learning. Thetermassociationisusedheretorefer
In digital educational technology, the literature
to the connection or relation between ideas, con-
shows that psychological aspects of language
cepts, or words, which exist in the human mind
learning has been explored under various condi-
and manifest in such a way that an appearance of
tions due to continuous advancements on the tech-
one entity entails the appearance of the other in
nological front. The research indicates that multi-
the mind (Sinopalnikova, 2004). Learning or in-
modal learning has always resulted in better re-
structionstrategiesmustbeabletoencapsulatethis
tention (Dale, 1969). When the information en-
association for the creation of better pedagogical
ters the system through various senses, it helps the
techniques, as associative networks help not only
brain to circumvent the limited processing capa-
in understanding new knowledge but also to retain
bilities of each individual senses and allows for
it firmly in the mind.
greater total information to be processed (Clark
The understanding of how the meaning of a
and Paivio, 1991). With technology in place,
wordisunderstoodandretainedinthehumanmind
the ease of multi-modal learning environments
is of crucial importance as vocabulary plays an im-
have been studied in different settings (Mayer and
portant role in all the competencies of language
Moreno, 2003; Moreno and Mayer, 2007; Shams
learning, such as speaking, reading, and writing.
and Seitz, 2008; Sankey et al., 2010). Mobile As-
Methods of vocabulary learning too have moved
sisted Language Learning (MALL) is also being
beyond the traditional ways which were based on
explored as mobile technologies are becoming an
the behavioristic theory (Demirezen, 1988) of lan-
integralpartoflifestyle. Thefindings(Yang,2013)
guage learning to the modern methods of vocabu-
showthat MALLhasnotreacheditspotential and
lary learning (Nation and Newton, 1997). The lat-
it is moving towards being the new stage of Com-
ter are basedonthecommunicativetheory(Brown,
puter Assisted Language Learning (CALL). Peda-
2000), where understanding a word involves com-
gogical experts have stressed on the need for im-
mitting to memory its form, capturing its meaning
proving various approaches to enhance the will-
and finally knowing how and where to use it.
ingness of the learners for self-directed technol-
The semantic network theory (Collins and Lof-
ogy to maximize the technology potential for lan-
tus, 1975; Collins and Quillian, 1972; Rips et al.,
guage learning (Lai et al., 2016). The use of gam-
1973;Smithetal.,1974)statesthataword’smean-
ification is seen as an effective pedagogical strat-
ing is defined as “whatever comes to mind when
egy which can engage and motivate the learner to
someonesaystheword”or”youshallknowaword
learn in a relaxed environment, which is funda-
by the company it keeps” (Firth, 1957). It implies
mentaltoanylearning(WerbachandHunter,2012;
that a word’s meaning is represented in the mental
Figueroa Flores, 2015).
lexicon by a set of associations of that word with
In language learning, using semantic network otherwords. Thus,themeaningofawordisunder-
relations for learning new word helps in better un- stood as collections of associated concepts . Also,
derstanding of its meaning (Lin, 1997). The word- it has been proved that the syntactic category of
net, a semantic based rich lexical resource, has a word and the associated words that come to the
been used for various language learning applica- mind is the same (Fillenbaum and Jones, 1965).
tions such as - the semantic and lexical relations Since it is rare in any discourse for adjacent words
between synsets enables the learners to know the to be from the same syntactic category, therefore
connotationsofawordalongwithitsvariouspossi- this cannot be explained as association by being
ble contexts (Hiray, 2015). A gamification system contiguous. This association is because a word’s
based on wordnet was used to assess the depth of meaning is represented in the mental lexicon by a
word knowledge of a learner(Brumbaugh, 2015). set of nodes and the links between them. Here the
Asystemwasexperimentedforsimilarlookingand nodes represent concepts whose meaning the net-
nearsynonymswordlearningbasedonwordnetfor work is trying to capture, and the links represent
English language learners (Sun et al., 2011). relationships between concepts.
These nodes and links translate into word rela- Another evaluation was done based on four dif-
tionships and meaning relationships or lexical and ferent criteria, which were, (i) Number of Rec-
semantic relations. These lexical and semantic re- ommendations – those recommended by most of
lations have been found to be cognitive universals the ten ESL resources, (ii) Authority – authors of
(Lin,1997),i.e.,theserelationsarefoundinalllan- the sites should be related to the field of language,
guages. Exploring the various relations in the se- (iii) Simplicity – in terms of presentation, be user-
mantic field such as hypernymy, hyponymy, etc., friendly andmaterialclassifiedasperdifferentlev-
consolidates a new word’s position in the student’s els of learning and (iv) Currency – the site is reg-
mind. ularly updated. Here a total of 6 useful sites were
marked, out of which Princeton WordNet received
a ranking of 4. These results can be projected on
3.2 Associative network based Wordnet for
HindiWordnetaswell,sincestructurallyWordnets
language learning
are similar. Thus, empirically, a strong case for us-
As stated above, the learning and teaching of vo-
ing wordnet for vocabulary teaching is made.
cabulary with associative networks is helpful as
new words are presented in semantically related
group. Thus,itisbestwhenadjectivesaretaughtin 4 ADigitalAidforHindiLanguage
clusters around antonymic pairs, nouns are taught Teaching and Learning
as hyponymsofothernounsorwithsynonymsand
verbs are presented as groups of troponyms or en-
In order to employ association capturing in learn-
tailments. For learners, meaning is captured and
ingandtocatertotheneedsofrapiddigitizationof
retained when, for example, it is said that a horse is
education,thevastlexicaldatabaseofHindiWord-
a kind of an animal (hypernymy relation), uniform
net has been transformed for pedagogical purpose
is opposite of diverse (antonymy relation), snore is
in the form of an e-learning tool - Hindi Shab-
2
a part of sleep (entailment relation) and cultivate
damitra . In this tool, Hindi wordnet data is modi-
has as its object land (argument relation). Such
fied / simplified and further augmentedwithaudio-
associations are not captured in conventional dic-
visual features and grammatical properties, and is
tionaries as much of the structural information is
presented in a learner-friendly format for language
omitted from them and can only be provided by
teaching and learning. Depending upon the under-
semantic networks like Wordnets. A classroom
standinglevelofauserandthepurposeofuse,this
teacher will certainly not be able to provide as-
digitalaidfollowstheselectiveinformationpresen-
sociative information to students, by merely using
tation approach. Henceforth, in this paper, this e-
a conventional dictionary. A thesaurus does give
learning tool will be referred as Digital Aid.
synonyms, but lack in lexico-semantic relations.
Moreover,boththedictionariesandthesauridonot
4.1 Whyselective information presentation?
givethegrammaticalfeaturesthathavebeenadded
to the teaching aid (discussed in section 5.4).
The Digital Aid caters to various types of users,
Various tests have proven that amongst the
viz., school students, teachers, parents, language
plethora of web-based resources for language
learners, content managers, proofreaders, natural
teaching and learning, which have come up due
language processing researchers, mobile/web ser-
to the exponential growth of the Internet and the
vice providers, tourists, etc. As per the need
WorldWideWeb,PrincetonWordNethasemerged
of these users the information content has been
as one of the most reliable, authentic and useful
moulded, keeping in sight the cognitive load that
sites (Hiray, 2015). Infirst of twosuchevaluations,
the user may be able to cope with. This can avoid
three criteria, which were, (i) Non-commerciality
unnecessary learning efforts, wastage of time and
–suchsites were freely available, (ii) Adequacy –
the burden of information overloading. In Digi-
those that covered all the 570 word families fea-
tal Aid, this is achieved through the multi-layered
turing in the AWL (Academic Word List) 1 and
information rendering approach, suitable for both
(iii) Authenticity – the academic vocabulary ex-
formal and informal learning environments.
ercises belong to the site itself. The result of this
evaluation has marked 14 useful sites in which the
th 2
Princeton WordNet stood 4 . urlhttp://www.cfilt.iitb.ac.in/hindishabdamitra/
4.2 Multi-modal learning - a psychological understand and learn the concept. Audio pro-
aspect nunciationofasearchedwordandapictureor
an illustration of a given concept is provided.
Humansaregeneticallyprogrammedtocommuni-
Apart from this, the corresponding English
cateandunderstandthingsthroughamulti-sensory
WordNetsynsetis provided. For those words
learningsystem. Thebrainprocessestheinputsre-
which do not have a corresponding synset it
ceived from different senses to comprehend con-
is given a bilingual mapping (Singh et al., ).
cepts. As per the principles of multimedia learn-
ing (Gilakjani et al., 2011), multi-modal learning
• Level 2 (Intermediate): In level 2, the inter-
leads to comprehensive learning of a concept with
mediateusersaretargeted. Here,usersareex-
higher retention rate. They have shown to stimu-
pected to have basic knowledge of Hindi. In
late thinking and learning as compared to only text
this level, all the information from level 1 is
basedcomputerinterfaces(ClarkandPaivio,1991;
rendered and additionally, more grammatical
Mayer and Moreno, 2003; Moreno and Mayer,
features such as gender for nouns, antonym
2007;ShamsandSeitz,2008;Sankeyetal.,2010).
for verb and adverb, type of verb, countability
For language learners, having a multi-modal e-
for adjective, type of adjective, spelling vari-
learning tool provides motivation for learning the
ation, etc. are appended.
second language along with the independence to
learn at one’s own pace (Lai et al., 2016). It also
• Level3(Proficient): Inlevel3,additionalin-
provides the confidence of learning and handling a
formation is presented which is necessary for
digital device with ease from an early age.
the intended user. Here, instead of simplified
In Digital Aid, information is provided in the
gloss and simplified example sentence(s), the
form of text as well as audio-visual inputs. The
Hindi wordnet’s original gloss is rendered.
textual information pertains to the gloss (original
This is because, it is expected that these users
Hindi wordnet or simplified), word usage, syn-
have a good grip over Hindi and can under-
onyms, grammatical features, lexico-semantic re-
standthecomplexityofalanguage. Fromthis
lations, ontological information. It also has audio
level onward it is very important that the con-
pronunciations of words and pictures/illustration
cept should be clearly explained, so that the
of concept, etc. This helps in learning and under-
learners can understand the fine grained dif-
standing the concept with great ease.
ference between two synsets. Figure 1 shows
ascreenshotofthetoolrenderingawordभजन
4.3 Multi-layered presentation - an
(bhajana, hymn) at Proficient level.
incremental learning approach
The presented Digital Aid is designed keeping in
mind various aspects of language teaching and
learning. It is a five layered model where selec-
tive informationisrenderedineverylayer,depend-
ing uponthetype/needoftheuserandhis/hercog-
nitive competence. This multi-layered model is
structured in a level-wise and class-wise manner.
Level-wise information presentation
The level-wise module is designed for informal
setup where a user can select any of the five levels
Figure 1: Screenshot of the Digital Aid rendering
depending upon his level of expertise. The infor-
a word भजन (bhajana, hymn) at Proficient level
mation is rendered level-wise as follows:
• Level 1 (Beginner): Level 1 is meant for • Level4(Advanced): Atlevel4,otherseman-
users who are new language learners. Here tic relations such as hypernymy, hyponymy,
information such as simplified concept defi- etc. are introduced along with all the infor-
nition, word usage, synonyms, grammatical mation presented at level 3. Here, all the
features is selectively presented. The gloss available synsets in Hindi wordnet and all the
is simplified so that the beginners can easily grammatical features are rendered.
no reviews yet
Please Login to review.