1151 norm sets
Free-association norms provide essential empirical data for investigating linguistic, semantic, and cultural phenomena in the cognitive sciences. Although large-scale norms exist for languages such as English, Dutch, Spanish, and Mandarin Chinese, no comparable resource has been available for German. To address this gap, we present free-association norms for 5,877 German cue words as part of the German version of the multilingual Small World of Words (SWOW) project. We describe the data collection procedures, participant characteristics, and our comprehensive preprocessing pipeline before introducing the resulting SWOW-DE data set. Using data from three established psycholinguistic paradigms, we show that SWOW-DE norms robustly predict performance in lexical decision tasks, relatedness judgments, and psycholinguistic word ratings. Furthermore, we demonstrate that SWOW-DE responses compare favorably with existing German resources and provide a preliminary cross-linguistic comparison revealing both shared and language-specific association patterns, highlighting promising directions for future research. Overall, SWOW-DE represents the largest collection of German free associations to date and offers a unique resource for linguistic, psychological, and cross-cultural research.
A basic task in first language acquisition likely involves discovering the boundaries between words or morphemes in input where these basic units are not overtly segmented. A number of unsupervised learning algorithms have been proposed in the last 20 years for these purposes, some of which have been implemented computationally, but whose results remain difficult to compare across papers. We created a tool that is open source, enables reproducible results, and encourages cumulative science in this domain. WordSeg has a modular architecture: It combines a set of corpora description routines, multiple algorithms varying in complexity and cognitive assumptions (including several that were not publicly available, or insufficiently documented), and a rich evaluation package. In the paper, we illustrate the use of this package by analyzing a corpus of child-directed speech in various ways, which further allows us to make recommendations for experimental design of follow-up work. Supplementary materials allow readers to reproduce every result in this paper, and detailed online instructions further enable them to go beyond what we have done. Moreover, the system can be installed within container software that ensures a stable and reliable environment. Finally, by virtue of its modular architecture and transparency, WordSeg can work as an open-source platform, to which other researchers can add their own segmentation algorithms.
The Child Language Data Exchange System (CHILDES) has played a critical role in research on child language development, particularly in characterizing the early language learning environment. Access to these data can be both complex for novices and difficult to automate for advanced users, however. To address these issues, we introduce childes-db, a database-formatted mirror of CHILDES that improves data accessibility and usability by offering novel interfaces, including browsable web applications and an R application programming interface (API). Along with versioned infrastructure that facilitates reproducibility of past analyses, these interfaces lower barriers to analyzing naturalistic parent–child language, allowing for a wider range of researchers in language and cognitive development to easily leverage CHILDES in their work.
We report on a psycholinguistic database of Chinese character handwriting based on a large-scale study that involved 203 participants, each handwriting 200 characters randomly sampled from a cohort of 1,600 characters. Apart from collecting writing latencies, durations, and accuracy, we also compiled 14 lexical variables for each character. Regressions showed that frequency, age of acquisition, and the word context (in which a character appears) are all-around and influential predictors of orthographic access (as reflected in writing latency), motor execution of handwriting (as reflected in writing duration), and accuracy. In addition, phonological factors (phonogram status, spelling regularity, and homophone density) impacted orthographic access but not handwriting execution. Semantic factors (imageability and concreteness) only affected accuracy. These results suggest, among other things, that phonology is consulted in orthographic access while handwriting. As the first of its kind, this database can be used as a source of secondary data analyses and a tool for stimulus construction in handwriting research.
Studies have shown that logographemes and radicals, subcharacter units in Chinese characters, are represented in the orthographic lexicon and are functional processing units in the writing of Chinese characters. Nevertheless, there is no consensus regarding how characters should be segmented into logographemes and radicals. This article reports handwriting data for a list of 209 Chinese characters (95 nonphonetic compounds and 114 phonetic compounds) in a copying task. To validate the constituent logographemes and radicals of the target Chinese characters, comparisons among between-radical interstroke intervals (ISIs), between-logographeme ISIs, and within-logographeme ISIs, as well as their interactions with orthographic factors including character frequency, stroke number, and configuration, were conducted using factorial analyses. The results showed that the ISI comparison method is effective in validating the constituent logographemes and radicals in Chinese characters. On the basis of this list of 209 stimuli, another 1,227 Chinese characters that share the same set of radicals with the stimuli were further identified. Their constituent logographemes were deduced accordingly. Altogether, the over 1,000 Chinese characters with validated constituent logographemes will serve as a powerful reference for future psycholinguistic and neurolinguistic research. Future potential applications are discussed.
While a reasonable amount of work has gone into automatically geoparsing text at the city or higher levels of granularity for different types of texts in different domains, there is relatively little research on geoparsing fine-grained locations such as buildings, green spaces and street names in text. This paper reports on how the Edinburgh Geoparser performs on this task for different types of literary text set in Edinburgh, the first UNESCO City of Literature. The non-copyrighted gold standard datasets created for this purpose are released along with this article.
Abstract Voice synthesis is a useful method for investigating the communicative role of different acoustic features. Although many text-to-speech systems are available, researchers of human nonverbal vocalizations and bioacousticians may profit from a dedicated simple tool for synthesizing and manipulating natural-sounding vocalizations. Soundgen (https://CRAN.R-project.org/package=soundgen) is an open-source R package that synthesizes nonverbal vocalizations based on meaningful acoustic parameters, which can be specified from the command line or in an interactive app. This tool was validated by comparing the perceived emotion, valence, arousal, and authenticity of 60 recorded human nonverbal vocalizations (screams, moans, laughs, and so on) and their approximate synthetic reproductions. Each synthetic sound was created by manually specifying only a small number of high-level control parameters, such as syllable length and a few anchors for the intonation contour. Nevertheless, the valence and arousal ratings of synthetic sounds were similar to those of the original recordings, and the authenticity ratings were comparable, maintaining parity with the originals for less complex vocalizations. Manipulating the precise acoustic characteristics of synthetic sounds may shed light on the salient predictors of emotion in the human voice. More generally, soundgen may prove useful for any studies that require precise control over the acoustic features of nonspeech sounds, including research on animal vocalizations and auditory perception.
We present a corpus of Finnish news articles with a manually prepared named entity annotation. The corpus consists of 953 articles (193,742 word tokens) with six named entity classes (organization, location, person, product, event, and date). The articles are extracted from the archives of Digitoday, a Finnish online technology news source. The corpus is available for research purposes. We present baseline experiments on the corpus using a rule-based and two deep learning systems on two, in-domain and out-of-domain, test sets.
Most words are ambiguous, with interpretation dependent on context. Advancing theories of ambiguity resolution is important for any general theory of language processing, and for resolving inconsistencies in observed ambiguity effects across experimental tasks. Focusing on homonyms (words such as bank with unrelated meanings EDGE OF A RIVER vs. FINANCIAL INSTITUTION), the present work advances theories and methods for estimating the relative frequency of their meanings, a factor that shapes observed ambiguity effects. We develop a new method for estimating meaning frequency based on the meaning of a homonym evoked in lines of movie and television subtitles according to human raters. We also replicate and extend a measure of meaning frequency derived from the classification of free associates. We evaluate the internal consistency of these measures, compare them to published estimates based on explicit ratings of each meaning’s frequency, and compare each set of norms in predicting performance in lexical and semantic decision mega-studies. All measures have high internal consistency and show agreement, but each is also associated with unique variance, which may be explained by integrating cognitive theories of memory with the demands of different experimental methodologies. To derive frequency estimates, we collected manual classifications of 533 homonyms over 50,000 lines of subtitles, and of 357 homonyms across over 5000 homonym–associate pairs. This database—publicly available at: www.blairarmstrong.net/homonymnorms/—constitutes a novel resource for computational cognitive modeling and computational linguistics, and we offer suggestions around good practices for its use in training and testing models on labeled data.
Ratings of body–object interaction (BOI) measure the ease with which the human body can interact with a word’s referent. Researchers have studied the effects of BOI in order to investigate the relationships between sensorimotor and cognitive processes. Such efforts could be improved, however, by the availability of more extensive BOI norms. In the present work, we collected BOI ratings for over 9,000 words. These new norms show good reliability and validity and have extensive overlap with the words used both in other lexical and semantic norms and in the available behavioral megastudies (e.g., the English Lexicon Project, Balota, Yap, Cortese, Hutchison, Kessler, & Loftis in Behavior Research Methods, 39, 445–459, 2007; and the Calgary Semantic Decision Project, Pexman, Heard, Lloyd, & Yap in Behavior Research Methods, 49, 407–417, 2017). In analyses using the new BOI norms, we found that high-BOI words tended to be more concrete, more graspable, and more strongly associated with sensory, haptic, and visual experience than are low-BOI words. When we used the new norms to predict response latencies and accuracy data from the behavioral megastudies, we found that BOI was a stronger predictor of responses in the semantic decision task than in the lexical decision task. These findings are consistent with a dynamic, multidimensional account of lexical semantics. The norms described here should be useful for future research examining the effects of sensorimotor experience on performance in tasks involving word stimuli.
Algeria’s socio-linguistic situation is known as a complex phenomenon involving several historical, cultural and technological factors. However, there are three languages that are mainly spoken in Algeria (Arabic, Tamazight and French) and they can be mixed in the same sentence (code-switching). Moreover, there are several varieties of dialects that differ from one region to another and sometimes within the same region. This paper aims to provide a new multi-purpose parallel corpus (i.e., DZDC12 corpus), which will serve as a testbed for various natural language processing and information retrieval applications. In particular, it can be a useful tool to study Arabic–French code-switching phenomenon, Algerian Romanized Arabic (Arabizi), different Algerian sub-dialects, sentiment analysis, gender writing style, machine translation, abuse detection, etc. To the best of our knowledge, the proposed corpus is the first of its kind, where the texts are written in Latin script and crawled from Facebook. More specifically, this corpus is organised by gender, region and city, and is transliterated into Arabic script and translated into Modern Standard Arabic. In addition, it is annotated for emotion detection and abuse detection, and annotated at the word level. This article focuses in particular on Algeria’s socio-linguistic situation and the effect of social media networks. Furthermore, the general guidelines for the design of DZDC12 corpus are described as well as the dialects clustering over the map.
Around the world, a growing interest has been seen in learner translator corpora, which are invaluable resources for teaching and research. This paper introduces a new resource to support researchers from different interdisciplinary areas such as computational linguistics, descriptive translation studies, computer-aided translation technology, Arabic machine translation applications, cognitive science, and translation pedagogy. Motivated by the lack of learner translator resources that provide data about learners of translation from and into Arabic, the undergraduate learner translator corpus (ULTC) is an ongoing, error-tagged sentence-aligned parallel corpus of English, Arabic, and French, with Arabic as its main language. The present corpus, consisting of parallel texts of female learners of translation from English or French into Arabic, is the first of its kind in terms of the languages represented, tasks covered, and number of students involved. It is also unique in terms of combining many complementary corpora of cross-lingual data, each of which has its own web-based query interface and corpus analysis tools. This paper describes the ULTC compilation process, preliminary findings, and planned future expansion and research.
The article details the formational process of the FinnTransFrame corpus, a part of the FinnFrameNet project. In addition to a large annotated frame semantic corpus of natural language examples, the project created a separate corpus of examples translated from English to Finnish. The research question when creating the FinnTransFrame corpus was to see to what extent the various frames of the original Berkeley FrameNet transfer into Finnish in translated examples, i.e. what are the main problems and how can they be categorized? A variety of Berkeley FrameNet examples were chosen from different frames and then translated by professionals. The FinnFrameNet annotation team checked all the examples and their translations to see if the frames remained intact in translation. Problematic examples were tagged according to the type of the encountered problem, with the main focus on the type of fine-grained mismatches of meaning that caused frame changes even when the translation was the best possible one. The frame-loss amounted to 4.2% of the 88,209 relevant example sentences. Filtering out sentences with other types of problems, we found that 88.1% of all the frame instances still translated into Finnish with their frame intact. In addition, the article analyzes the error types in the problematic frames.
Spoken corpora are important for speech research, but are expensive to create and do not necessarily reflect (read or spontaneous) speech ‘in the wild’. We report on our conversion of the preexisting and freely available Spoken Wikipedia into a speech resource. The Spoken Wikipedia project unites volunteer readers of Wikipedia articles. There are initiatives to create and sustain Spoken Wikipedia versions in many languages and hence the available data grows over time. Thousands of spoken articles are available to users who prefer a spoken over the written version. We turn these semi-structured collections into structured and time-aligned corpora, keeping the exact correspondence with the original hypertext as well as all available metadata. Thus, we make the Spoken Wikipedia accessible for sustainable research. We present our open-source software pipeline that downloads, extracts, normalizes and text–speech aligns the Spoken Wikipedia. Additional language versions can be exploited by adapting configuration files or extending the software if necessary for language peculiarities. We also present and analyze the resulting corpora for German, English, and Dutch, which presently total 1005 h and grow at an estimated 87 h per year. The corpora, together with our software, are available via http://islrn.org/resources/684-927-624-257-3/. As a prototype usage of the time-aligned corpus, we describe an experiment about the preferred modalities for interacting with information-rich read-out hypertext. We find alignments to help improve user experience and factual information access by enabling targeted interaction.
Advances in emotional speech recognition and synthesis essentially rely on the availability of annotated emotional speech corpora. As a low resource language, the Thai language critically lacks corpora of emotional speech, although a few corpora have been constructed for speech recognition and synthesis. This paper presents the design of a Thai emotional speech corpus (namely EMOLA), its construction and annotation process, and its analysis. In the corpus design, four basic types with twelve subtypes of emotions are defined with consideration of the Pleasure-Arousal-Dominance emotional state model. To construct the corpus, a series of Thai dramas (1397 min) were selected and its video clips of approximately 868 min were annotated. As a result, 8987 transcriptions (of conversation turns) were derived in total, with each transcription tagged as one basic type and a few subtypes. Finally, an analysis was conducted to describe the characteristics of this corpus in three sets of statistics: collection-level, annotator-oriented and actor-oriented statistics.
The Large Database of English Compounds (LADEC) consists of over 8,000 English words that can be parsed into two constituents that are free morphemes, making it the largest existing database specifically for use in research on compound words. Both monomorphemic (e.g., wheel) and multimorphemic (e.g., teacher) constituents were used. The items were selected from a range of sources, including CELEX, the English Lexicon Project, the British Lexicon Project, the British National Corpus, and Wordnet, and were hand-coded as compounds (e.g., snowball). Participants rated each compound in terms of how predictable its meaning is from its parts, as well as the extent to which each constituent retains its meaning in the compound. In addition, we obtained linguistic characteristics that might influence compound processing (e.g., frequency, family size, and bigram frequency). To show the usefulness of the database in investigating compound processing, we conducted a number of analyses that showed that compound processing is consistently affected by semantic transparency, as well as by many of the other variables included in LADEC. We also showed that the effects of the variables associated with the two constituents are not symmetric. In short, LADEC provides the opportunity for researchers to investigate a number of questions about compounds that have not been possible to investigate in the past, due to the lack of sufficiently large and robust datasets. In addition to directly allowing researchers to test hypotheses using the information included in LADEC, the database will contribute to future compound research by allowing better stimulus selection and matching.
In the domain of cognitive studies on the lexico-semantic representational system, one of the most important means of ensuring effective experimental designs is using ecological stimulus sets accompanied by normative data on the most relevant variables affecting the processing of their items. In the context of image sets, color photographs are particularly suited to this purpose as they reduce the difficulty of visual decoding processes that may emerge with traditional image sets of line drawings. This is especially so in clinical populations. In this study we provide Italian norms for a set of 357 high quality image-items belonging to 23 semantic subcategories from the Moreno-Martínez and Montoro database. Data from several variables affecting image processing were collected from a sample of 255 Italian-speaking participants: age of acquisition, familiarity, lexical frequency, manipulability, name agreement, typicality and visual complexity. Lexical frequency data were derived from t)
The aim of word sense disambiguation (WSD) is to correctly identify the meaning of a word in context. All natural languages exhibit word sense ambiguities and these are often hard to resolve automatically. Consequently WSD is considered an important problem in natural language processing (NLP). Standard evaluation resources are needed to develop, evaluate and compare WSD methods. A range of initiatives have lead to the development of benchmark WSD corpora for a wide range of languages from various language families. However, there is a lack of benchmark WSD corpora for South Asian languages including Urdu, despite there being over 300 million Urdu speakers and a large amounts of Urdu digital text available online. To address that gap, this study describes a novel benchmark corpus for the Urdu Lexical Sample WSD task. This corpus contains 50 target words (30 nouns, 11 adjectives, and 9 verbs). A standard, manually crafted dictionary called Urdu Lughat is used as a sense inventory. Four baseline WSD approaches were applied to the corpus. The results show that the best performance was obtained using a simple Bag of Words approach. To encourage NLP research on the Urdu language the corpus is freely available to the research community.
This paper introduces Emilia, a speech corpus created to build a female voice in Spanish spoken in Buenos Aires for the Aromo text-to-speech system. Aromo is a unit selection text-to-speech system, which employs diphones as units of synthesis. The key requirements and design criteria for Emilia were: to synthesize any text in Spanish into high-quality speech with a minimum corpus size. The text corpus was designed to guarantee the phonetic and prosodic coverage. A three-stage strategy was used: in the first stage, 741 sentences were designed with all of the syllables of Spanish spoken in Argentina, with and without stress, and in all positions within the word; in the second stage, 852 sentences were added to balance out the distribution of the diphones; and after a perceptual evaluation of the quality of synthesized speech, in the third and final stage, 625 sentences were added to achieve the specified unit coverage, and to introduce sentences with more complex syntactic and prosodic structures. Issues from all three corpus building stages are reported. The paper also presents the results from the quality perceptual evaluations of the synthesized voice. Emilia has a duration of three hours and 15 minutes; its speech quality synthesized with Aromo system is similar to the level obtained with commercial systems, with a real-time ratio less than one.
This article describes the procedures employed during the development of the first comprehensive machine-readable Turkish Sign Language (TiD) resource: a bilingual lexical database and a parallel corpus between Turkish and TiD. In addition to sign language specific annotations (such as non-manual markers, classifiers and buoys) following the recently introduced TiD knowledge representation (Eryiğit et al. 2016), the parallel corpus contains also annotations of dependency relations, which makes it the first parallel treebank between a sign language and an auditory-vocal language.
The LENA system has revolutionized research on language acquisition, providing both a wearable device to collect day-long recordings of children’s environments, and a set of automated outputs that process, identify, and classify speech using proprietary algorithms. This output includes information about input sources (e.g., adult male, electronics). While this system has been tested across a variety of settings, here we delve deeper into validating the accuracy and reliability of LENA’s automated diarization, i.e., tags of who is talking. Specifically, we compare LENA’s output with a gold standard set of manually generated talker tags from a dataset of 88 day-long recordings, taken from 44 infants at 6 and 7 months, which includes 57,983 utterances. We compare accuracy across a range of classifications from the original Lena Technical Report, alongside a set of analyses examining classification accuracy by utterance type (e.g., declarative, singing). Consistent with previous validations, we find overall high agreement between the human and LENA-generated speaker tags for adult speech in particular, with poorer performance identifying child, overlap, noise, and electronic speech (accuracy range across all measures: 0–92%). We discuss several clear benefits of using this automated system alongside potential caveats based on the error patterns we observe, concluding with implications for research using LENA-generated speaker tags.
Language is one the earliest capacities affected by cognitive change. To monitor that change longitudinally, we have developed a web portal for remote linguistic data acquisition, called Talk2Me, consisting of a variety of tasks. In order to facilitate research in different aspects of language, we provide baselines including the relations between different scoring functions within and across tasks. These data can be used to augment studies that require a normative model; for example, we provide baseline classification results in identifying dementia. These data are released publicly along with a comprehensive open-source package for extracting approximately two thousand lexico-syntactic, acoustic, and semantic features. This package can be applied arbitrarily to studies that include linguistic data. To our knowledge, this is the most comprehensive publicly available software for extracting linguistic features. The software includes scoring functions for different tasks. [ABSTRACT FROM)
We present a new set of subjective Age of Acquisition (AoA) ratings for 299 words (158 nouns, 141 verbs) in seven languages from various language families and cultural settings: American English, Czech, Scottish Gaelic, Lebanese Arabic, Malaysian Malay, Persian, and Western Armenian. The ratings were collected from a total of 173 participants and were highly reliable in each language. We applied the same method of data collection as used in a previous study on 25 languages which allowed us to create a database of fully comparable AoA ratings of 299 words in 32 languages. We found that in the seven languages not included in the previous study, the words are estimated to be acquired at roughly the same age as in the previously reported languages, i.e. mostly between the ages of 1 and 7 years. We also found that the order of word acquisition is moderately to highly correlated across all 32 languages, which extends our previous conclusion that early words are acquired in similar order acr)
The Moral Foundations Dictionary (MFD) is a useful tool for applying the conceptual framework developed in Moral Foundations Theory and quantifying the moral meanings implicated in the linguistic information people convey. However, the applicability of the MFD is limited because it is available only in English. Translated versions of the MFD are therefore needed to study morality across various cultures, including non-Western cultures. The contribution of this paper is two-fold. We developed the first Japanese version of the MFD (referred to as the J-MFD) using a semi-automated method—this serves as a reference when translating the MFD into other languages. We next tested the validity of the J-MFD by analyzing open-ended written texts about the situations that Japanese participants thought followed and violated the five moral foundations. We found that the J-MFD correctly categorized the Japanese participants’ descriptions into the corresponding moral foundations, and that the Moral F)
We introduce a dataset for studying the evolution of words, constructed from WordNet and the Google Books Ngram Corpus. The dataset tracks the evolution of 4,000 synonym sets (synsets), containing 9,000 English words, from 1800 AD to 2000 AD. We present a supervised learning algorithm that is able to predict the future leader of a synset: the word in the synset that will have the highest frequency. The algorithm uses features based on a word’s length, the characters in the word, and the historical frequencies of the word. It can predict change of leadership (including the identity of the new leader) fifty years in the future, with an F-score considerably above random guessing. Analysis of the learned models provides insight into the causes of change in the leader of a synset. The algorithm confirms observations linguists have made, such as the trend to replace the -ise suffix with -ize, the rivalry between the -ity and -ness suffixes, and the struggle between economy (shorter words ar)
We present the design and development of a South African directory enquiries corpus. It contains audio and orthographic transcriptions of a wide range of South African names produced by first-language speakers of four languages, namely Afrikaans, English, isiZulu and Sesotho. Useful as a resource to understand the effect of name language and speaker language on pronunciation, this is the first corpus to also aim to identify the “intended language”: an implicit assumption with regard to word origin made by the speaker of the name. We describe the design, collection, annotation, and verification of the corpus. This includes an analysis of the algorithms used to tag the corpus with meta information that may be beneficial to pronunciation modelling tasks.
Ongoing advances in computer technology have opened up a deluge of new datasets for understanding human behavior (Goldstone & Lupyan, 2016). Many of these datasets provide information on the use of written language. However, data on naturally occurring spoken-language conversations are much more difficult to obtain. A major exception to this is the TalkBank system, which provides online multimedia data for 14 types of spoken-language data: language in aphasia, child language, stuttering, child phonology, autism spectrum disorder, bilingualism, Conversation Analysis, classroom discourse, dementia, right hemisphere damage, Danish conversation, second language learning, traumatic brain injury, and daylong recordings in the home. The present report reviews these resources and describes the ways they are being used to further our understanding of human language and communication.
The application of word associations has become increasingly widespread. However, the association norms produced by traditional free association tests tend not to exceed 10,000 stimulus words, making the number of associated words too small to be representative of the overall language. In this study we used text corpora totaling over 400 million Chinese words, along with a multitude of association measures, to automatically construct a Chinese Lexical Association Database (CLAD) comprising the lexical association of over 80,000 words. Comparison of the CLAD with a database of traditional Chinese word association norms shows that word associations extracted from large text corpora are similar in strength to those elicited from free association tests but contain a much greater number of associative word pairs. Additionally, the relatively small numbers of participants involved in the creation of traditional norms result in relatively coarse scales of association measurement, whereas the differentiation of association strengths is greatly enhanced in the CLAD. The CLAD provides researchers with a great supplement to traditional word association norms. A query website at www.chinesereadability.net/LexicalAssociation/CLAD/ affords access to the database.
Embodiment theory suggests that, during the processing of words related to movement, as in the case of action verbs, somatotopic activation is produced in the motor and premotor cortices. In the same way, some studies have demonstrated that patients with frontal-lobe damage, such as Parkinson’s patients, have difficulties processing that kind of stimulus. At the moment, no standardized data exist concerning the motor content of Spanish verbs. Therefore, the aim of the present research was to develop a database of 4,565 verbs in Spanish through a survey filled out by 152 university students. The value for the motor content was obtained by calculating the average value from the answers of the participants. In addition, the reliability of the results was estimated, as well as their convergent validity, using diverse correlation coefficients. The database and the raw responses of the participants can be downloaded from this website: https://inco.grupos.uniovi.es/enlaces.
Automatic term extraction is a productive field of research within natural language processing, but it still faces significant obstacles regarding datasets and evaluation, which require manual term annotation. This is an arduous task, made even more difficult by the lack of a clear distinction between terms and general language, which results in low inter-annotator agreement. There is a large need for well-documented, manually validated datasets, especially in the rising field of multilingual term extraction from comparable corpora, which presents a unique new set of challenges. In this paper, a new approach is presented for both monolingual and multilingual term annotation in comparable corpora. The detailed guidelines with different term labels, the domain- and language-independent methodology and the large volumes annotated in three different languages and four different domains make this a rich resource. The resulting datasets are not just suited for evaluation purposes but can also serve as a general source of information about terms and even as training data for supervised methods. Moreover, the gold standard for multilingual term extraction from comparable corpora contains information about term variants and translation equivalents, which allows an in-depth, nuanced evaluation.
Word associations have been used widely in psychology, but the validity of their application strongly depends on the number of cues included in the study and the extent to which they probe all associations known by an individual. In this work, we address both issues by introducing a new English word association dataset. We describe the collection of word associations for over 12,000 cue words, currently the largest such English-language resource in the world. Our procedure allowed subjects to provide multiple responses for each cue, which permits us to measure weak associations. We evaluate the utility of the dataset in several different contexts, including lexical decision and semantic categorization. We also show that measures based on a mechanism of spreading activation derived from this new resource are highly predictive of direct judgments of similarity. Finally, a comparison with existing English word association sets further highlights systematic improvements provided through these new norms.
The research of the word is still very much the research of the noun. Adjectives have been largely overlooked, despite being the second-largest word class in many languages and serving an important communicative function, because of the rich, nuanced qualifications they afford. Adjectives are also ideally suited to study the interface between cognition and emotion, as they naturally cover the entire range of lexicosemantic variables such as imageability (infinite–green), and affective variables such as valence (sad–happy). We illustrate this by showing how the centrality of words in the mental lexicon varies as a function of the words’ affective dimensions, using newly collected norms for 1,000 Dutch adjectives. The norms include the lexicosemantic variables age of acquisition, familiarity, concreteness, and imageability; the affective variables valence, arousal, and dominance; and a variety of distributional variables, including network statistics resulting from a large-scale word association study. The norms are freely available from https://osf.io/nyg8v/, for researchers studying adjectives specifically or for whom adjectives constitute convenient stimuli to study other topics, such as vagueness, inference, spatial cognition, or affective word processing.
Here we describe the Jena Speaker Set (JESS), a free database for unfamiliar adult voice stimuli, comprising voices from 61 young (18–25 years) and 59 old (60–81 years) female and male speakers uttering various sentences, syllables, read text, semi-spontaneous speech, and vowels. Listeners rated two voice samples (short sentences) per speaker for attractiveness, likeability, two measures of distinctiveness (“deviation”-based [DEV] and “voice in the crowd”-based [VITC]), regional accent, and age. Interrater reliability was high, with Cronbach’s α between .82 and .99. Young voices were generally rated as more attractive than old voices, but particularly so when male listeners judged female voices. Moreover, young female voices were rated as more likeable than both young male and old female voices. Young voices were judged to be less distinctive than old voices according to the DEV measure, with no differences in the VITC measure. In age ratings, listeners almost perfectly discriminated young from old voices; additionally, young female voices were perceived as being younger than young male voices. Correlations between the rating dimensions above demonstrated (among other things) that DEV-based distinctiveness was strongly negatively correlated with rated attractiveness and likeability. By contrast, VITC-based distinctiveness was uncorrelated with rated attractiveness and likeability in young voices, although a moderate negative correlation was observed for old voices. Overall, the present results demonstrate systematic effects of vocal age and gender on impressions based on the voice and inform as to the selection of suitable voice stimuli for further research into voice perception, learning, and memory.
We present a new dataset of English word recognition times for a total of 62 thousand words, called the English Crowdsourcing Project. The data were collected via an internet vocabulary test in which more than one million people participated. The present dataset is limited to native English speakers. Participants were asked to indicate which words they knew. Their response times were registered, although at no point were the participants asked to respond as quickly as possible. Still, the response times correlate around .75 with the response times of the English Lexicon Project for the shared words. Also, the results of virtual experiments indicate that the new response times are a valid addition to the English Lexicon Project. This not only means that we have useful response times for some 35 thousand extra words, but we now also have data on differences in response latencies as a function of education and age.
This article presents the development of the “Hoosier Vocal Emotions Corpus,” a stimulus set of recorded pseudo-words based on the pronunciation rules of English. The corpus contains 73 controlled audio pseudo-words uttered by two actresses in five different emotions (i.e., happiness, sadness, fear, anger, and disgust) and in a neutral tone, yielding 1,763 audio files. In this article, we describe the corpus as well as a validation study of the pseudo-words. A total of 96 native English speakers completed a forced choice emotion identification task. All emotions were recognized better than chance overall, with substantial variability among the different tokens. All of the recordings, including the ambiguous stimuli, are made freely available, and the recognition rates and the full confusion matrices for each stimulus are provided in order to assist researchers and clinicians in the selection of stimuli. The corpus has unique characteristics that can be useful for experimental paradigms that require controlled stimuli (e.g., electroencephalographic or fMRI studies). Stimuli from this corpus could be used by researchers and clinicians to answer a variety of questions, including investigations of emotion processing in individuals with certain temperamental or behavioral characteristics associated with difficulties in emotion recognition (e.g., individuals with psychopathic traits); in bilingual individuals or nonnative English speakers; in patients with aphasia, schizophrenia, or other mental health disorders (e.g., depression); or in training automatic emotion recognition algorithms. The Hoosier Vocal Emotions Corpus is available at https://psycholinguistics.indiana.edu/hoosiervocalemotions.htm.
Studies on morphological processing in French, as in other languages, have shown disparate results. We argue that a critical and long-overlooked factor that could underlie these diverging results is the methodological differences in the calculation of morphological variables across studies. To address the need for a common morphological database, we present MorphoLex-FR, a sizeable and freely available database with 12 variables for prefixes, roots, and suffixes for the 38,840 words of the French Lexicon Project. MorphoLex-FR constitutes a first step to render future studies addressing morphological processing in French comparable. The procedure we used for morphological segmentation and variable computation is effectively the same as that in MorphoLex, an English morphological database. This will allow for cross-linguistic comparisons of future studies in French and English that will contribute to our understanding of how morphologically complex words are processed. To validate these variables, we explored their influence on lexical decision latencies for morphologically complex nouns in a series of hierarchical regression models. The results indicated that only morphological variables related to the suffix explained lexical decision latencies. The frequency and family size of the suffix exerted facilitatory effects, whereas the percentage of more frequent words in the morphological family of the suffix was inhibitory. Our results are in line with previous studies conducted in French and in English. In conclusion, this database represents a valuable resource for studies on the effect of morphology in visual word processing in French.
The paper describes the conversion of an LFG treebank of Polish into enhanced Universal Dependencies, and—more generally—identifies the kinds of information lost in translation from LFG to UD. The paper also presents the resulting UD treebank of Polish and compares it to the previous UD treebank of Polish.
The Glasgow Norms are a set of normative ratings for 5,553 English words on nine psycholinguistic dimensions: arousal, valence, dominance, concreteness, imageability, familiarity, age of acquisition, semantic size, and gender association. The Glasgow Norms are unique in several respects. First, the corpus itself is relatively large, while simultaneously providing norms across a substantial number of lexical dimensions. Second, for any given subset of words, the same participants provided ratings across all nine dimensions (33 participants/word, on average). Third, two novel dimensions—semantic size and gender association—are included. Finally, the corpus contains a set of 379 ambiguous words that are presented either alone (e.g., toast) or with information that selects an alternative sense (e.g., toast (bread), toast (speech)). The relationships between the dimensions of the Glasgow Norms were initially investigated by assessing their correlations. In addition, a principal component analysis revealed four main factors, accounting for 82% of the variance (Visualization, Emotion, Salience, and Exposure). The validity of the Glasgow Norms was established via comparisons of our ratings to 18 different sets of current psycholinguistic norms. The dimension of size was tested with megastudy data, confirming findings from past studies that have explicitly examined this variable. Alternative senses of ambiguous words (i.e., disambiguated forms), when discordant on a given dimension, seemingly led to appropriately distinct ratings. Informal comparisons between the ratings of ambiguous words and of their alternative senses showed different patterns that likely depended on several factors (the number of senses, their relative strengths, and the rating scales themselves). Overall, the Glasgow Norms provide a valuable resource—in particular, for researchers investigating the role of word recognition in language comprehension.
This paper introduces a large-scale, validated database for Persian called Sharif Emotional Speech Database (ShEMO). The database includes 3000 semi-natural utterances, equivalent to 3 h and 25 min of speech data extracted from online radio plays. The ShEMO covers speech samples of 87 native-Persian speakers for five basic emotions including anger, fear, happiness, sadness and surprise, as well as neutral state. Twelve annotators label the underlying emotional state of utterances and majority voting is used to decide on the final labels. According to the kappa measure, the inter-annotator agreement is 64% which is interpreted as “substantial agreement”. We also present benchmark results based on common classification methods in speech emotion detection task. According to the experiments, support vector machine achieves the best results for both gender-independent (58.2%) and gender-dependent models (female = 59.4%, male = 57.6%). The ShEMO will be available for academic purposes free of charge to provide a baseline for further research on Persian emotional speech.
This paper presents the DialogBank, a new language resource consisting of dialogues with gold standard annotations according to the ISO 24617-2 standard. Some of these dialogues have been taken from existing corpora and have been re-annotated, offering the possibility to compare annotations according to different schemes; others have been newly annotated directly according to the standard. The ISO standard annotations in the DialogBank make use of three alternative representation formats, which are shown to be interoperable. The (re-)annotation brought certain deficiencies and limitations of the ISO standard to light, which call for considering possible revisions and extensions, and for exploring the possible integration of dialogue act annotations with other semantic annotations.
Compared to early language development, later changes to the language system during orthography and literacy acquisition have not yet been researched in detail. We present a longitudinal corpus of texts on short picture stories written by German primary school children between grades 2 and 4 and grades 3 and 4. It includes 1,922 texts with 212,505 tokens (6,364 types) from 251 children. For each text, rich metadata is available, including age, grade and linguistic background (at least 60% of the children were multilingual). To our knowledge, our corpus is the largest longitudinal corpus of written texts by children at primary school age. Each word is included in its original spelling as well as in a normalized form (target hypothesis), specifying the intended word form, which we corrected for orthographic but not grammatical errors. Original and target word forms are aligned character-wise and the target word forms are enriched with phonological, syllabic, and morphological information. Additionally, for each target word form, we established key lexical variables, e.g., word frequency or summed bigram frequency, as specified in childLex. Where applicable, we also specify key features of German orthography (e.g., consonant doubling, vowel-lengthening <h>). Taken together, this information allows for a detailed assessment of the properties of words that tend to increase the likelihood of spelling errors. The corpus is available in different formats—as tab-delimited annotated token and type based lists, in an XML format, and via the corpus search tool ANNIS.
Modality exclusivity norms have been developed in different languages for research on the relationship between perceptual and conceptual systems. This paper sets up the first modality exclusivity norms for Chinese, a Sino-Tibetan language with semantics as its orthographically relevant level. The norms are collected through two studies based on Chinese sensory words. The experimental designs take into consideration the morpho-lexical and orthographic structures of Chinese. Study 1 provides a set of norms for Mandarin Chinese single-morpheme words in mean ratings of the extent to which a word is experienced through the five sense modalities. The degrees of modality exclusivity are also provided. The collected norms are further analyzed to examine how sub-lexical orthographic representations of sense modalities in Chinese characters affect speakers’ interpretation of the sensory words. In particular, we found higher modality exclusivity rating for the sense modality explicitly represent)
Corpus-based research has formed the backbone of linguistic research in recent decades. Large text corpora are used for solving various kinds of linguistic problems, including those of quantitative linguistics, cognitive linguistics, and psycholinguistics. This paper reports the creation of two corpora of contemporary Vietnamese. It also describes the construction of these two equally sized Vietnamese corpora (a corpus from Vietnamese film subtitles, subtlex-viet, and a general corpus of varieties of online newspapers and stories, genlex-viet). We document the general steps of the construction and extraction of linguistic information from the language corpora and provide a road map for others who would like to create similar corpora. The resultant corpora are available in three versions: plain text, tokenized, and POS tagged. In the second half of the paper, the construction of a lexical database derived from the corpora is described. The database includes measures such as frequency of occurrence, dispersion, Mutual Information, Inverse Document Frequency, as well as vector space measures based on Latent Semantic Analysis and Hyperspace Analogue to Language. We conclude by reporting a comparison of the lexical predictors and a validation using psycholinguistic data from visual lexical decision experiments.
Vowels in Arabic are optional orthographic symbols written as diacritics above or below letters. In Arabic texts, typically more than 97 percent of written words do not explicitly show any of the vowels they contain; that is to say, depending on the author, genre and field, less than 3 percent of words include any explicit vowel. Although numerous studies have been published on the issue of restoring the omitted vowels in speech technologies, little attention has been given to this problem in papers dedicated to written Arabic technologies. In this research, we present Arabic-Unitex, an Arabic Language Resource, with emphasis on vowel representation and encoding. Specifically, we present two dozens of rules formalizing a detailed description of vowel omission in written text. They are typographical rules integrated into large-coverage resources for morphological annotation. For restoring vowels, our resources are capable of identifying words in which the vowels are not shown, as well as words in which the vowels are partially or fully included. By taking into account these rules, our resources are able to compute and restore for each word form a list of compatible fully vowelized candidates through omission-tolerant dictionary lookup. In our previous studies, we have proposed a straightforward encoding of taxonomy for verbs (Neme in Proceedings of the international workshop on lexical resources (WoLeR) at ESSLLI, 2011) and broken plurals (Neme and Laporte in Lang Sci, 2013, http://dx.doi.org/10.1016/j.langsci.2013.06.002). While traditional morphology is based on derivational rules, our description is based on inflectional ones. The breakthrough lies in the reversal of the traditional root-and-pattern Semitic model into pattern-and-root, giving precedence to patterns over roots. The lexicon is built and updated manually and contains 76,000 fully vowelized lemmas. It is then inflected by means of finite-state transducers (FSTs), generating 6 million forms. The coverage of these inflected forms is extended by formalized grammars, which accurately describe agglutinations around a core verb, noun, adjective or preposition. A laptop needs one minute to generate the 6 million inflected forms in a 340-MB flat file, which is compressed in 2 min into 11 MB for fast retrieval. Our program performs the analysis of 5000 words/second for running text (20 pages/second). Based on these comprehensive linguistic resources, we created a spell checker that detects any invalid/misplaced vowel in a fully or partially vowelized form. Finally, our resources provide a lexical coverage of more than 99 percent of the words used in popular newspapers, and restore vowels in words (out of context) simply and efficiently.
What are known as specialized or specialist dictionaries are much more than lists of words and their definitions with occasional comments on things such as synonymy and homonymy. That is to say, a particular specialist term may be associated with many other concepts, including quotations, different senses, etymological categories, semantic categories, superordinate and subordinate terms in the terminological hierarchy, spelling variants, and references to background sources discussing the exact meaning and application of the term. The various concepts, in turn, form networks of mutual links, which makes the structure of the background concepts demanding to model when designing a database structure for this type of dictionary. The Dictionary of medical vocabulary in English, 1375–1550 is a specialized historical dictionary that covers the vast medical lexicon of the centuries examined. It comprises over 12,000 terms, each of them associated with a host of background concepts. Compiling the dictionary took over 15 years. The process started with an analysis of hand-written manuscripts and early printed books from different sources and ended with the electronic dictionary described in the present paper. Over these years, the conceptual structure, database schema, and requirements for essential use cases were iteratively developed. In our paper, we introduce the conceptual structure and database schema modelled for implementing an electronic dictionary that involves different use cases such as term insertion and linking a term to related concepts. The achieved conceptual model, database structure, and use cases provide a general framework for reference-oriented specialized dictionaries, including ones with a historical orientation.
Corpora play an important role when training machine learning systems for sentiment analysis. However, Spanish is underrepresented in these corpora, as most primarily include English texts. This paper describes 20 Spanish-language text corpora—collected to support different tasks related to sentiment analysis, ranging from polarity to emotion categorization. We present a brand-new framework for the characterization of corpora. This includes a number of features to help analyze resources at both corpus level and document level. This survey—besides depicting the overall landscape of corpora in Spanish—supports sentiment analysis practitioners with the task of selecting the most suitable resources.
Arabic is a widely-spoken language with a long and rich history, but existing corpora and language technology focus mostly on modern Arabic and its varieties. Therefore, studying the history of the language has so far been mostly limited to manual analyses on a small scale. In this work, we present a large-scale historical corpus of the written Arabic language, spanning 1400 years. We describe our efforts to clean and process this corpus using Arabic NLP tools, including the identification of reused text. We study the history of the Arabic language using a novel automatic periodization algorithm, as well as other techniques. Our findings confirm the established division of written Arabic into Modern Standard and Classical Arabic, and confirm other established periodizations, while suggesting that written Arabic may be divisible into still further periods of development.
This paper introduces MadSex, a spoken corpus of 54 sociolinguistic interviews in Spanish based on the topic of sexuality. It was collected in order to study the cognitive sociolinguistic variation of sexual concepts. The paper presents and justifies methodological decisions taken during design, collection and transcription stages. Informants were selected in Madrid, based on a pre-stratified sample divided by sex, age and level of education. The interview methodology relied on an opinion questionnaire designed for the indirect elicitation of sexual concepts, which overcame successfully the limitations imposed by the low frequencies of semantic variables in discourse and the impact of sexual taboo in interaction. Relevant aspects of fieldwork, empathy and ethical protocols are also detailed in the paper. Transcription and markup are explained. Finally, an overview of the corpus is given, as well as some research papers based on it. Examples of the questionnaire are also provided.
The paper describes the creation of the first open access multi-genre historical corpus of Emergent Modern Hebrew, made possible by implementation of digital humanities methods in the process of corpus curation, encoding, and dissemination. Corpus contents originate in the Ben-Yehuda Project, an open access repository of Hebrew literature online, and in digital images curated from the collections of the National Library of Israel, a selection of which have been transcribed through a dedicated crowdsourcing task that feeds back into the library’s online catalog. Texts in the corpus are encoded following best practices in the digital humanities, including markup of metadata that enables time-sensitive research, linguistic and other, of the corpus. Evaluation of morphological analysis based on Modern Hebrew language models is shown to distinguish between genres in the historical variety, highlighting the importance of ephemeral materials for linguistic research and for potential collaboration with libraries and cultural institutions in the process of corpus creation. We demonstrate the use of the corpus in diachronic linguistic research and suggest ways in which the association it provides between digital images and texts can be used to support automatic language processing and to enhance resources in the digital humanities.
The latest reference corpus of written Slovene, the Gigafida corpus, was created as part of the ‘Communication in Slovene’ project. In the same project, a web concordancer was designed for the broadest possible use, and tailored to the needs and abilities of user groups such as translators, writers, proofreaders and teachers. Two years after the corpus was published within the new tool, its features were assessed by the users. With an average rate of 4.36 on a scale between 1 and 5 (1 = I strongly disagree, 5 = I strongly agree), the results indicate that most survey participants agreed or strongly agreed with positive statements about the new implementations (e.g. “The corpus results are displayed in a clear manner”). This is a considerable improvement in user experience from the previous reference corpus of Slovene, i.e. the FidaPLUS corpus within the ASP32 concordancer (rated with 3.67). In the user feedback, the simplicity of search options and the interface clarity are highlighted as the main advantages, while for the future development, advanced visualizations of corpus data and improved search of word-phrases are suggested. The evaluation also highlighted some relevant user habits, such as not taking the time to learn systematically about the tool before they start using it. The findings will be implemented in future editions of the Gigafida corpus, but are relevant to any project that aims at facilitating a wider use of reference corpora and corpus-based resources.
We present DEMoS (Database of Elicited Mood in Speech), a new, large database with Italian emotional speech: 68 speakers, some 9 k speech samples. As Italian is under-represented in speech emotion research, for a comparison with the state-of-the-art, we model the ‘big 6 emotions’ and guilt. Besides making available this database for research, our contribution is three-fold: First, we employ a variety of mood induction procedures, whose combinations are especially tailored for specific emotions. Second, we use combinations of selection procedures such as an alexithymia test and self- and external assessment, obtaining 1,5 k (proto-) typical samples; these were used in a perception test (86 native Italian subjects, categorical identification and dimensional rating). Third, machine learning techniques—based on standardised brute-forced openSMILE ComParE features and support vector machine classifiers—were applied to assess how emotional typicality and sample size might impact machine learning efficiency. Our results are three-fold as well: First, we show that appropriate induction techniques ensure the collection of valid samples, whereas the type of self-assessment employed turned out not to be a meaningful measurement. Second, emotional typicality—which shows up in an acoustic analysis of prosodic main features—in contrast to sample size is not an essential feature for successfully training machine learning models. Third, the perceptual findings demonstrate that the confusion patterns mostly relate to cultural rules and to ambiguous emotions.
Definitional knowledge has proved to be essential in various Natural Language Processing tasks and applications, especially when information at the level of word senses is exploited. However, the few sense-annotated corpora of textual definitions available to date are of limited size: this is mainly due to the expensive and time-consuming process of annotating a wide variety of word senses and entity mentions at a reasonably high scale. In this paper we present SenseDefs, a large-scale high-quality corpus of disambiguated definitions (or glosses) in multiple languages, comprising sense annotations of both concepts and named entities from a wide-coverage unified sense inventory. Our approach for the construction and disambiguation of this corpus builds upon the structure of a large multilingual semantic network and a state-of-the-art disambiguation system: first, we gather complementary information of equivalent definitions across different languages to provide context for disambiguation; then we refine the disambiguation output with a distributional approach based on semantic similarity. As a result, we obtain a multilingual corpus of textual definitions featuring over 38 million definitions in 263 languages, and we publicly release it to the research community. We assess the quality of SenseDefs’s sense annotations both intrinsically and extrinsically on Open Information Extraction and Sense Clustering tasks.
This paper describes the ‘Corpus of American Danish’ (CoAmDa), a newly established corpus of spoken immigrant Danish in North and South America. The CoAmDa amounts to approx. 1.7 million tokens, making it one of the largest corpora of heritage language at present. With regard to text type, the CoAmDa is a non-standard multilingual spoken language resource as Danish is mixed with American English, Canadian English or Argentine Spanish, respectively, in every recording. The aim of this note is to document relevant aspects and specifications of the CoAmDA, viz. the audio data, the sociodemographic metadata of the speakers, the digitization process of analog data, the transcription procedures, the format and tagging of the speech files and the internal validation procedures. In so doing, we wish to share our experience and best practices with regard to achieving a spoken language resource of high quality with the interested public, in particular other researchers working on and with multilingual speech corpora.
In this paper we present US2016, the largest publicly available set of corpora of annotated dialogical argumentation. The annotation covers argumentative relations, dialogue acts and pragmatic features. The corpora comprise transcriptions of television debates leading up to the 2016 US presidential elections, and reactions to the debates on Reddit. These two constitutive parts of the corpora are integrated by means of the intertextual correspondence between them. The rhetorical richness and high argument density of the communicative context results in cross-genre corpora that are robust resources for the study of the dialogical dynamics of argumentation in three ways: first, in empirical strands of research in discourse analysis and argumentation studies; second, in the burgeoning field of argument mining where automatic techniques require such data; and third, in formulating algorithmic techniques for sensemaking through the development of Argument Analytics.
TED-Multilingual Discourse Bank, or TED-MDB, is a multilingual resource where TED-talks are annotated at the discourse level in 6 languages (English, Polish, German, Russian, European Portuguese, and Turkish) following the aims and principles of PDTB. We explain the corpus design criteria, which has three main features: the linguistic characteristics of the languages involved, the interactive nature of TED talks—which led us to annotate Hypophora, and the decision to avoid projection. We report our annotation consistency, and post-annotation alignment experiments, and provide a cross-lingual comparison based on corpus statistics.
Swiss dialects of German are, unlike many dialects of other standardised languages, widely used in everyday communication. Despite this fact, automatic processing of Swiss German is still a considerable challenge due to the fact that it is mostly a spoken variety and that it is subject to considerable regional variation. This paper presents the ArchiMob corpus, a freely available general-purpose corpus of spoken Swiss German based on oral history interviews. The corpus is a result of a long design process, intensive manual work and specially adapted computational processing. We first present the modalities of access of the corpus for linguistic, historic and computational research. We then describe how the documents were transcribed, segmented and aligned with the sound source. This work involved a series of experiments that have led to automatically annotated normalisation and part-of-speech tagging layers. Finally, we present several case studies to motivate the use of the corpus for digital humanities in general and for dialectology in particular.
PreMOn is a freely available linguistic resource for exposing predicate models (PropBank, NomBank, VerbNet, and FrameNet) and mappings between them (e.g., SemLink and the predicate matrix) as linguistic linked open data (LOD). It consists of two components: (1) the PreMOn Ontology, that builds on the OntoLex-Lemon model by the W3C ontology-Lexica community group to enable an homogeneous representation of data from various predicate models and their linking to ontological resources; and, (2) the PreMOn Dataset, a LOD dataset integrating various versions of the aforementioned predicate models and mappings, linked to other LOD ontologies and resources (e.g., FrameBase, ESO, WordNet RDF). PreMOn is accessible online in different ways (e.g., SPARQL endpoint), and extensively documented.
SUBTLEX-CAT is a word frequency and contextual diversity database for Catalan, obtained from a 278-million-word corpus based on subtitles supplied from broadcast Catalan television. Like all previous SUBTLEX corpora, it comprises subtitles from films and TV series. In addition, it includes a wider range of TV shows (e.g., news, documentaries, debates, and talk shows) than has been included in most previous databases. Frequency metrics were obtained for the whole corpus, on the one hand, and only for films and fiction TV series, on the other. Two lexical decision experiments revealed that the subtitle-based metrics outperformed the previously available frequency estimates, computed from either written texts or texts from the Internet. Furthermore, the metrics obtained from the whole corpus were better predictors than the ones obtained from films and fiction TV series alone. In both experiments, the best predictor of response times and accuracy was contextual diversity.
This article introduces the second version of the Tool for the Automatic Analysis of Cohesion (TAACO 2.0). Like its predecessor, TAACO 2.0 is a freely available text analysis tool that works on the Windows, Mac, and Linux operating systems; is housed on a user’s hard drive; is easy to use; and allows for batch processing of text files. TAACO 2.0 includes all the original indices reported for TAACO 1.0, but it adds a number of new indices related to local and global cohesion at the semantic level, reported by latent semantic analysis, latent Dirichlet allocation, and word2vec. The tool also includes a source overlap feature, which calculates lexical and semantic overlap between a source and a response text (i.e., cohesion between the two texts based measures of text relatedness). In the first study in this article, we examined the effects that cohesion features, prompt, essay elaboration, and enhanced cohesion had on expert ratings of text coherence, finding that global semantic similarity as reported by word2vec was an important predictor of coherence ratings. A second study was conducted to examine the source and response indices. In this study we examined whether source overlap between the speaking samples found in the TOEFL-iBT integrated speaking tasks and the responses produced by test-takers was predictive of human ratings of speaking proficiency. The results indicated that the percentage of keywords found in both the source and response and the similarity between the source document and the response, as reported by word2vec, were significant predictors of speaking quality. Combined, these findings help validate the new indices reported for TAACO 2.0.
The paper introduces the motivation for creating dedicated speech corpora of air traffic control communication, describes in detail the process of preparation of corpora for both automatic speech recognition and text-to-speech synthesis, presents an illustrative example of speech recognition system developed using the automatic speech recognition corpora and finally describes the technical aspects of the data and the distribution channel.
This article presents the Linguistic Annotated Bibliography (LAB) as a searchable Web portal to quickly and easily access reliable database norms, related programs, and variable calculations. These publications were coded by language, number of stimuli, stimuli type (i.e., words, pictures, symbols), keywords (i.e., frequency, semantics, valence), and other useful information. This tool not only allows researchers to search for the specific type of stimuli needed for experiments but also permits the exploration of publication trends across 100 years of research. Details about the portal creation and use are outlined, as well as various analyses of change in publication rates and keywords. In general, advances in computational power have allowed for the increase in dataset size in the recent decades, in addition to an increase in the number of linguistic variables provided in each publication.
Perceptual experience plays a critical role in the conceptual representation of words. Higher levels of semantic variables such as imageability, concreteness, and sensory experience are generally associated with faster and more accurate word processing. Nevertheless, these variables tend to be assessed mostly on the basis of visual experience. This underestimates the potential contributions of other perceptual modalities. Accordingly, recent evidence has stressed the importance of providing modality-specific perceptual strength norms. In the present study, we developed French Canadian norms of visual and auditory perceptual strength (i.e., the modalities that have major impact on word processing) for 3,596 nouns. We then explored the relationship between these newly developed variables and other lexical, orthographic, and semantic variables. Finally, we demonstrated the contributions of visual and auditory perceptual strength ratings to visual word processing beyond those of other semantic variables related to perceptual experience (e.g., concreteness, imageability, and sensory experience ratings). The ratings developed in this study are a meaningful contribution toward the implementation of new studies that will shed further light on the interaction between linguistic, semantic, and perceptual systems.
In the last decade, research has shown that word processing is influenced by the lexical and semantic features of words. However, norms for a crucial semantic variable—that is, conceptual familiarity—have not been available for a sizeable French database. We thus developed French Canadian conceptual familiarity norms for 3,596 nouns. This enriches Desrochers and Thompson’s (2009) database, in which subjective frequency and imageability values are already available for the same words. We collected online data from 313 Canadian French speakers. The full database of conceptual familiarity ratings is freely available at http://lingualab.ca/fr/projets/normes-de-familiarite-conceptuelle. We then demonstrated the utility of these new conceptual familiarity norms by assessing their contribution to lexical decision times. We conducted a stepwise regression model with conceptual familiarity in the last step. This allowed us to assess the independent contribution of conceptual familiarity beyond the contributions of other well-known psycholinguistic variables, such as frequency, imageability, and age of acquisition. The results showed that conceptual familiarity facilitated lexical decision latencies. In sum, these ratings will help researchers select French stimuli for experiments in which conceptual familiarity must be taken into account.
A limiting factor in understanding memory and language is often the availability of large numbers of stimuli to use and explore in experimental studies. In this study, we expand on three previous databases of concepts to over 4000 words including nouns, verbs, adjectives, and other parts of speech. Participants in the study were asked to provide lists of features for each concept presented (a semantic feature production task), which were combined with previous research in this area. These feature lists for each concept were then coded into their root word form and affixes (i.e., cat and s for cats) to explore the impact of word form on semantic similarity measures, which are often calculated by comparing concept feature lists (feature overlap). All concept features, coding, and calculated similarity information is provided in a searchable database for easy access and utilization for future researchers when designing experiments that use word stimuli. The final database of word pairs was combined with the Semantic Priming Project to examine the relation of semantic similarity statistics on semantic priming in tandem with other psycholinguistic variables.
Paired-associate learning is one of the most commonly used paradigms to study human memory. In many of these studies, participants are typically told to learn foreign language–English translations, such as Swahili–English or Lithuanian–English pairs. One limitation of these currently available foreign language–English translation norms is that their foreign languages are based on the alphabetic writing system, thereby preventing researchers from generalizing their findings to languages based on logographic writing systems. In the present study we collected normative data for 160 Chinese–English word pairs. Participants completed three study–test cycles, followed by metacognitive judgments on their learning experience. For each pair, we report recall performance, recall latency, ease of learning, and judgments of learning. A simultaneous multiple regression analysis with frequency (of both the English word and the Chinese character), word length (English), and number of strokes (Chinese) as predictors revealed that a greater number of strokes (or higher visual complexity) for the Chinese characters predicted lower target recall.
An unprecedented number of empirical studies have shown that iconic gestures—those that mimic the sensorimotor attributes of a referent—contribute significantly to language acquisition, perception, and processing. However, there has been a lack of normed studies describing generalizable principles in gesture production and in comprehension of the mappings of different types of iconic strategies (i.e., modes of representation; Müller, 2013). In Study 1 we elicited silent gestures in order to explore the implementation of different types of iconic representation (i.e., acting, representing, drawing, and personification) to express concepts across five semantic domains. In Study 2 we investigated the degree of meaning transparency (i.e., iconicity ratings) of the gestures elicited in Study 1. We found systematicity in the gestural forms of 109 concepts across all participants, with different types of iconicity aligning with specific semantic domains: Acting was favored for actions and manipulable objects, drawing for nonmanipulable objects, and personification for animate entities. Interpretation of gesture–meaning transparency was modulated by the interaction between mode of representation and semantic domain, with some couplings being more transparent than others: Acting yielded higher ratings for actions, representing for object-related concepts, personification for animate entities, and drawing for nonmanipulable entities. This study provides mapping principles that may extend to all forms of manual communication (gesture and sign). This database includes a list of the most systematic silent gestures in the group of participants, a notation of the form of each gesture based on four features (hand configuration, orientation, placement, and movement), each gesture’s mode of representation, iconicity ratings, and professionally filmed videos that can be used for experimental and clinical endeavors.
Most of the new words a reader will find are morphologically complex. Also, theoretical models of language processing propose that morphology plays an important role in visual word processing. Nevertheless, studies on the subject show contradicting results that are difficult to reconcile. One factor that may explain this is the lack of a sizeable and reliable morphological database. As a consequence, there are enormous methodological differences in the way the values for morphological variables are calculated across studies. We present a sizeable and freely available database with six new variables for affixes and three for roots for 68,624 words from the English Lexicon Project. We further studied by means of regression models the influence of these new variables on the lexical decision latencies of 4,724 morphologically complex nouns that included one root and one suffix. Results showed that root frequency and suffix length had a facilitatory effect, whereas the percentage of more frequent words in the morphological family of the suffix had an inhibitory effect on latencies. After controlling for collinearity, root family size, suffix family size, suffix P*, and suffix frequency also had facilitatory effects. These results shed new light on the importance of suffix length and the frequency of the lexical competitors of the family of a suffix. This database represents a valuable resource for studies on the effect of morphology in visual word processing in English and can be found at https://github.com/hugomailhot/MorphoLex-en .
The Lesser Sunda Islands in eastern Indonesia cover a longitudinal distance of some 600 kilometres. They are the westernmost place where languages of the Austronesian family come into contact with a family of Papuan languages and constitute an area of high linguistic diversity. Despite its diversity, the Lesser Sundas are little studied and for most of the region, written historical records, as well as archaeological and ethnographic data are lacking. In such circumstances the study of relationships between languages through their lexicon is a unique tool for making inferences about human (pre-)history and tracing population movements. However, the lack of a collective body of lexical data has severely limited our understanding of the history of the languages and peoples in the Lesser Sundas. The LexiRumah database fills this gap by assembling lexicons of Lesser Sunda languages from published and unpublished sources, and making those lexicons available online in a consistent format. T)
Words that correspond to a potential sensory experience-concrete words-have long been found to possess a processing advantage over abstract words in various lexical tasks. We collected norms of concreteness for a set of 1,659 French words, together with other psycholinguistic norms that were not available for these words-context availability, emotional valence, and arousal-but which are important if we are to achieve a better understanding of the meaning of concreteness effects. We then investigated the relationships of concreteness with these newly collected variables, together with other psycholinguistic variables that were already available for this set of words (e.g., imageability, age of acquisition, and sensory experience ratings). Finally, thanks to the variety of psychological norms available for this set of words, we decided to test further the embodied account of concreteness effects in visual-word recognition, championed by Kousta, Vigliocco, Vinson, Andrews, and Del Campo (Journal of Experimental Psychology: General, 140, 14-34, 2011). Similarly, we investigated the influences of concreteness in three word recognition tasks-lexical decision, progressive demasking, and word naming-using a multiple regression approach, based on the reaction times available in Chronolex (Ferrand, Brysbaert, Keuleers, New, Bonin, M{\'{e}}ot, Pallier, Frontiers in Psychology, 2; 306, 2011). The norms can be downloaded as supplementary material provided with this article.
This paper presents the different methodologies and resources used to build Galnet, the Galician version of WordNet. It reviews the different extraction processes and the lexicographical and textual sources used to develop this resource, and describes some of its applications in ontology research and terminology processing.
We present the Chinese Lexical Database (CLD): a large-scale lexical database for simplified Chinese. The CLD provides a wealth of lexical information for 3913 one-character words, 34,233 two-character words, 7143 three-character words, and 3355 four-character words, and is publicly available through http://www.chineselexicaldatabase.com. For each of the 48,644 words in the CLD, we provide a wide range of categorical predictors, as well as an extensive set of frequency measures, complexity measures, neighborhood density measures, orthography-phonology consistency measures, and information-theoretic measures. We evaluate the explanatory power of the lexical variables in the CLD in the context of experimental data through analyses of lexical decision latencies for one-character, two-character, three-character and four-character words, as well as word naming latencies for one-character and two-character words. The results of these analyses are discussed.
Sensory experience rating (SER) is a recently developed subjective lexical index that reflects the extent to which a word evokes a sensory and/or perceptual experience in a reader (Juhasz {\&} Yap, 2013; Juhasz, Yap, Dicke, Taylor, {\&} Gullick, 2011). In the present study, SERs for a set of 5,500 Spanish words were collected, which makes this the largest set of norms for SER in the Spanish language to date. Additionally, with the aim of further exploring the implications of this new indicator and its relations with other psycholinguistic variables, a variety of correlational and regression analyses are provided. The results showed that SERs significantly correlated with imageability, age of acquisition, and a number of variables related to perception and emotion. In addition, SERs predicted a significant amount of variance in lexical decision times when other variables were controlled.
VerbNet—the most extensive online verb lexicon currently available for English—has proved useful in supporting a variety of NLP tasks. However, its exploitation in multilingual NLP has been limited by the fact that such classifications are available for few languages only. Since manual development of VerbNet is a major undertaking, researchers have recently translated VerbNet classes from English to other languages. However, no systematic investigation has been conducted into the applicability and accuracy of such a translation approach across different, typologically diverse languages. Our study is aimed at filling this gap. We develop a systematic method for translation of VerbNet classes from English to other languages which we first apply to Polish and subsequently to Croatian, Mandarin, Japanese, Italian, and Finnish. Our results on Polish demonstrate high translatability with all the classes (96% of English member verbs successfully translated into Polish) and strong inter-annotator agreement, revealing a promising degree of overlap in the resultant classifications. The results on other languages are equally promising. This demonstrates that VerbNet classes have strong cross-lingual potential and the proposed method could be applied to obtain gold standards for automatic verb classification in different languages. We make our annotation guidelines and the six language-specific verb classifications available with this paper.
Iconicity – the correspondence between form and meaning – may help young children learn to use new words. Early‐learned words are higher in iconicity than later learned words. However, it remains unclear what role iconicity may play in actual language use. Here, we ask whether iconicity relates not just to the age at which words are acquired, but also to how frequently children and adults use the words in their speech. If iconicity serves to bootstrap word learning, then we would expect that children should say highly iconic words more frequently than less iconic words, especially early in development. We would also expect adults to use iconic words more often when speaking to children than to other adults. We examined the relationship between frequency and iconicity for approximately 2000 English words. Replicating previous findings, we found that more iconic words are learned earlier. Moreover, we found that more iconic words tend to be used more by younger children, and adults use more iconic words when speaking to children than to other adults. Together, our results show that young children not only learn words rated high in iconicity earlier than words low in iconicity, but they also produce these words more frequently in conversation – a pattern that is reciprocated by adults when speaking with children. Thus, the earliest conversations of children are relatively higher in iconicity, suggesting that this iconicity scaffolds the production and comprehension of spoken language during early development.
The paper introduces a novel annotated corpus of Old and Middle Hungarian (16–18 century), the texts of which were selected in order to approximate the vernacular of the given historical periods as closely as possible. The corpus consists of testimonies of witnesses in trials and samples of private correspondence. The texts are not only analyzed morphologically, but each file contains metadata that would also facilitate sociolinguistic research. The texts were segmented into clauses, manually normalized and morphosyntactically annotated using an annotation system consisting of the PurePos PoS tagger and the Hungarian morphological analyzer HuMor originally developed for Modern Hungarian but adapted to analyze Old and Middle Hungarian morphological constructions. The automatically disambiguated morphological annotation was manually checked and corrected using an easy-to-use web-based manual disambiguation interface. The normalization process and the manual validation of the annotation required extensive teamwork and provided continuous feedback for the refinement of the computational morphology and iterative retraining of the statistical models of the tagger. The paper discusses some of the typical problems that occurred during the normalization procedure and their tentative solutions. Besides, we also describe the automatic annotation tools, the process of semi-automatic disambiguation, and the query interface, a special function of which also makes correction of the annotation possible. Displaying the original, the normalized and the parsed versions of the selected texts, the beta version of the first fully normalized and annotated historical corpus of Hungarian is freely accessible at the address http://tmk.nytud.hu/.
In this work we present the Talk of Norway (ToN) data set, a collection of Norwegian Parliament speeches from 1998 to 2016. Every speech is richly annotated with metadata harvested from different sources, and augmented with language type, sentence, token, lemma, part-of-speech, and morphological feature annotations. We also present a pilot study on party classification in the Norwegian Parliament, carried out in the context of a cross-faculty collaboration involving researchers from both Political Science and Computer Science. Our initial experiments demonstrate how the linguistic and institutional annotations in ToN can be used to gather insights on how different aspects of the political process affect classification.
We present the RST Signalling Corpus (Das et al. in RST signalling corpus, LDC2015T10. https://catalog.ldc.upenn.edu/LDC2015T10, 2015), a corpus annotated for signals of coherence relations. The corpus is developed over the RST Discourse Treebank (Carlson et al. in RST Discourse Treebank, LDC2002T07. https://catalog.ldc.upenn.edu/LDC2002T07, 2002) which is annotated for coherence relations. In the RST Signalling Corpus, these relations are further annotated with signalling information. The corpus includes annotation not only for discourse markers which are considered to be the most typical (or sometimes the only type of) signals in discourse, but also for a wide array of other signals such as reference, lexical, semantic, syntactic, graphical and genre features as potential indicators of coherence relations. We describe the research underlying the development of the corpus and the annotation process, and provide details of the corpus. We also present the results of an inter-annotator agreement study, illustrating the validity and reproducibility of the annotation. The corpus is available through the Linguistic Data Consortium, and can be used to investigate the psycholinguistic mechanisms behind the interpretation of relations through signalling, and also to develop discourse-specific computational systems such as discourse parsing applications.
The use of immersive virtual reality as a research tool is rapidly increasing in numerous scientific disciplines. By combining ecological validity with strict experimental control, immersive virtual reality provides the potential to develop and test scientific theories in rich environments that closely resemble everyday settings. This article introduces the first standardized database of colored three-dimensional (3-D) objects that can be used in virtual reality and augmented reality research and applications. The 147 objects have been normed for name agreement, image agreement, familiarity, visual complexity, and corresponding lexical characteristics of the modal object names. The availability of standardized 3-D objects for virtual reality research is important, because reaching valid theoretical conclusions hinges critically on the use of well-controlled experimental stimuli. Sharing standardized 3-D objects across different virtual reality labs will allow for science to move forward more quickly.
Quality annotated resources are essential for Natural Language Processing. The objective of this work is to present a corpus of clinical narratives in French annotated for linguistic, semantic and structural information, aimed at clinical information extraction. Six annotators contributed to the corpus annotation, using a comprehensive annotation scheme covering 21 entities, 11 attributes and 37 relations. All annotators trained on a small, common portion of the corpus before proceeding independently. An automatic tool was used to produce entity and attribute pre-annotations. About a tenth of the corpus was doubly annotated and annotation differences were resolved in consensus meetings. To ensure annotation consistency throughout the corpus, we devised harmonization tools to automatically identify annotation differences to be addressed to improve the overall corpus quality. The annotation project spanned over 24 months and resulted in a corpus comprising 500 documents (148,476 tokens) annotated with 44,740 entities and 26,478 relations. The average inter-annotator agreement is 0.793 F-measure for entities and 0.789 for relations. The performance of the pre-annotation tool for entities reached 0.814 F-measure when sufficient training data was available. The performance of our entity pre-annotation tool shows the value of the corpus to build and evaluate information extraction methods. In addition, we introduced harmonization methods that further improved the quality of annotations in the corpus.
This article introduces a new corpus of eye movements in silent reading—the Russian Sentence Corpus (RSC). Russian uses the Cyrillic script, which has not yet been investigated in cross-linguistic eye movement research. As in every language studied so far, we confirmed the expected effects of low-level parameters, such as word length, frequency, and predictability, on the eye movements of skilled Russian readers. These findings allow us to add Slavic languages using Cyrillic script (exemplified by Russian) to the growing number of languages with different orthographies, ranging from the Roman-based European languages to logographic Asian ones, whose basic eye movement benchmarks conform to the universal comparative science of reading (Share, 2008). We additionally report basic descriptive corpus statistics and three exploratory investigations of the effects of Russian morphology on the basic eye movement measures, which illustrate the kinds of questions that researchers can answer using the RSC. The annotated corpus is freely available from its project page at the Open Science Framework: https://osf.io/x5q2r/.
In this paper we present the corpus of Basque simplified texts. This corpus compiles 227 original sentences of science popularisation domain and two simplified versions of each sentence. The simplified versions have been created following different approaches: the structural, by a court translator who considers easy-to-read guidelines and the intuitive, by a teacher based on her experience. The aim of this corpus is to make a comparative analysis of simplified text. To that end, we also present the annotation scheme we have created to annotate the corpus. The annotation scheme is divided into eight macro-operations: delete, merge, split, transformation, insert, reordering, no operation and other. These macro-operations can be classified into different operations. We also relate our work and results to other languages. This corpus will be used to corroborate the decisions taken and to improve the design of the automatic text simplification system for Basque.
In this paper, we present SFU ReviewSP-NEG, the first Spanish corpus annotated with negation with a wide coverage freely available. We describe the methodology applied in the annotation of the corpus including the tagset, the linguistic criteria and the inter-annotator agreement tests. We also include a complete typology of negation patterns in Spanish. This typology has the advantage that it is easy to express in terms of a tagset for corpus annotation: the types are clearly defined, which avoids ambiguity in the annotation process, and they provide wide coverage (i.e. they resolved all the cases occurring in the corpus). We use the SFU ReviewSP as a base in order to make the annotations. The corpus consists of 400 reviews, 221,866 words and 9455 sentences, out of which 3022 sentences contain at least one negation structure.
In this study, we report the validation results of the EU-Emotion Voice Database, an emotional voice database available for scientific use, containing a total of 2,159 validated emotional voice stimuli. The EU-Emotion voice stimuli consist of audio-recordings of 54 actors, each uttering sentences with the intention of conveying 20 different emotional states (plus neutral). The database is organized in three separate emotional voice stimulus sets in three different languages (British English, Swedish, and Hebrew). These three sets were independently validated by large pools of participants in the UK, Sweden, and Israel. Participants' validation of the stimuli included emotion categorization accuracy and ratings of emotional valence, intensity, and arousal. Here we report the validation results for the emotional voice stimuli from each site and provide validation data to download as a supplement, so as to make these data available to the scientific community. The EU-Emotion Voice Database is part of the EU-Emotion Stimulus Set, which in addition contains stimuli of emotions expressed in the visual modality (by facial expression, body language, and social scene) and is freely available to use for academic research purposes.
Geographical data can be obtained by converting place names from free-format text into geographical coordinates. The ability to geo-locate events in textual reports represents a valuable source of information in many real-world applications such as emergency responses, real-time social media geographical event analysis, understanding location instructions in auto-response systems and more. However, geoparsing is still widely regarded as a challenge because of domain language diversity, place name ambiguity, metonymic language and limited leveraging of context as we show in our analysis. Results to date, whilst promising, are on laboratory data and unlike in wider NLP are often not cross-compared. In this study, we evaluate and analyse the performance of a number of leading geoparsers on a number of corpora and highlight the challenges in detail. We also publish an automatically geotagged Wikipedia corpus to alleviate the dearth of (open source) corpora in this domain.
This paper presents the Irish Political Speech Database, an English-language database collected from Irish political recordings. The database is collected with automated indexing and content retrieval in mind, and thus is gathered from real-world recordings (such as television interviews and election rallies) which represent the nature and quality of recordings which will be encountered in practical applications. The database is labelled for six speaker attributes: boring; charismatic; enthusiastic; inspiring; likeable; and persuasive. Each of these traits is linked to the perceived ability or appeal of the speaker, and as such are relevant to a range of content retrieval and speech analysis tasks. The six base attributes are combined to form a metric of Overall Speaker Appeal. A set of baseline experiments is presented, which demonstrate the potential of this database for affective computing studies. Classification accuracies of up to 76% are achieved, with little feature or system optimisation.
A central issue in visual and spoken word recognition is the lexical representation of complex words-in particular, whether the lexical representation of complex words depends on semantic transparency: Is a complex verb like understand lexically represented as a whole word or via its base stand, given that its meaning is not transparent from the meanings of its parts? To study this issue, a number of stimulus characteristics are of interest that are not yet available in public databases of German. This article provides semantic association ratings, lexical paraphrases, and vector-based similarity measures for German verbs, measuring (a) the semantic transparency between 1,259 complex verbs and their bases, (b) the semantic relatedness between 1,109 verb pairs with 432 different bases, and (c) the vector-based similarity measures of 846 verb pairs. Additionally, we include the verb regularity of all verbs and two counts of verb family size for 184 base verbs, as well as estimates of age of acquisition and age of reading for 200 verbs. Together with lemma and type frequencies from public lexical databases, all measures can be downloaded along with this article. Statistical analyses indicate that verb family size, morphological complexity, frequency, and verb regularity affect the semantic transparency and relatedness ratings as well as the age of acquisition estimates, indicating that these are relevant variables in psycholinguistic experiments. Although lexical paraphrases, vector-based similarity measures, and semantic association ratings may deliver complementary information, the interrater reliability of the semantic association ratings for each verb pair provides valuable information when selecting stimuli for psycholinguistic experiments.
In this article, we present Procura-PALavras (P-PAL), a Web-based interface for a new European Portuguese (EP) lexical database. Based on a contemporary printed corpus of over 227 million words, P-PAL provides a broad range of word attributes and statistics, including several measures of word frequency (e.g., raw counts, per-million word frequency, logarithmic Zipf scale), morpho-syntactic information (e.g., parts of speech [PoSs], grammatical gender and number, dominant PoS, and frequency and relative frequency of the dominant PoS), as well as several lexical and sublexical orthographic (e.g., number of letters; consonant-vowel orthographic structure; density and frequency of orthographic neighbors; orthographic Levenshtein distance; orthographic uniqueness point; orthographic syllabification; and trigram, bigram, and letter type and token frequencies), and phonological measures (e.g., pronunciation, number of phonemes, stress, density and frequency of phonological neighbors, transposed and phonographic neighbors, syllabification, and biphone and phone type and token frequencies) for {\~{}}53,000 lemmatized and {\~{}}208,000 nonlemmatized EP word forms. To obtain these metrics, researchers can choose between two word queries in the application: (i) analyze words previously selected for specific attributes and/or lexical and sublexical characteristics, or (ii) generate word lists that meet word requirements defined by the user in the menu of analyses. For the measures it provides and the flexibility it allows, P-PAL will be a key resource to support research in all cognitive areas that use EP verbal stimuli. P-PAL is freely available at http://p-pal.di.uminho.pt/tools .
Human locomotion is a fundamental class of events, and manners of locomotion (e.g., how the limbs are used to achieve a change of location) are commonly encoded in language and gesture. To our knowledge, there is no openly accessible database containing normed human locomotion stimuli. Therefore, we introduce the GestuRe and ACtion Exemplar (GRACE) video database, which contains 676 videos of actors performing novel manners of human locomotion (i.e., moving from one location to another in an unusual manner) and videos of a female actor producing iconic gestures that represent these actions. The usefulness of the database was demonstrated across four norming experiments. First, our database contains clear matches and mismatches between iconic gesture videos and action videos. Second, the male actors and female actors whose action videos matched the gestures in the best possible way, perform the same actions in very similar manners and different actions in highly distinct manners. Third, all the actions in the database are distinct from each other. Fourth, adult native English speakers were unable to describe the 26 different actions concisely, indicating that the actions are unusual. This normed stimuli set is useful for experimental psychologists working in the language, gesture, visual perception, categorization, memory, and other related domains.
The adoption of electronic health records (EHRs) has enabled a wide range of applications leveraging EHR data. However, the meaningful use of EHR data largely depends on our ability to efficiently extract and consolidate information embedded in clinical text where natural language processing (NLP) techniques are essential. Semantic textual similarity (STS) that measures the semantic similarity between text snippets plays a significant role in many NLP applications. In the general NLP domain, STS shared tasks have made available a huge collection of text snippet pairs with manual annotations in various domains. In the clinical domain, STS can enable us to detect and eliminate redundant information that may lead to a reduction in cognitive burden and an improvement in the clinical decision-making process. This paper elaborates our efforts to assemble a resource for STS in the medical domain, MedSTS. It consists of a total of 174,629 sentence pairs gathered from a clinical corpus at Mayo Clinic. A subset of MedSTS (MedSTS_ann) containing 1068 sentence pairs was annotated by two medical experts with semantic similarity scores of 0–5 (low to high similarity). We further analyzed the medical concepts in the MedSTS corpus, and tested four STS systems on the MedSTS_ann corpus. In the future, we will organize a shared task by releasing the MedSTS_ann corpus to motivate the community to tackle the real world clinical problems.
Presents a study which aims to investigate SPALEX, a Spanish lexical decision database by focusing on native Spanish speakers at a global scale and with a vast amount of words, to provide a useful tool for researchers exploring the acquisition and processing of this language in native and foreign contexts. SPALEX contains data from a Spanish crowd-sourced lexical decision mega study. The authors collected the data through an online platform from May 12th, 2014 to December 19th, 2017. The majority of the data was acquired during the first month of the experiment, when an advertising campaign was done in order to attract the public’s attention. Participants also had the option of publishing their results via social networks, which led to attract more participants in a snow-ball sampling fashion. Additionally, the database contains information on participants that voluntarily provided information about their gender, age, country of origin, education level, handedness, native language, and best foreign language. In each experimental session, participants responded to 70 words and 30 non-words presented randomly and without repetition. Accuracy in SPALEX is expressed as 1 for correct answers and 0 for incorrect answers. Based on participants’ responses, the authors calculated percentage known, a measure of the percentage of participants that know a particular word. (PsycINFO Database Record (c) 2018 APA, all rights reserved)
Sentiment lexicons and word embeddings constitute well-established sources of information for sentiment analysis in online social media. Although their effectiveness has been demonstrated in state-of-the-art sentiment analysis and related tasks in the English language, such publicly available resources are much less developed and evaluated for the Greek language. In this paper, we tackle the problems arising when analyzing text in such an under-resourced language. We present and make publicly available a rich set of such resources, ranging from a manually annotated lexicon, to semi-supervised word embedding vectors and annotated datasets for different tasks. Our experiments using different algorithms and parameters on our resources show promising results over standard baselines; on average, we achieve a 24.9% relative improvement in F-score on the cross-domain sentiment analysis task when training the same algorithms with our resources, compared to training them on more traditional feature sources, such as n-grams. Importantly, while our resources were built with the primary focus on the cross-domain sentiment analysis task, they also show promising results in related tasks, such as emotion analysis and sarcasm detection.
Feature stability, time and tempo of change, and the role of genealogy versus areality in creating linguistic diversity are important issues in current computational research on linguistic typology. This paper presents a database initiative, DiACL Typology, which aims to provide a resource for addressing these questions with specific of the extended Indo-European language area of Eurasia, the region with the best documented linguistic history. The database is pre-prepared for statistical and phylogenetic analyses and contains both linguistic typological data from languages spanning over four millennia, and linguistic metadata concerning geographic location, time period, and reliability of sources. The typological data has been organized according to a hierarchical model of increasing granularity in order to create datasets that are complete and representative. [ABSTRACT FROM AUTHOR], Copyright of PLoS ONE is the property of Public Library of Science and its content may not be copied o)
SNOMED CT provides about 300,000 codes with fine-grained concept definitions to support interoperability of health data. Coding clinical texts with medical terminologies it is not a trivial task and is prone to disagreements between coders. We conducted a qualitative analysis to identify sources of disagreements on an annotation experiment which used a subset of SNOMED CT with some restrictions. A corpus of 20 English clinical text fragments from diverse origins and languages was annotated independently by two domain medically trained annotators following a specific annotation guideline. By following this guideline, the annotators had to assign sets of SNOMED CT codes to noun phrases, together with concept and term coverage ratings. Then, the annotations were manually examined against a reference standard to determine sources of disagreements. Five categories were identified. In our results, the most frequent cause of inter-annotator disagreement was related to human issues. In severa)
Normative databases containing psycholinguistic variables are commonly used to aid stimulus selection for investigations into language and other cognitive processes. Norms exist for many languages, but not for Thai. The aim of the present research, therefore, was to obtain Thai normative data for the BOSS, a set of 480 high resolution color photographic images of real objects (Brodeur et al. in PLoS ONE 5(5), 2010. https://doi.org/10.1371/journal.pone.0010773 ). Norms were provided by 584 Thai university students on eight dimensions: name agreement, object familiarity, visual complexity, category agreement, image agreement, two types of manipulability (graspability and mimeability), and age of acquisition. The results revealed comparatively similar levels of name agreement to Brodeur et al. especially when unfamiliar items were factored out. The pattern of intercorrelations among the Thai psycholinguistic norms was comparable to previous studies and our cross-linguistic correlations were robust for the same set of pictures in English and French. Conjointly, the findings extend the relevancy of the BOSS to Thailand, supporting this photographic resource for investigations of language and other cognitive processes in monolingual, multilingual, and brain-impaired populations.
A major obstacle for the design of rigorous, reproducible studies in moral psychology is the lack of suitable stimulus sets. Here, we present the Socio-Moral Image Database (SMID), the largest standardized moral stimulus set assembled to date, containing 2,941 freely available photographic images, representing a wide range of morally (and affectively) positive, negative and neutral content. The SMID was validated with over 820,525 individual judgments from 2,716 participants, with normative ratings currently available for all images on affective valence and arousal, moral wrongness, and relevance to each of the five moral values posited by Moral Foundations Theory. We present a thorough analysis of the SMID regarding (1) inter-rater consensus, (2) rating precision, and (3) breadth and variability of moral content. Additionally, we provide recommendations for use aimed at efficient study design and reproducibility, and outline planned extensions to the database. We anticipate that the SMID will serve as a useful resource for psychological, neuroscientific and computational (e.g., natural language processing or computer vision) investigations of social, moral and affective processes. The SMID images, along with associated normative data and additional resources are available at https://osf.io/2rqad/.
In this work, we report large-scale semantic role annotation of arguments in the Turkish dependency treebank, and present the first comprehensive Turkish semantic role labeling (SRL) resource: Turkish Proposition Bank (PropBank). We present our annotation workflow that harnesses crowd intelligence, and discuss the procedures for ensuring annotation consistency and quality control. Our discussion focuses on syntactic variations in realization of predicate-argument structures, and the large lexicon problem caused by complex derivational morphology. We describe our approach that exploits framesets of root verbs to abstract away from syntax and increase self-consistency of the Turkish PropBank. The issues that arise in the annotation of verbs derived via valency changing morphemes, verbal nominals, and nominal verbs are explored, and evaluation results for inter-annotator agreement are provided. Furthermore, semantic layer described here is aligned with universal dependency (UD) compliant treebank and released to enable more researchers to work on the problem. Finally, we use PropBank to establish a baseline score of 79.10 F1 for Turkish SRL using the mate-tool (an open-source SRL tool based on supervised machine learning) enhanced with basic morphological features. Turkish PropBank and the extended SRL system are made publicly available.
Treebanks are important resources for researchers in natural language processing. They provide training and testing materials so that different algorithms can be compared. However, it is not a trivial task to construct high-quality treebanks. We have not yet had a proper treebank for such a low-resource language as Vietnamese, which has probably lowered the performance of Vietnamese language processing. We have been building a consistent and accurate Vietnamese treebank to alleviate such situations. Our treebank is annotated with three layers: word segmentation, part-of-speech tagging, and bracketing. We developed detailed annotation guidelines for each layer by presenting Vietnamese linguistic issues as well as methods of addressing them. Here, we also describe approaches to controlling annotation quality while ensuring a reasonable annotation speed. We specifically designed an appropriate annotation process and an effective process to train annotators. In addition, we implemented several support tools to improve annotation speed and to control the consistency of the treebank. The results from experiments revealed that both inter-annotator agreement and accuracy were higher than 90%, which indicated that the treebank is reliable.
In this paper we introduce the Kurdish BLARK (Basic Language Resource Kit). The original BLARK has not considered multi-dialect characteristics and generally has targeted reasonably well-resourced languages. To consider these two features, we extended BLARK and applied the proposed extension to Kurdish. Kurdish language not only faces a paucity in resources, but also embraces several dialects within a complex linguistic context. This paper presents the Kurdish BLARK and shows that from Natural language processing and computational linguistics perspectives the revised BLARK provides a more applicable view of languages with similar characteristics to Kurdish.
Humans make moral judgments every day, and research demonstrates that these evaluations are based on a host of related event features (e.g., harm, legality). In order to acquire systematic data on how moral judgments are made, our assessments need to be expanded to include real-life, ecologically valid stimuli that take into account the numerous event features that are known to influence moral judgment. To facilitate this, Knutson et al. (in Social Cognitive and Affective Neuroscience, 5(4), 378–384, 2010) developed vignettes based on real-life episodic memories rated concurrently on key moral features; however, the method is time intensive (~1.4–3.4 h) and the stimuli and ratings require further validation and characterization. The present study addresses these limitations by: (i) validating three short subsets of these vignettes (39 per subset) that are time-efficient (10–25 min per subset) yet representative of the ratings and factor structure of the full set, (ii) norming ratings of moral features in a larger sample (total N = 661, each subset N = ~220 vs. Knutson et al. N = 30), (iii) examining the generalizability of the original factor structure by replicating it in a larger sample across vignette subsets, sex, and political ideology, and (iv) using latent profile analysis to empirically characterize vignette groupings based on event feature ratings profiles and vignette content. This study therefore provides researchers with a core battery of well-characterized and realistic vignettes, concurrently rated on key moral features that can be administered in a brief, time-efficient manner to advance research on the nature of moral judgment.
Numerous studies in psychology, cognitive neuroscience and psycholinguistics have used pictures of objects as stimulus materials. Currently, authors engaged in cross-linguistic work or wishing to run parallel studies at multiple sites where different languages are spoken must rely on rather small sets of black-and-white or colored line drawings. These sets are increasingly experienced as being too limited. Therefore, we constructed a new set of 750 colored pictures of concrete concepts. This set, MultiPic, constitutes a new valuable tool for cognitive scientists investigating language, visual perception, memory and/or attention in monolingual or multilingual populations. Importantly, the MultiPic databank has been normed in six different European languages (British English, Spanish, French, Dutch, Italian and German). All stimuli and norms are freely available at http://www.bcbl.eu/databases/multipic.
Recent years have witnessed a growing interest in behavioral and neuroimaging studies on the processing of symbolic communicative gestures, such as pantomimes and emblems, but well-controlled stimuli have been scarce. This study describes a dataset of more than 200 video clips of an actress performing pantomimes (gestures that mimic object-directed/object-use actions; e.g., playing guitar), emblems (conventional gestures; e.g., thumbs up), and meaningless gestures. Gestures were divided into four lists. For each of these four lists, 50 Italian and 50 American raters judged the meaningfulness of the gestures and provided names and descriptions for them. The results of these rating and norming measures are reported separately for the Italian and American raters, offering the first normed set of meaningful and meaningless gestures for experimental studies. The stimuli are available for download via the Figshare database.
The Massive Auditory Lexical Decision (MALD) database is an end-to-end, freely available auditory and production data set for speech and psycholinguistic research, providing time-aligned stimulus recordings for 26,793 words and 9592 pseudowords, and response data for 227,179 auditory lexical decisions from 231 unique monolingual English listeners. In addition to the experimental data, we provide many precompiled listener- and item-level descriptor variables. This data set makes it easy to explore responses, build and test theories, and compare a wide range of models. We present summary statistics and analyses.
We report normative data from a large (N = 307) sample of young adult participants tested with a computerized version of the long form of the classical Benton Facial Recognition Test (BFRT; Benton {\&} Van Allen, 1968). The BFRT-c requires participants to match a target face photograph to either one or three of six face photographs presented simultaneously. We found that the percent accuracy on the BFRT-c (81{\%}-83{\%}) was below ceiling yet well above chance level, with little interindividual variance in this typical population sample, two important aspects of a sensitive clinical test. Although the split-half reliability on response accuracy was relatively low, due to the large variability in difficulty across items, the correct response times measured in this version-completed in 3 min, on average-provide a reliable and critical complementary measure of performance at individual unfamiliar-face matching. In line with previous observations from other measures, females outperformed male participants at the BFRT-c, especially for female faces. In general, performance was also lower following lighting changes than following head rotations, in line with previous studies that have emphasized participants' limited ability to match pictures of unfamiliar faces with important variations in illumination. Overall, this normative data set supports the validity of the BFRT-c as a key component of a battery of tests to identify clinical impairments in individual face recognition, such as observed in acquired prosopagnosia. However, this analysis strongly recommends that researchers consider the full test results: Beyond global indexes of performance based on accuracy rates only, they should consider the time taken to match individual faces as well as the variability in performance across items.
Subjective ratings of perceptual and motor attributes were obtained for a set of 750 concrete concepts in Spanish by requiring scale-based judgments from a sample of university students (N = 539). Following on the work of Amsel, Urbach, and Kutas (2012), the seven attributes were color, motion, sound, smell, taste, graspability, and pain. Normative data based on the obtained ratings are provided as a tool for future investigations. Additionally, the relationships of these attributes to other lexical dimensions (e.g., familiarity, frequency, concreteness) and the factorial organization of concepts around the main components were analyzed. The pattern of results is consistent with prior findings that highlight the relevance of dimensions related to survival as being crucially involved in conceptual processing.
Nonword repetition (NWR) has been a widely used measure of language-learning ability in children with and without language disorders. Although NWR tasks have been created for a variety of languages, minimal attention has been given to Asian tonal languages. This study introduces a new set of NWR stimuli for Vietnamese. The stimuli include 20 items ranging in length from one to four syllables. The items consist of dialect-neutral phonemes in consonant-vowel (CV) and CVC sequences that follow the phonotactic constraints of the language. They were rated high on wordlikeness and have comparable position segments and biphone probabilities across stimulus lengths. We validated the stimuli with a sample of 59 typically developing Vietnamese-English bilingual children, ages 5 to 8. The stimuli exhibited the expected age and length effects commonly found in NWR tasks: Older children performed better on the task than younger children, and longer items were more difficult to repeat than shorter items. We also compared different scoring systems in order to examine the individual phoneme types (consonants, vowels, and tones) and composite scores (proportions of phonemes correct, with and without tone). The study demonstrates careful construction and validation of the stimuli, and future directions are discussed.
textcopyright} 2017, The Author(s). Adults need to be able to process infants' emotional expressions accurately to respond appropriately and care for infants. However, research on processing of the emotional expressions of infant faces is hampered by the lack of validated stimuli. Although many sets of photographs of adult faces are available to researchers, there are no corresponding sets of photographs of infant faces. We therefore developed and validated a database of infant faces, which is available via e-mail request. Parents were recruited via social media and asked to send photographs of their infant (0–12 months of age) showing positive, negative, and neutral facial expressions. A total of 195 infant faces were obtained and validated. To validate the images, student midwives and nurses (n = 53) and members of the general public (n = 18) rated each image with respect to its facial expression, intensity of expression, clarity of expression, genuineness of expression, and valence. On the basis of these ratings, a total of 154 images with rating agreements of at least 75{\%} were included in the final database. These comprise 60 photographs of positive infant faces, 54 photographs of negative infant faces, and 40 photographs of neutral infant faces. The images have high criterion validity and good test–retest reliability. This database is therefore a useful and valid tool for researchers.
Using appropriate stimuli to evoke emotions is especially important for researching emotion. Psychologists have provided several standardized affective stimulus databases-such as the International Affective Picture System (IAPS) and the Nencki Affective Picture System (NAPS) as visual stimulus databases, as well as the International Affective Digitized Sounds (IADS) and the Montreal Affective Voices as auditory stimulus databases for emotional experiments. However, considering the limitations of the existing auditory stimulus database studies, research using auditory stimuli is relatively limited compared with the studies using visual stimuli. First, the number of sample sounds is limited, making it difficult to equate across emotional conditions and semantic categories. Second, some artificially created materials (music or human voice) may fail to accurately drive the intended emotional processes. Our principal aim was to expand existing auditory affective sample database to sufficiently cover natural sounds. We asked 207 participants to rate 935 sounds (including the sounds from the IADS-2) using the Self-Assessment Manikin (SAM) and three basic-emotion rating scales. The results showed that emotions in sounds can be distinguished on the affective rating scales, and the stability of the evaluations of sounds revealed that we have successfully provided a larger corpus of natural, emotionally evocative auditory stimuli, covering a wide range of semantic categories. Our expanded, standardized sound sample database may promote a wide range of research in auditory systems and the possible interactions with other sensory modalities, encouraging direct reliable comparisons of outcomes from different researchers in the field of psychology.
Orthography-semantics consistency (OSC) is a measure that quantifies the degree of semantic relatedness between a word and its orthographic relatives. OSC is computed as the frequency-weighted average semantic similarity between the meaning of a given word and the meanings of all the words containing that very same orthographic string, as captured by distributional semantic models. We present a resource including optimized estimates of OSC for 15,017 English words. In a series of analyses, we provide a progressive optimization of the OSC variable. We show that computing OSC from word-embeddings models (in place of traditional count models), limiting preprocessing of the corpus used for inducing semantic vectors (in particular, avoiding part-of-speech tagging and lemmatization), and relying on a wider pool of orthographic relatives provide better performance for the measure in a lexical-processing task. We further show that OSC is an important and significant predictor of reaction times in visual word recognition and word naming, one that correlates only weakly with other psycholinguistic variables (e.g., family size, word frequency), indicating that it captures a novel source of variance in lexical access. Finally, some theoretical and methodological implications are discussed of adopting OSC as one of the predictors of reaction times in studies of visual word recognition.
The rapid expansion of the Internet and the availability of vast repositories of natural text provide researchers with the immense opportunity to study human reactions, opinions, and behavior on a massive scale. To help researchers take advantage of this new frontier, the present work introduces and validates the Evaluative Lexicon 2.0 (EL 2.0)—a quantitative linguistic tool that specializes in the measurement of the emotionality of individuals' evaluations in text. Specifically, the EL 2.0 utilizes natural language to measure the emotionality, extremity, and valence of evaluative reactions and attitudes. The present article describes how we used a combination of 9 million real-world online reviews and over 1,500 participant judges to construct the EL 2.0 and an additional 5.7 million reviews to validate it. To assess its unique value, the EL 2.0 is compared with two other prominent text analysis tools—LIWC and Warriner et al.'s (Behavior Research Methods, 45, 1191–1207, 2013) wordlist. The EL 2.0 is comparatively distinct in its ability to measure emotionality and explains a significantly greater proportion of the variance in individuals' evaluations. The EL 2.0 can be used with any data that involve speech or writing and provides researchers with the opportunity to capture evaluative reactions both in the laboratory and “in the wild.” The EL 2.0 wordlist and normative emotionality, extremity, and valence ratings are freely available from www.evaluativelexicon.com.
This article describes the method used to build the Basque Verb Index (BVI), a corpus-based lexicon. The BVI is the result of semiautomatic annotation of the EPEC corpus with verb predicate information, following the PropBank-VerbNet model. The method presented is the product of a deep study of the syntactic–semantic behaviour of verbs in EPEC-RolSem (the EPEC corpus tagged with verb predicate information). During the process of annotating EPEC-RolSem, we have identified and stored in the BVI lexicon the different role-patterns associated with all verbs appearing in the corpus. In addition, each entry in the BVI is linked to the corresponding verb entry in well-known resources such as PropBank, VerbNet, WordNet and FrameNet. We have also implemented a tool called e-ROLda to facilitate the process of looking up verb patterns in the BVI and examples in EPEC-RolSem as a basis for future studies.
We present word prevalence data for 61,858 English words. Word prevalence refers to the number of people who know the word. The measure was obtained on the basis of an online crowdsourcing study involving over 220,000 people. Word prevalence data are useful for gauging the difficulty of words and, as such, for matching stimulus materials in experimental conditions or selecting stimulus materials for vocabulary tests. Word prevalence also predicts word processing times, over and above the effects of word frequency, word length, similarity to other words, and age of acquisition, in line with previous findings in the Dutch language.
Using the megastudy approach, we report a new database (MEGALEX) of visual and auditory lexical decision times and accuracy rates for tens of thousands of words. We collected visual lexical decision data for 28,466 French words and the same number of pseudowords, and auditory lexical decision data for 17,876 French words and the same number of pseudowords (synthesized tokens were used for the auditory modality). This constitutes the first large-scale database for auditory lexical decision, and the first database to enable a direct comparison of word recognition in different modalities. Different regression analyses were conducted to illustrate potential ways to exploit this megastudy database. First, we compared the proportions of variance accounted for by five word frequency measures. Second, we conducted item-level regression analyses to examine the relative importance of the lexical variables influencing performance in the different modalities (visual and auditory). Finally, we compared the similarities and differences between the two modalities. All data are freely available on our website ( https://sedufau.shinyapps.io/megalex/ ) and are searchable at www.lexique.org , inside the Open Lexique search engine.
This article describes a family of dependency treebanks of early attestations of Indo-European languages originating in the parallel treebank built by the members of the project pragmatic resources in old Indo-European languages. The treebanks all share a set of open-source software tools, including a web annotation interface, and a set of annotation schemes and guidelines developed especially for the project languages. The treebanks use an enriched dependency grammar scheme complemented by detailed morphological tags, which have proved sufficient to give detailed descriptions of these richly inflected languages, and which have been easy to adapt to new languages. We describe the tools and annotation schemes and discuss some challenges posed by the various languages that have been annotated. We also discuss problems with tokenisation, sentence division and lemmatisation, commonly encountered in ancient and mediaeval texts, and challenges associated with low levels of standardisation and ongoing morphological and syntactic change.
Standardized pictorial stimuli and predictors of successful picture naming are not readily available for Gulf Arabic. On the basis of data obtained from Qatari Arabic, a variety of Gulf Arabic, the present study provides norms for a set of 319 object pictures and a set of 141 action pictures. Norms were collected from healthy speakers, using a picture-naming paradigm and rating tasks. Norms for naming latencies, name agreement, visual complexity, image agreement, imageability, age of acquisition, and familiarity were established. Furthermore, the database includes other intrinsic factors, such as syllable length and phoneme length. It also includes orthographic frequency values (extracted from Aralex; Boudelaa {\&} Marslen-Wilson, 2010). These factors were then examined for their impact on picture-naming latencies in object- and action-naming tasks. The analysis showed that the primary determinants of naming latencies in both nouns and verbs are (in descending order) image agreement, name agreement, familiarity, age of acquisition, and imageability. These results indicate no evidence that noun- and verb-naming processes in Gulf Arabic are influenced in different ways by these variables. This is the first database for Gulf Arabic, and therefore the norms collected from the present study will be of paramount importance for researchers and clinicians working with speakers of this variety of Arabic. Due to the similarity of the Arabic varieties spoken in the Gulf, these different varieties are grouped together under the label "Gulf Arabic" in the literature. The normative databases and the standardized pictures from this study can be downloaded from http://qufaculty.qu.edu.qa/tariq-khwaileh/download-center/ .
We report a new multidimensional measure of visual complexity (GraphCom) that captures variability in the complexity of graphs within and across writing systems. We applied the measure to 131 written languages, allowing comparisons of complexity and providing a basis for empirical testing of GraphCom. The measure includes four dimensions whose value in capturing the different visual properties of graphs had been demonstrated in prior reading research—(1) perimetric complexity, sensitive to the ratio of a written form to its surrounding white space (Pelli, Burns, Farell, {\&} Moore-Page, 2006); (2) number of disconnected components, sensitive to discontinuity (Gibson, 1969); (3) number of connected points, sensitive to continuity (Lanthier, Risko, Stolz, {\&} Besner, 2009); and (4) number of simple features, sensitive to the strokes that compose graphs (Wu, Zhou, {\&} Shu, 1999). In our analysis of the complexity of 21,550 graphs, we (a) determined the complexity variation across writing systems along each dimension, (b) examined the relationships among complexity patterns within and across writing systems, and (c) compared the dimensions in their abilities to differentiate the graphs from different writing systems, in order to predict human perceptual judgments (n = 180) of graphs with varying complexity. The results from the computational and experimental comparisons showed that GraphCom provides a measure of graphic complexity that exceeds previous measures in its empirical validation. The measure can be universally applied across writing systems, providing a research tool for studies of reading and writing.
We describe the Multilanguage Written Picture Naming Dataset. This gives trial-level data and time and agreement norms for written naming of the 260 pictures of everyday objects that compose the colorized Snodgrass and Vanderwart picture set (Rossion {\&} Pourtois in Perception, 33, 217–236, 2004). Adult participants gave keyboarded responses in their first language under controlled experimental conditions (N = 1,274, with subsamples responding in Bulgarian, Dutch, English, Finnish, French, German, Greek, Icelandic, Italian, Norwegian, Portuguese, Russian, Spanish, and Swedish). We measured the time to initiate a response (RT) and interkeypress intervals, and calculated measures of name and spelling agreement. There was a tendency across all languages for quicker RTs to pictures with higher familiarity, image agreement, and name frequency, and with higher name agreement. Effects of spelling agreement and effects on output rates after writing onset were present in some, but not all, languages. Written naming therefore shows name retrieval effects that are similar to those found in speech, but our findings suggest the need for cross-language comparisons as we seek to understand the orthographic retrieval and/or assembly processes that are specific to written output.
This article presents the Provo Corpus, a corpus of eye-tracking data with accompanying predictability norms. The predictability norms for the Provo Corpus differ from those of other corpora. In addition to traditional cloze scores that estimate the predictability of the full orthographic form of each word, the Provo Corpus also includes measures of the predictability of the morpho-syntactic and semantic information for each word. This makes the Provo Corpus ideal for studying predictive processes in reading. Some analyses using these data have previously been reported elsewhere (Luke {\&} Christianson, 2016). The Provo Corpus is available for download on the Open Science Framework, at https://osf.io/sjefs .
In this article, we present StimulStat – a lexical database for the Russian language in the form of a web application. The database contains more than 52,000 of the most frequent Russian lemmas and more than 1.7 million word forms derived from them. These lemmas and forms are characterized according to more than 70 properties that were demonstrated to be relevant for psycholinguistic research, including frequency, length, phonological and grammatical properties, orthographic and phonological neighborhood frequency and size, grammatical ambiguity, homonymy and polysemy. Some properties were retrieved from various dictionaries and are presented collectively in a searchable form for the first time, the others were computed specifically for the database. The database can be accessed freely at http://stimul.cognitivestudies.ru. {\textcopyright} 2017, Psychonomic Society, Inc.
This article reports the construction of a multimodal annotated database of spoken discourse and co-verbal gestures by native healthy speakers of Cantonese and individuals with language impairment: the Cantonese AphasiaBank. This corpus was established as a foundation for aphasiologists and clinicians to use in designing and conducting research investigations into theoretical and clinical issues related to acquired language disorders in Chinese. Details in terms of the purpose, structure, and levels of annotation of the database (containing part-of-speech-annotated orthographic transcripts with Romanization and the corresponding videos) are described. The discussion presents the challenges of building a spoken database of a language that is not linguistically well-researched and that does not have a standardized written form for many of its lexical items, as well as presenting how these issues were addressed. Most importantly, the article highlights the potential of Cantonese AphasiaBank as a powerful research tool for linguists and psycholinguists.
The present study provides normative measures for a new stimulus set of images consisting of 225 everyday objects, each depicted both as a photograph and a matched clipart image generated directly from the photograph (450 images total). The clipart images preserve the same scale, shape, orientation, and general color features as the corresponding photographs. Various norms (modal name and verb agreement measures, picture-name agreement, familiarity, visual complexity, and image agreement) were collected separately for each image type and in two different contexts: online (using Mechanical Turk) and in the laboratory. We discuss similarities and differences in the normative measures according to both image type and experimental context. The full set of norms is provided in the supplemental materials.
textcopyright} 2018 Psychonomic Society, Inc. In language production research, the latency with which speakers produce a spoken response to a stimulus and the onset and offset times of words in longer utterances are key dependent variables. Measuring these variables automatically often yields partially incorrect results. However, exact measurements through the visual inspection of the recordings are extremely time-consuming. We present AlignTool, an open-source alignment tool that establishes preliminarily the onset and offset times of words and phonemes in spoken utterances using Praat, and subsequently performs a forced alignment of the spoken utterances and their orthographic transcriptions in the automatic speech recognition system MAUS. AlignTool creates a Praat TextGrid file for inspection and manual correction by the user, if necessary. We evaluated AlignTool's performance with recordings of single-word and four-word utterances as well as semi-spontaneous speech. AlignTool performs well with audio signals with an excellent signal-to-noise ratio, requiring virtually no corrections. For audio signals of lesser quality, AlignTool still is highly functional but its results may require more frequent manual corrections. We also found that audio recordings including long silent intervals tended to pose greater difficulties for AlignTool than recordings filled with speech, which AlignTool analyzed well overall. We expect that by semi-automatizing the temporal analysis of complex utterances, AlignTool will open new avenues in language production research.
Automatic speech recognition (ASR) has gained significant improvement for major languages such as English and Chinese, partly due to the emergence of deep neural networks (DNN) and large amount of training data. For minority languages, however, the progress is largely behind the main stream. A particularly obstacle is that there are almost no large-scale speech databases for minority languages, and the only few databases are held by some institutes as private properties, far from open and standard, and very few are free. Besides the speech database, phonetic and linguistic resources are also scarce, including phone set, lexicon, and language model. In this paper, we publish a speech database in Kazakh, a major minority language in the western China. Accompanying this database, a full set of phonetic and linguistic resources are also published, by which a full-fledged Kazakh ASR system can be constructed. We will describe the recipe for constructing a baseline system, and report our present results. The resources are free for research institutes and can be obtained by request. The publication is supported by the M2ASR project supported by NSFC, which aims to build multilingual ASR systems for minority languages in China.
In the present work, we introduce the Miami University Deception Detection Database (MU3D), a free resource containing 320 videos of target individuals telling truths and lies. Eighty (20 Black female, 20 Black male, 20 White female, and 20 White male) different targets were recorded speaking honestly and dishonestly about their social relationships. Each target generated four different videos (i.e., positive truth, negative truth, positive lie, negative lie), yielding 320 videos fully crossing target race, target gender, statement valence, and statement veracity. These videos were transcribed by trained research assistants and evaluated by na{\"{i}}ve raters. Descriptive analyses of the video characteristics (e.g., length) and subjective ratings (e.g., target attractiveness) are provided. The stimuli and an information codebook can be accessed free of charge for academic research purposes from http://hdl.handle.net/2374.MIA/6067. The MU3D offers scholars the ability to conduct research using standardized stimuli that can aid in building more comprehensive theories of interpersonal sensitivity, enhance replication among labs, facilitate the use of signal detection analyses, and promote consideration of race, gender, and their interactive effects in deception detection research.
Humor ratings are provided for 4,997 English words collected from 821 participants using an online crowd-sourcing platform. Each participant rated 211 words on a scale from 1 (humorless) to 5 (humorous). To provide for comparisons across norms, words were chosen from a set common to a number of previously collected norms (e.g., arousal, valence, dominance, concreteness, age of acquisition, and reaction time). The complete dataset provides researchers with a list of humor ratings and includes information on gender, age, and educational differences. Results of analyses show that the ratings have reliability on a par with previous ratings and are not well predicted by existing norms.
We present here emoFinder ( http://usc.es/pcc/emofinder ), a Web-based search engine for Spanish word properties taken from different normative databases. The tool incorporates several subjective word properties for 16,375 distinct words. Although it focuses particularly on normative ratings for emotional dimensions (e.g., valence and arousal) and discrete emotional categories (fear, disgust, anger, happiness, and sadness), it also makes available ratings for other word properties that are known to affect word processing (e.g., concreteness, familiarity, contextual availability, and age of acquisition). The tool provides two main functionalities: Users can search for words matched on specific criteria with regard to the selected properties, or users can obtain the properties for a set of words. The output from emoFinder is highly customizable and can be accessed online or exported to a computer. The tool architecture is easily scalable, so that it can be updated to include word properties from new Spanish normative databases as they become available.
The use of emoticons and emoji is increasingly popular across a variety of new platforms of online communication. They have also become popular as stimulus materials in scientific research. However, the assumption that emoji/emoticon users' interpretations always correspond to the developers'/researchers' intended meanings might be misleading. This article presents subjective norms of emoji and emoticons provided by everyday users. The Lisbon Emoji and Emoticon Database (LEED) comprises 238 stimuli: 85 emoticons and 153 emoji (collected from iOS, Android, Facebook, and Emojipedia). The sample included 505 Portuguese participants recruited online. Each participant evaluated a random subset of 20 stimuli for seven dimensions: aesthetic appeal, familiarity, visual complexity, concreteness, valence, arousal, and meaningfulness. Participants were additionally asked to attribute a meaning to each stimulus. The norms obtained include quantitative descriptive results (means, standard deviations, and confidence intervals) and a meaning analysis for each stimulus. We also examined the correlations between the dimensions and tested for differences between emoticons and emoji, as well as between the two major operating systems-Android and iOS. The LEED constitutes a readily available normative database (available at www.osf.io/nua4x ) with potential applications to different research domains.
Phonotactic probability refers to the frequency with which phonological segments and sequences of phonological segments occur in words in a given language. We describe one method of estimating phonotactic probabilities based on words in American English. These estimates of phonotactic probability have been used in a number of previous studies and are now being made available to other researchers via a Web-based interface. Instructions for using the interface, as well as details regarding how the measures were derived, are provided in the present article. The Phonotactic Probability Calculator can be accessed at http://www.people.ku.edu/-mvitevit/PhonoProbHome.html.
Sentiment information about social media posts is increasingly considered an important resource for customer segmentation, market understanding, and tackling other socio-economic issues. However, sentiment in social media is difficult to measure since user-generated content is usually short and informal. Although many traditional sentiment analysis methods have been proposed, identifying slang sentiment words remains a challenging task for practitioners. Though some slang words are available in existing sentiment lexicons, with new slang being generated with emerging memes, a dedicated lexicon will be useful for researchers and practitioners. To this end, we propose to build a slang sentiment dictionary to aid sentiment analysis. It is laborious and time-consuming to collect a comprehensive list of slang words and label the sentiment polarity. We present an approach to leverage web resources to construct a Slang Sentiment Dictionary (SlangSD) that is easy to expand. SlangSD is publicly available for research purposes. We empirically show the advantages of using SlangSD, the newly-built slang sentiment word dictionary for sentiment classification, and provide examples demonstrating its ease of use with a sentiment analysis system.
The paper presents the results of the Janes project, which aimed to develop language resources and tools for Slovene user generated content. The paper first describes the 200 million word Janes corpus, containing tweets, forum posts, news comments, user and talk pages from Wikipedia, and blogs and blog comments, where each text is accompanied by rich metadata. The developed processing tools for Slovene user generated content are presented next, which include a tokeniser, word-normaliser, part-of-speech tagger and lemmatiser, and a named entity recogniser. A set of manually annotated datasets was also produced, both for tool training as well as for linguistic research. The developed resources and tools are made publicly available under Creative Commons licences in the repository of the CLARIN.SI research infrastructure and on GitHub, while the corpora are also available through the CLARIN.SI concordancers.
Most current models of research on emotion recognize valence (how pleasant a stimulus is) and arousal (the level of activation or intensity that a stimulus elicits) as important components in the classification of affective experiences (Barrett, 1998; Kuppens, Tuerlinckx, Russell, {\&} Barrett, 2012). Here we present a set of norms for valence and arousal for a very large set of Spanish words, including items from a variety of frequencies, semantic categories, and parts of speech, including a subset of conjugated verbs. In this regard, we found that there were significant but very small differences between the ratings for conjugations of the same verb, validating the practice of applying the ratings for infinitives to all derived forms of the verb. Our norms show a high degree of reliability and are strongly correlated with those of Redondo, Fraga, Padr{\'{o}}n, and Comesa{\~{n}}a's (2007) Spanish version of the influential Affective Norms for English Words (Bradley {\&} Lang, 1999), as well as those from Warriner, Kuperman, and Brysbaert (2013), the largest available set of emotional norms for English words. Additionally, we included measures of word prevalence-that is, the percentage of participants that knew a particular word-for each variable (Keuleers, Stevens, Mandera, {\&} Brysbaert, 2015). Our large set of norms in Spanish not only will facilitate the creation of stimuli and the analysis of texts in that language, but also will be useful for cross-language comparisons and research on emotional aspects of bilingualism. The norms can be downloaded and available as a supplementary materials to this article.
During social communication, words and sentences play a critical role in the expression of emotional meaning. The Minho Affective Sentences (MAS) were developed to respond to the lack of a standardized sentence battery with normative affective ratings: 192 neutral, positive, and negative declarative sentences were strictly controlled for psycholinguistic variables such as numbers of words and letters and per-million word frequency. The sentences were designed to represent examples of each of the five basic emotions (anger, sadness, disgust, fear, and happiness) and of neutral situations. These sentences were presented to 536 participants who rated the stimuli using both dimensional and categorical measures of emotions. Sex differences were also explored. Additionally, we probed how personality, empathy, and mood from a subset of 40 participants modulated the affective ratings. Our results confirmed that the MAS affective norms are valid measures to guide the selection of stimuli for experimental studies of emotion. The combination of dimensional and categorical ratings provided a more fine-grained characterization of the affective properties of the sentences. Moreover, the affective ratings of positive and negative sentences were not only modulated by participants' sex, but also by individual differences in empathy and mood state. Together, our results indicate that, in their quest to reveal the neurofunctional underpinnings of verbal emotional processing, researchers should consider not only the role of sex, but also of interindividual differences in empathy and mood states, in responses to the emotional meaning of sentences.
We have developed and tested 144 compound remote associate problems. Across eight experiments, 289 participants were given four time limits (2 sec, 7 sec, 15 sec, or 30 sec) for solving each problem. This paper provides a brief overview of the problems and normative data regarding the percentage of participants solving, and mean time-to-solution for, each problem at each time limit. These normative data can be used in selecting problems on the basis of difficulty or mean time necessary for reaching a solution.
This paper presents a rule-based approach for generating a large phonetic database for Romanian. The knowledge base is developed by means of the GRAALAN (Grammar Abstract Language) system. By inspecting dictionaries and corpora, we generate a phonetic database over 100,000 lemmas. Our database has a high degree of accuracy ensured by our rule-based method applied for generating phonetic transcriptions.
The BT Archives house the records of British Telecom, the world's oldest telecommunications company, which traces its history back to the formation of the Electric Telegraphy Company in 1846. Prior to its privatisation in 1984, BT was a public corporation (and before that a government department) and as a result all of the pre-privatisation material in the archives is in the public domain, making it ideal for academic research. Despite this legal availability, however, the physical availability of material in the archive was limited to two days a week in an archive space in Holborn, London. In 2011 the ‘New Connections' project was set up with the aim of making around half a million items from the public archives of British Telecom available in a new digital archive. As part of ‘New Connections', three academic research projects were funded, one of which was the creation and analysis of the British Telecom Correspondence Corpus (BTCC). The era that the archive covers makes it a potentially fascinating source of data for the linguistic study of business correspondence. The mid-nineteenth to latetwentieth century is a crucial period in the development of English business correspondence as the amount of business being conducted by letter increased massively during this period as a result of the Industrial Revolution, the introduction of the Penny Post, and increased access to education both in schools and through composition grammar guides. Despite its importance in the development of business correspondence, this period has received relatively little attention. The aim of constructing the British Telecom Correspondence Corpus was to start addressing this gap in available linguistic data and enable studies into the development of business correspondence from the mid-nineteenth to late-twentieth century.
Here we report on MELD-SCH (MEgastudy of Lexical Decision in Simplified CHinese), a dataset that contains the lexical decision data of 1,020 one-character, 10,022 two-character, 949 three-character, and 587 four-character simplified Chinese words obtained from 504 native Chinese users. It also includes a number of word-level and character-level variables. Analyses showed that the reliability of the dataset is satisfactory, as indicated by split-half correlations and comparisons with other datasets. Item-based regression showed that both word-level and character-level variables contributed significantly to the reaction times and error rates of lexical decision. Moreover, we discovered a U-shape relationship between word-length and reaction times, which has not been reported in Chinese before. MELD-SCH can facilitate research in Chinese word recognition by providing high quality normative data and information of different linguistic variables. It also encourages researchers to extend their empirical findings, which are mostly based on one-character and two-character words, to words of different lengths.
This study introduces a corpus of 260 naturalistic human nonlinguistic vocalizations representing nine emotions: amusement, anger, disgust, effort, fear, joy, pain, pleasure, and sadness. The recognition accuracy in a rating task varied greatly per emotion, from {\textless}40{\{}{\%}{\}} for joy and pain, to {\textgreater}70{\{}{\%}{\}} for amusement, pleasure, fear, and sadness. In contrast, the raters' linguistic--cultural group had no effect on recognition accuracy: The predominantly English-language corpus was classified with similar accuracies by participants from Brazil, Russia, Sweden, and the UK/USA. Supervised random forest models classified the sounds as accurately as the human raters. The best acoustic predictors of emotion were pitch, harmonicity, and the spacing and regularity of syllables. This corpus of ecologically valid emotional vocalizations can be filtered to include only sounds with high recognition rates, in order to study reactions to emotional stimuli of known perceptual types (reception side), or can be used in its entirety to study the association between affective states and vocal expressions (production side).
This article introduces GECO, the Ghent Eye-Tracking Corpus, a monolingual and bilingual corpus of the eyetracking data of participants reading a complete novel. English monolinguals and Dutch–English bilinguals read an entire novel, which was presented in paragraphs on the screen. The bilinguals read half of the novel in their first language, and the other half in their second language. In this article, we describe the distributions and descriptive statistics of the most important reading time measures for the two groups of participants. This large eyetracking corpus is perfectly suited for both exploratory purposes and more directed hypothesis testing, and it can guide the formulation of ideas and theories about naturalistic reading processes in a meaningful context. Most importantly, this corpus has the potential to evaluate the generalizability of monolingual and bilingual language theories and models to the reading of long texts and narratives. The corpus is freely available at http://expsy.ugent.be/downloads/geco.
The discrete emotion theory proposes that affective experiences can be reduced to a limited set of universal "basic" emotions, most commonly identified as happiness, sadness, anger, fear, and disgust. Here we present norms for 10,491 Spanish words for those five discrete emotions collected from a total of 2,010 native speakers, making it the largest set of norms for discrete emotions in any language to date. When used in conjunction with the norms from Hinojosa, Mart{\'{i}}nez-Garc{\'{i}}a et al. (Behavior Research Methods, 48, 272-284, 2016) and Ferr{\'{e}}, Guasch, Mart{\'{i}}nez-Garc{\'{i}}a, Fraga, {\&} Hinojosa (Behavior Research Methods, 49, 1082-1094, 2017), researchers now have access to ratings of discrete emotions for 13,633 Spanish words. Our norms show a high degree of inter-rater reliability and correlate highly with those from Ferr{\'{e}} et al. (2017). Our exploration of the relationship between the five discrete emotions and relevant lexical and emotional variables confirmed findings of previous studies conducted with smaller datasets. The availability of such large set of norms will greatly facilitate the study of emotion, language and related fields. The norms are available as supplementary materials to this article.
In this article we present Curras, the first morphologically annotated corpus of the Palestinian Arabic dialect. Palestinian Arabic is one of the many primarily spoken dialects of the Arabic language. Arabic dialects are generally under-resourced compared to Modern Standard Arabic, the primarily written and official form of Arabic. We start in the article with a background description that situates Palestinian Arabic linguistically and historically and compares it to Modern Standard Arabic and Egyptian Arabic in terms of phonological, morphological, orthographic, and lexical variations. We then describe the methodology we developed to collect Palestinian Arabic text to guarantee a variety of representative domains and genres. We also discuss the annotation process we used, which extended previous efforts for annotation guideline development, and utilized existing automatic annotation solutions for Standard Arabic and Egyptian Arabic. The annotation guidelines and annotation meta-data are described in detail. The Curras Palestinian Arabic corpus consists of more than 56 K tokens, which are annotated with rich morphological and lexical features. The inter-annotator agreement results indicate a high degree of consistency.
This article introduces the MC4WEPS corpus, a new resource for evaluating Web people search disambiguation tasks, and describes its design, collection and annotation process, the agreement between the different annotators, and finally introduces a baseline evaluation. This corpus is built by compiling multilingual search engines results where the queries are person names. Proper noun disambiguation is an open problem in natural language ambiguity resolution and, specifically, resolving the ambiguity of person names in Web search results is still a challenging problem. However, state-of-the-art approaches have been evaluated only with monolingual web page collections. The MC4WEPS corpus aims to provide the research community with a reference corpus for the task of disambiguating search engine results where the query is a person name shared by homonymous individuals. The features of this new corpus stand out from existing corpora for the same task, namely multilingualism and inclusion of social networking websites. These characteristics make it more representative of a real search scenario, especially for evaluating person name disambiguation in a multilingual context. The article also includes detailed information about the format and the availability of the corpus.
We introduce the Open Affective Standardized Image Set (OASIS), an open-access online stimulus set containing 900 color images depicting a broad spectrum of themes, including humans, animals, objects, and scenes, along with normative ratings on two affective dimensions-valence (i.e., the degree of positive or negative affective response that the image evokes) and arousal (i.e., the intensity of the affective response that the image evokes). The OASIS images were collected from online sources, and valence and arousal ratings were obtained in an online study (total N = 822). The valence and arousal ratings covered much of the circumplex space and were highly reliable and consistent across gender groups. OASIS has four advantages: (a) the stimulus set contains a large number of images in four categories; (b) the data were collected in 2015, and thus OASIS features more current images and reflects more current ratings of valence and arousal than do existing stimulus sets; (c) the OASIS database affords users the ability to interactively explore images by category and ratings; and, most critically, (d) OASIS allows for free use of the images in online and offline research studies, as they are not subject to the copyright restrictions that apply to the International Affective Picture System. The OASIS images, along with normative valence and arousal ratings, are available for download from www.benedekkurdi.com/{\#}oasis or https://db.tt/yYTZYCga .
This article provides norms for general taboo, personal taboo, insult, valence, and arousal for 672 Dutch words, including 202 taboo words. Norms were collected using a 7-point Likert scale and based on ratings by psychology students from the Erasmus University Rotterdam in The Netherlands. The sample consisted of 87 psychology students (58 females, 29 males). We obtained high reliability based on split-half analyses. Our norms show high correlations with arousal and valence ratings collected by another Dutch word-norms study (Moors et al.,, Behavior Research Methods, 45, 169-177, 2013). Our results show that the previously found qua-dratic relation (i.e., U-shaped pattern) between valence and arousal also holds when only taboo words are considered. Additionally, words rated high on taboo tended to be rated low on valence, but some words related to sex rated high on both taboo and valence. Words that rated high on taboo rated high on insult, again with the exception of words related to sex many of which rated low on insult. Finally, words rated high on taboo and insult rated high on arousal. The Dutch Taboo Norms (DTN) database is a useful tool for researchers interested in the effects of taboo words on cognitive processing. The data associated with this paper can be accessed via the Open Science Framework (https://osf.io/vk782/).
This paper presents the methodology, design principles and detailed evaluation of a new freely available multilayer corpus, collected and edited via classroom annotation using collaborative software. After briefly discussing corpus design for open, extensible corpora, five classroom annotation projects are presented, covering structural markup in TEI XML, multiple part of speech tagging, constituent and dependency parsing, information structural and coreference annotation, and Rhetorical Structure Theory analysis. Layers are inspected for annotation quality and together they coalesce to form a richly annotated corpus that can be used to study the interactions between different levels of linguistic description. The evaluation gives an indication of the expected quality of a corpus created by students with relatively little training. A multifactorial example study on lexical NP coreference likelihood is also presented, which illustrates some applications of the corpus. The results of this project show that high quality, richly annotated resources can be created effectively as part of a linguistics curriculum, opening new possibilities not just for research, but also for corpora in linguistics pedagogy.
In experimental contexts, affect-related word lists have been widely applied when examining how cognitive processes interact with emotional processes. These lists, however, present limitations when studying the relation between emotion and cognitive processes such as time and number processing because affective words do not inherently contain time or quantity information. Live events, in contrast, are experienced by an observer and therefore inherently carry affect information. Unfortunately, existing life-event lists and inventories have been largely applied within clinical contexts as diagnostic tools, and therefore are not suitable for many experimental contexts because they do not contain a balanced number of reliably positive, negative, and neutral life events. In Experiment 1, we create a standardized affect-related life-events list with 171 positive, negative, and neutral affect-related life events. In Experiment 2, we show that strength of affect and significance of the event are integral dimensions, suggesting that these two features are difficult to separate perceptually. The implications of these findings and some potential future applications of the created life-events list are discussed.
This paper presents the Stars2 corpus of definite descriptions for referring expression generation (REG). The corpus was produced in collaborative communication involving speaker-hearer pairs, and includes situations of reference that are arguably under-represented in similar work. Stars2 is intended as an incremental contribution to the research in REG and related fields, and it may be used both as training/test data for algorithms of this kind, and also to gain further insights into reference phenomena in general, with a particular focus on the issue of attribute choice in referential overspecification.
Given the vast amounts of data available in digitised textual form, it is important to provide mechanisms that allow users to extract nuggets of relevant information from the ever growing volumes of potentially important documents. Text mining techniques can help, through their ability to automatically extract relevant event descriptions, which link entities with situations described in the text. However, correct and complete interpretation of these event descriptions is not possible without considering additional contextual information often present within the surrounding text. This information, which we refer to as meta-knowledge, can include (but is not restricted to) the modality, subjectivity, source, polarity and specificity of the event. We have developed a meta-knowledge annotation scheme specifically tailored for news events, which includes six aspects of event interpretation. We have applied this annotation scheme to the ACE 2005 corpus, which contains 599 documents from various written and spoken news sources. We have also identified and annotated the words and phrases evoking the different types of meta-knowledge. Evaluation of the annotated corpus shows high levels of inter-annotator agreement for five meta-knowledge attributes, and moderate level of agreement for the sixth attribute. Detailed analysis of the annotated corpus has revealed further insights into the expression mechanisms of different types of meta-knowledge, their relative frequencies and mutual correlations.
In the vast literature exploring learning, many studies have used paired-associate stimuli, despite the fact that real-world learning involves many different types of information. One of the most popular materials used in studies of learning has been a set of Swahili-English word pairs for which Nelson and Dunlosky (Memory 2; 325-335, 1994) published recall norms two decades ago. These norms involved use of the Swahili words as cues to facilitate recall of the English translation. It is unclear whether cueing in the opposite direction (from English to Swahili) would lead to symmetric recall performance. Bilingual research has suggested that translation in these two different directions involves asymmetric links that may differentially impact recall performance, depending on which language is used as the cue (Kroll {\&} Stewart, Journal of Memory and Language 33; 149-174,1994). Moreover, the norms for these and many other learning stimuli have typically been gathered from college students. In the present study, we report recall accuracy and response time norms for Swahili words when they are cued by their English translations. We also report norms for a companion set of fact stimuli that may be used along with the Swahili-English word pairs to assess learning on a broader scale across different stimulus materials. Data were collected using Amazon's Mechanical Turk to establish a sample that was diverse in both age and ethnicity. These different, but related, stimulus sets will be applicable to studies of learning, metacognition, and memory in diverse samples.
As the cognitive neuroscience of metaphor has evolved, so too have the theoretical questions of greatest interest. To keep pace with these developments, in the present study we generated a large set of metaphoric and literal sentence pairs ideally suited to addressing the current methodological and conceptual needs of metaphor researchers. In particular, the need has emerged to distinguish metaphors along three dimensions: the grammatical class of their base terms, the sensorimotor features of their base terms, and the syntactic form in which the base terms appear. To meet this need, we generated nominal metaphors (and matched literal sentences) using entity nouns as the base terms, with the intention that they be used in concert with already published sets of predicate metaphors or nominal metaphors using event nouns. Using the results of three norming experiments, we provide 120 pairs of closely matched metaphoric and literal sentences that are characterized along 14 dimensions: 11 at the sentence level (length, frequency, concreteness, familiarity, naturalness, imageability, figurativeness, interpretability, ease of interpretation, valence, and valence judgment reaction time), and three related to the base term (visual, motion, and auditory imagery). These items extend previously published stimuli, filling an extant gap in metaphor research and allowing for tests of new behavioral and neural hypotheses about metaphor.
Our visual environment is not random, but follows compositional rules according to what objects are usually found where. Despite the growing interest in how such semantic and syntactic rules - a scene grammar - enable effective attentional guidance and object perception, no common image database containing highly-controlled object-scene modifications has been publically available. Such a database is essential in minimizing the risk that low-level features drive high-level effects of interest, which is being discussed as possible source of controversial study results. To generate the first database of this kind - SCEGRAM - we took photographs of 62 real-world indoor scenes in six consistency conditions that contain semantic and syntactic (both mild and extreme) violations as well as their combinations. Importantly, always two scenes were paired, so that an object was semantically consistent in one scene (e.g., ketchup in kitchen) and inconsistent in the other (e.g., ketchup in bathroom). Low-level salience did not differ between object-scene conditions and was generally moderate. Additionally, SCEGRAM contains consistency ratings for every object-scene condition, as well as object-absent scenes and object-only images. Finally, a cross-validation using eye-movements replicated previous results of longer dwell times for both semantic and syntactic inconsistencies compared to consistent controls. In sum, the SCEGRAM image database is the first to contain well-controlled semantic and syntactic object-scene inconsistencies that can be used in a broad range of cognitive paradigms (e.g., verbal and pictorial priming, change detection, object identification, etc.) including paradigms addressing developmental aspects of scene grammar. SCEGRAM can be retrieved for research purposes from http://www.scenegrammarlab.com/research/scegram-database/ .
[Correction Notice: An Erratum for this article was reported in Vol 49(3) of Behavior Research Methods (see record 2017-21867-030). In the original article, there was an error in the second sentence of Footnote 3 on page 175. The corrected sentence is present in the erratum.] The body-shape-related stimuli used in most body-image studies have several limitations (e.g., a lack of pilot validation procedures and the use of non-body-shape-related control/neutral stimuli). We therefore developed a database of 61 computer-generated body-only pictures of women, wherein bodies were methodically manipulated in terms of fatness versus thinness. Eighty-two young women assessed the pictures' attractiveness, beauty, harmony (valence ratings), and body shape (assessed on a thinness/fatness axis), providing normative data for valence and body shape ratings. First, stimuli manipulated for fatness versus thinness conveyed comparable emotional intensities regarding the valence and body shape ratings. Second, different subcategories of stimuli were obtained on the basis of variations in body shape and valence judgments. Fat and thin bodies were distributed into several subcategories depending on their valence ratings, and a subcategory containing stimuli that were neutral in terms of valence and body shape was identified. Interestingly, at a descriptive level, the thinness/fatness manipulations of the bodies were in a curvilinear relationship with the valence ratings: Thin bodies were not only judged as positive, but also as negative when their estimated body mass indexes (BMIs) decreased too much. Finally, convergent validity was assessed by exploring the impacts of body-image-related variables (BMI, thin-ideal internalization, and body dissatisfaction) on participants' judgments of the bodies. Valence judgments, but not body shape judgments, were influenced by the participants' levels of thin-ideal internalization and body dissatisfaction. Participants' BMIs did not significantly influence their judgments. Given these findings, this database contains relevant material that can be used in various fields, primarily for studies of body-image disturbance or eating disorders. (PsycINFO Database Record (c) 2017 APA, all rights reserved) (Source: journal abstract)
Age of acquisition (AoA) is an important variable in word recognition research. Up to now, nearly all psychology researchers examining the AoA effect have used ratings obtained from adult participants. An alternative basis for determining AoA is directly testing children's knowledge of word meanings at various ages. In educational research, scholars and teachers have tried to establish the grade at which particular words should be taught by examining the ages at which children know various word meanings. Such a list is available from Dale and O'Rourke's (1981) Living Word Vocabulary for nearly 44 thousand meanings coming from over 31 thousand unique word forms and multiword expressions. The present article relates these test-based AoA estimates to lexical decision times as well as to AoA adult ratings, and reports strong correlations between all of the measures. Therefore, test-based estimates of AoA can be used as an alternative measure.
This article provides semantic differential ratings of 1,469 concepts in Bengali, a language spoken by about 250 million individuals in eastern India and Bangladesh. These data were collected from 20 male and 20 female Calcutta respondents who rated stimuli on three culturally universal affective dimensions: evaluation–potency–activity (EPA). This study employs pan-respondent component analyses as a means of examining the respondents' usage of the standard EPA scales. The pan-respondent component analyses indicate that some respondents used the rating scales in unexpected ways, recording their feelings about one component of concepts' EPA with ratings on a scale intended to measure a different dimension. When scores were based only on respondents who used the scales appropriately, several interesting patterns were found. For respondents of both genders, potency scores have a curvilinear relation with evaluation, such that very good and very bad concepts are mostly seen as very potent, whereas evaluatively neutral concepts are seen as somewhat impotent or just slightly potent. A moderate linear correlation exists between activity and evaluation, and a modest positive relation exists between potency and activity. Gender correlations are high on evaluation, .93, but much lower for potency scores, with a correlation of .55, and even lower for activity, .30. In this article we examine several explanations for why scales denoting potency and activity were reinterpreted as indicating goodness by certain respondents, and consider the matter of including data collected from respondents who used scales in this way.
In everyday social interactions, people's facial expressions sometimes reflect genuine emotion (e.g., anger in response to a misbehaving child) and sometimes do not (e.g., smiling for a school photo). There is increasing theoretical interest in this distinction, but little is known about perceived emotion genuineness for existing facial expression databases. We present a new method for rating perceived genuineness using a neutral-midpoint scale (-7 = completely fake; 0 = don't know; +7 = completely genuine) that, unlike previous methods, provides data on both relative and absolute perceptions. Normative ratings from typically developing adults for five emotions (anger, disgust, fear, sadness, and happiness) provide three key contributions. First, the widely used Pictures of Facial Affect (PoFA; i.e., "the Ekman faces") and the Radboud Faces Database (RaFD) are typically perceived as not showing genuine emotion. Also, in the only published set for which the actual emotional states of the displayers are known (via self-report; the McLellan faces), percepts of emotion genuineness often do not match actual emotion genuineness. Second, we provide genuine/fake norms for 558 faces from several sources (PoFA, RaFD, KDEF, Gur, FacePlace, McLellan, News media), including a list of 143 stimuli that are event-elicited (rather than posed) and, congruently, perceived as reflecting genuine emotion. Third, using the norms we develop sets of perceived-as-genuine (from event-elicited sources) and perceived-as-fake (from posed sources) stimuli, matched on sex, viewpoint, eye-gaze direction, and rated intensity. We also outline the many types of research questions that these norms and stimulus sets could be used to answer.
Idiomatic expressions such as kick the bucket or go down a storm can differ on a number of internal features, such as familiarity, meaning, literality, and decomposability, and these types of features have been the focus of a number of normative studies. In this article, we provide normative data for a set of Bulgarian idioms and their English translations, and by doing so replicate in a Slavic language the relationships between the ratings previously found in Romance and Germanic languages. Additionally, we compared whether collecting these types of ratings in between-subjects or within-subjects designs affects the data and the conclusions drawn, and found no evidence that design type affects the final outcome. Finally, we present the results of a meta-analysis that summarizes the relationships found across the literature. As in many previous individual studies, we found that familiarity correlates with a number of other features; however, such studies have shown conflicting results concerning literality and decomposability ratings. The meta-analysis revealed reliable relationships of decomposability with a number of other measures, such as familiarity, meaning, and predictability. Conversely, literality was shown to have little to no relationship with any of the other subjective ratings. The implications for these relationships in the context of the wider experimental literature are discussed, with a particular focus on the importance of attaining familiarity ratings for each sample of participants in experimental work.
The Developmental Emotional Faces Stimulus Set (DEFSS) is designed to provide a standardized set of emotional stimuli that includes both child and adult faces, and that has been validated by participants across a wide range of ages. This article describes the creation and validation of the DEFSS, which includes 404 validated facial photographs of people between 8 and 30 years old displaying 5 different emotional expressions: happy, angry, fearful, sad, and neutral. The emotions in all photographs were identified correctly by 86{\%} of raters (minimum 55{\%}), and validity did not vary as a function of the age group of the model nor that of the raters, indicating that the pictures are equally appropriate for use across the entire age range. Strengths and limitations of the DEFFS are discussed.
The Swiss avalanche bulletin is produced twice a day in four languages. Due to the lack of time available for manual translation, a fully automated translation system is employed, based on a catalogue of predefined phrases and predetermined rules of how these phrases can be combined to produce sentences. Because this catalogue of phrases is limited to a small sublanguage, the system is able to automatically translate such sentences from German into the target languages French, Italian and English without subsequent proofreading or correction. Having been operational for two winter seasons, we assess here the quality of the produced texts based on two different surveys where participants rated texts from real avalanche bulletins from both origins, the catalogue of phrases versus manually written and translated texts. With a mean recognition rate of 55 %, users can hardly distinguish between the two types of texts, and give very similar ratings with respect to their language quality. Overall, the output from the catalogue system can be considered virtually equivalent to a text written by avalanche forecasters and then manually translated by professional translators. Furthermore, forecasters declared that all relevant situations were captured by the system with sufficient accuracy. Forecaster’s working load did not change with the introduction of the catalogue: the extra time to find matching sentences is compensated by the fact that they no longer need to double-check manually translated texts. The reduction of daily translation costs is expected to offset the initial development costs within a few years.
Sentiment analysis allows the semantic evaluation of pieces of text according to the expressed sentiments and opinions. While considerable attention has been given to the polarity (positive, negative) of English words, only few studies were interested in the conveyed emotions (joy, anger, surprise, sadness, etc.) especially in other languages. In this paper, we present the elaboration and the evaluation of a new French lexicon considering both polarity and emotion. The elaboration method is based on the semi-automatic translation and expansion to synonyms of the English NRC Word Emotion Association Lexicon (NRC-EmoLex). First, online translators have been automatically queried in order to create a first version of our new French Expanded Emotion Lexicon (FEEL). Then, a human professional translator manually validated the automatically obtained entries and the associated emotions. She agreed with more than 94 % of the pre-validated entries (those found by a majority of translators) and less than 18 % of the remaining entries (those found by very few translators). This result highlights that online tools can be used to get high quality resources with low cost. Annotating a subset of terms by three different annotators shows that the associated sentiments and emotions are consistent. Finally, extensive experiments have been conducted to compare the final version of FEEL with other existing French lexicons. Various French benchmarks for polarity and emotion classifications have been used in these evaluations. Experiments have shown that FEEL obtains competitive results for polarity, and significantly better results for basic emotions.
Databases containing lexical properties on any given orthography are crucial for psycholinguistic research. In the last ten years, a number of lexical databases have been developed for Greek. However, these lack important part-of-speech information. Furthermore, the need for alternative procedures for calculating syllabic measurements and stress information, as well as combination of several metrics to investigate linguistic properties of the Greek language are highlighted. To address these issues, we present a new extensive lexical database of Modern Greek (GreekLex 2) with part-of-speech information for each word and accurate syllabification and orthographic information predictive of stress, as well as several measurements of word similarity and phonetic information. The addition of detailed statistical information about Greek part-of-speech, syllabification, and stress neighbourhood allowed novel analyses of stress distribution within different grammatical categories and syllabic lengths to be carried out. Results showed that the statistical preponderance of stress position on the pre-final syllable that is reported for Greek language is dependent upon grammatical category. Additionally, analyses showed that a proportion higher than 90{\%} of the tokens in the database would be stressed correctly solely by relying on stress neighbourhood information. The database and the scripts for orthographic and phonological syllabification as well as phonetic transcription are available at http://www.psychology.nottingham.ac.uk/greeklex/.
Differences between norm ratings collected when participants are asked to consider more than one picture characteristic are contrasted with the traditional methodological approaches of collecting ratings separately for image constructs. We present data that suggest that reporting normative data, based on methodological procedures that ask participants to consider multiple image constructs simultaneously, could potentially confounded norm data. We provide data for two new image constructs, beauty and the extent to which participants encountered the stimuli in their everyday lives. Analysis of this data suggests that familiarity and encounter are tapping different image constructs. The extent to which an observer encounters an object predicts human judgments of visual complexity. Encountering an image was also found to be an important predictor of beauty, but familiarity with that image was not. Taken together, these results suggest that continuing to collect complexity measures from human judgments is a pointless exercise. Automated measures are more reliable and valid measures, which are demonstrated here as predicting human preferences.
The two main theoretical accounts of the human affective space are the dimensional perspective and the discrete-emotion approach. In recent years, several affective norms have been developed from a dimensional perspective, including ratings for valence and arousal. In contrast, the number of published datasets relying on the discrete-emotion approach is much lower. There is a need to fill this gap, considering that discrete emotions have an effect on word processing above and beyond those of valence and arousal. In the present study, we present ratings from 1,380 participants for a set of 2,266 Spanish words in five discrete emotion categories: happiness, anger, fear, disgust, and sadness. This will be the largest dataset published to date containing ratings for discrete emotions. We also present, for the first time, a fine-grained analysis of the distribution of words into the five emotion categories. This analysis reveals that happiness words are the most consistently related to a single, discrete emotion category. In contrast, there is a tendency for many negative words to belong to more than one discrete emotion. The only exception is disgust words, which overlap least with the other negative emotions. Normative valence and arousal data already exist for all of the words included in this corpus. Thus, the present database will allow researchers to design studies to contrast the predictions of the two most influential theoretical perspectives in this field. These studies will undoubtedly contribute to a deeper understanding of the effects of emotion on word processing.
This paper describes the development of a multilingual and multigenre manually annotated speech dataset, freely available to the research community as ground truth for the evaluation of automatic transcription systems and spoken language translation systems. The dataset includes two video genres—television broadcast news and talk-shows—and covers Flemish, English, German, and Italian, for a total of about 35 h of television speech. Besides segmentation and orthographic transcription, we added a very rich annotation on the audio signal, both at the linguistic level (e.g. filled pauses, pronunciation errors, disfluencies, speech in a foreign language) and at the acoustic level (e.g. background noise and different types of non-speech events). Furthermore, a subset of the transcriptions is translated in four directions, namely Flemish to English, German to English, German to Italian and English to Italian. The development of this dataset was organized in several phases, relying on expert transcribers as well as involving non-expert contributors through crowdsourcing. We first conducted a feasibility study to test and compare two methods for crowdsourcing speech transcription on broadcast news data. These methods are based on different transcription processes (i.e. parallel vs. iterative) and incorporate two different quality control mechanisms. With both methods, we achieved near-expert transcription quality—in terms of word error rate—for English, German and Italian data. Instead, for Flemish data we were not able to get a sufficient response from the crowd to complete the offered transcription tasks. The results obtained demonstrate that the viability of methods for crowdsourcing speech transcription significantly depends on the target language. This paper provides a detailed comparison of the results obtained with the two crowdsourcing methods tested, describes the main characteristics of the final ground truth resource created as well as the methodology adopted, and the guidelines prepared for its development.
The Algerian linguistic situation is very intricate due to the ethnic, geographical and colonial occupation influences which have lead to a complex sociolinguistic environment. As a result of the contact between different languages and accents, the Algerian speech community has acquired a distinctive sociolinguistic situation. In addition to the intra- and inter- lingual variations describing day-to-day linguistic behavior of the Algerian speakers, their speech is characterized by the presence of many linguistic phenomena such as bilingualism and code switching. The study of automatic regional accent recognition in such a type of environment is a new idea in the field of automatic languages, dialect and accent recognition especially that previous studies were conducted using monolingual evaluation data. The assessment of the effectiveness of GMM-UBM and i-vectors frameworks for accent recognition approaches through the use of the Algerian Modern Colloquial Arabic Speech Corpus (AMCASC), which is a linguistic resource collected for this purpose, shows that not only the recording conditions mismatch, channels mismatch, recordings length mismatch and the amplitude clipping which have a non-desirable effect on the effectiveness of these acoustic approaches but also language contact phenomena are other perturbation sources which should be taken into consideration especially in real life applications .
Text reuse is the act of borrowing text from existing documents to create new texts. Freely available and easily accessible large online repositories are not only making reuse of text more common in society but also harder to detect. A major hindrance in the development and evaluation of existing/new mono-lingual text reuse detection methods, especially for South Asian languages, is the unavailability of standardized benchmark corpora. Amongst other things, a gold standard corpus enables researchers to directly compare existing state-of-the-art methods. In our study, we address this gap by developing a benchmark corpus for one of the widely spoken but under resourced languages i.e. Urdu. The COrpus of Urdu News TExt Reuse (COUNTER) corpus contains 1200 documents with real examples of text reuse from the field of journalism. It has been manually annotated at document level with three levels of reuse: wholly derived, partially derived and non derived. We also apply a number of similarity estimation methods on our corpus to show how it can be used for the development, evaluation and comparison of text reuse detection systems for the Urdu language. The corpus is a vital resource for the development and evaluation of text reuse detection systems in general and specifically for Urdu language.
Synesthesia is a neurological phenomenon in which certain types of stimuli elicit involuntary perceptions in an unrelated pathway. A common type of synesthesia is grapheme–color synesthesia, in which the visual perception of letters and numbers stimulates the perception of a specific color. Previous studies have often collected relatively small numbers of grapheme–color associations per synesthete, but the accumulation of a large quantity of data has greater promise for uncovering the mechanisms underlying synesthetic association. In this study, we therefore collected large samples of data from a total of eight synesthetes. All told, we obtained over 1000 synesthetic colors associated with Japanese kanji characters from each of two synesthetes, over 100 synesthetic colors form each of three synesthetes, and about 80 synesthetic colors associated with Japanese hiragana, Latin letters, and Arabic numerals from each of three synesthetes. We then compiled the data into a database, called the KANJI-Synesthetic Colors Database (K-SCD), which has a total of 5122 colors for 483, 46, and 46 Japanese kanji, hiragana, and katakana characters, respectively, as well as for 26 Latin letters and ten Arabic numerals. In addition to introducing the K-SCD, this article demonstrates the database's merits by using two examples, in which two new rules for synesthetic association, “shape similarity” and “synesthetic color clustering,” were found. The K-SCD is publicly accessible (www.cv.jinkan.kyoto-u.ac.jp/site/uploads/K-SCD.xlsm) and will be a valuable resource for those who wish to conduct statistical analyses using a rich dataset in order to uncover the rules governing synesthetic association and to understand its mechanisms.
This paper introduces a multi-layer corpus architecture with multiple tokenizations using the open source historical, diachronic corpus of German called Register in Diachronic German Science. The corpus contains herbal texts printed between the fifteenth and nineteenth centuries and is concerned with the development of a German scientific register, independent of Latin. We will discuss difficulties of transcribing, normalizing and annotating historical texts and will thereby argue for the advantages of multiple layers and multiple tokenizations. A virtually infinite number of annotations can be added to the corpus, without the need for deciding between or discarding interpretations. Thus, this flexible architecture enables multiple normalizations and types of annotation and is open to a wide range of research questions in the humanities. We provide case studies concerning the exploitation of our different normalizations as well as structural, register-specific and linguistic annotations. The corpus architecture allows for its reuse as a resource for corpus-based research approaches.
Building on the success of the VU Amsterdam Metaphor Corpus, which comprises English texts annotated with metaphor following the Metaphor Identification Procedure Vrjie Universiteit (MIPVU; Steen et al. in Cogn Linguist 21(4):765–796, 2010a; Steen et al. in A method for linguistic metaphor identification: from MIP to MIPVU. John Benjamins, Amsterdam/Philadelphia, 2010b), this study has three aims: (1) to adapt and evaluate the transferability and reliability of MIPVU for Mandarin Chinese; (2) to construct a corpus of Chinese texts annotated for metaphor using the adapted procedure; and (3) to examine the distribution of metaphor-related words across Chinese texts in three different written registers: academic discourse, fiction, and news. The results of our inter-annotator reliability test show that MIPVU can be reliably applied to linguistic metaphor identification in Chinese texts. Our metaphor-annotated corpus consists of texts randomly sampled from the Lancaster Corpus of Mandarin Chinese, totaling 30,012 words (about 10,000 for each register). Data analysis reveals that approximately one out of every nine lexical units in our Chinese corpus is related to metaphor, that there is considerable variation in metaphor density across different registers and lexical categories, and that metaphor density is significantly lower in Chinese than in English texts. Our assessment of the replicability of MIPVU for Mandarin Chinese adds to the groundbreaking methodological contribution that Steen et al. (2010a, b) has made to metaphor research. The metaphor-annotated corpus of Mandarin Chinese contributes a valuable language resource for Chinese metaphor researchers, and our analysis of the distribution of metaphor-related words in the corpus offers useful new insights into the extent and use of metaphor in Chinese discourse.
The Princeton WordNet® (PWN) is a widely used lexical knowledge database for semantic information processing. There are now many wordnets under creation for languages worldwide. In this paper, we endeavor to construct a wordnet for Pre-Qin ancient Chinese (PQAC), called PQAC WordNet (PQAC-WN), to process the semantic information of PQAC. In previous work, most recently constructed wordnets have been established either manually by experts or automatically using resources from which translation pairs between English and the target language can be extracted. The former method, however, is time-consuming, and the latter method, owing to a lack of language resources, cannot be performed on PQAC. As a result, a method based on word definitions in a monolingual dictionary is proposed. Specifically, for each sense, kernel words are first extracted from its definition, and the senses of each kernel word are then determined by graph-based Word Sense Disambiguation. Finally, one optimal sense is chosen from the kernel word senses to guide the mapping between the word sense and PWN synset. In this research, we obtain 66 % PQAC senses that can be shared with English and another 14 % language-specific senses that were added to PQAC-WN as new synsets. Overall, the automatic mapping achieves a precision of over 85 %.
Silent-letter endings are often claimed to be a major source of inconsistency in the French orthography. In this report, we introduce Silex, a database designed to facilitate the study of spelling performance in general, and silent-letter endings in particular. It was derived from two large and recent corpora based on child- and adult-targeted material. Silex consists of three kinds of Excel workbooks: a set of Stimuli Selector workbooks that allow researchers to select words based on a variety of statistics and word characteristics; a Table Generator workbook that allows researchers to build consistency distribution tables by selecting specific phonological or orthographic units; and a Master File workbook, from which all statistics were derived, and that allows researchers to compute other statistics. Silex is different from existing databases in the manner that silent-letter endings were coded and how consistency indices were computed. Importantly, Silex provides unconditional- and conditional-consistency indices for silent-letter endings. To demonstrate the utility of Silex, we first described the silent-letter phenomenon in French. We found that, at minimum, 28 {\%} of French words end with a silent letter. Moreover, silent-letter endings are usually t, e, s, x, or d, and the occurrence of these letters is conditioned by the phonological ending of words. Second, we showed how Silex could prove useful for the development of theoretical models and for empirical studies. The novel information provided in Silex as well as the flexibility of this database should enable researchers to advance our understanding of developing and skilled spelling performance.
Most experimental research making use of the Japanese language has involved the 1945 officially standardized kanji (Japanese logographic characters) in the Jōyō kanji list (originally announced by the Japanese government in 1981). However, this list was extensively modified in 2010: five kanji were removed and 196 kanji were added; the latest revision of the list now has a total of 2136 kanji. Using an up-to-date corpus consisting of 11 years' worth of articles printed in the Mainichi Newspaper (2000-2010), we have constructed two novel databases that can be used in psychological research using the Japanese language: (1) a database containing a wide variety of properties on the latest 2136 Jōyō kanji, and (2) a novel database containing 27,950 two-kanji compound words (or jukugo). Based on these two databases, we have created an interactive website ( www.kanjidatabase.com ) to retrieve and store linguistic information to be used in psychological and linguistic experiments. The present paper reports the most important characteristics for the new databases, as well as their value for experimental psychological and linguistic research
Written symbols such as letters have been extensively used in cognitive psychology, be it to understand their contribution to written word recognition or to examine processes involved in other mental functions. Sometimes, however, researchers want to manipulate letters while removing their associated characteristics. A powerful solution to do so is to use new characters, devised to be highly similar to letters, but without associated sound or name. Given the growing use of artificial characters in experimental paradigms, the aim of the present study was to make available the Brussels Artificial Character Sets (BACS), two full, strictly controlled, and portable sets of artificial characters for a broad range of experimental situations.
WordNet, a large lexical database of English, was conceived as a model of human semantic organization. Evidence from timing experiments, association norms, and distributional properties of words supported a semantic network model in which words are interlinked via a small number of lexical and conceptual relations. Its large coverage and unique structure, which allows automatic systems to detect and quantify semantic relatedness among words, soon made WordNet an invaluable tool for natural language processing tasks. Information retrieval, document summarization, and machine translation crucially require word sense discrimination and disambiguation. Wordnets have been built in dozens of languages and for specific technical sublanguages, and the number of applications in research, language technology and pedagogy has grown. Although WordNet’s central focus has shifted from its psycholinguistic origins, its design, based on theories about the structure of the human mental lexicon, is validated as a sound approach to representing the meanings of words. (PsycINFO Database Record (c) 2019 APA, all rights reserved)
Perceptual information is important for the meaning of nouns. We present modality exclusivity norms for 485 Dutch nouns rated on visual, auditory, haptic, gustatory, and olfactory associations. We found these nouns are highly multimodal. They were rated most dominant in vision, and least in olfaction. A factor analysis identified two main dimensions: one loaded strongly on olfaction and gustation (reflecting joint involvement in flavor), and a second loaded strongly on vision and touch (reflecting joint involvement in manipulable objects). In a second study, we validated the ratings with similarity judgments. As expected, words from the same dominant modality were rated more similar than words from different dominant modalities; but – more importantly – this effect was enhanced when word pairs had high modality strength ratings. We further demonstrated the utility of our ratings by investigating whether perceptual modalities are differentially experienced in space, in a third study. Nouns were categorized into their dominant modality and used in a lexical decision experiment where the spatial position of words was either in proximal or distal space. We found words dominant in olfaction were processed faster in proximal than distal space compared to the other modalities, suggesting olfactory information is mentally simulated as “close” to the body. Finally, we collected ratings of emotion (valence, dominance, and arousal) to assess its role in perceptual space simulation, but the valence did not explain the data. So, words are processed differently depending on their perceptual associations, and strength of association is captured by modality exclusivity ratings.
In languages where the position of lexical stress within a word is not predictable from print, readers rely on distributional information extracted from the lexicon in order to assign stress. Lexical databases are thus especially important for researchers willing to address stress assignment in those languages. Here we present Q2Stress, a new database aimed to fill the lack of such a resource for Italian. Q2Stress includes multiple cues readers may use in assigning stress, such as type and token frequency of stress patterns as well as their distribution with respect to number of syllables, grammatical category, word beginnings, word endings, and consonant-vowel structures. Furthermore, for the first time, data for both adults and children are available. Q2Stress may help researchers to answer empirical as well as theoretical questions about stress assignment and stress-related issues, and more in general, to explore the orthography-to-phonology relation in reading. Q2Stress is designed as a user-friendly resource, as it comes with scripts allowing researchers to explore and select their own stimuli according to several criteria as well as summary tables for overall data analysis.
textcopyright} 2016 Psychonomic Society, Inc.Using a megastudy approach, we developed a database of lexical variables and lexical decision reaction times and accuracy rates for more than 25,000 traditional Chinese two-character compound words. Each word was responded to by about 33 native Cantonese speakers in Hong Kong. This resource provides a valuable adjunct to influential mega-databases, such as the Chinese single-character, English, French, and Dutch Lexicon Projects. Three analyses were conducted to illustrate the potential uses of the database. First, we compared the proportion of variance in lexical decision performance accounted for by six word frequency measures and established that the best predictor was Cai and Brysbaert's (PLoS One, 5, e10729, 2010) contextual diversity subtitle frequency. Second, we ran virtual replications of three previously published lexical decision experiments and found convergence between the original experiments and the present megastudy. Finally, we conducted item-level regression analyses to examine the effects of theoretically important lexical variables in our normative data. This is the first publicly available large-scale repository of behavioral responses pertaining to Chinese two-character compound word processing, which should be of substantial interest to psychologists, linguists, and other researchers.
Sublexical phonotactic regularities in language have a major impact on language development, as well as on speech processing and production throughout the entire lifespan. To understand the impact of phonotactic regularities on speech and language functions at the behavioral and neural levels, it is essential to have access to oral language corpora to study these complex phenomena in different languages. Yet, probably because of their complexity, oral language corpora remain less common than written language corpora. This article presents the first corpus and database of spoken Quebec French syllables and phones: SyllabO+. This corpus contains phonetic transcriptions of over 300,000 syllables (over 690,000 phones) extracted from recordings of 184 healthy adult native Quebec French speakers, ranging in age from 20 to 97 years. To ensure the representativeness of the corpus, these recordings were made in both formal and familiar communication contexts. Phonotactic distributional statistics (e.g., syllable and co-occurrence frequencies, percentages, percentile ranks, transition probabilities, and pointwise mutual information) were computed from the corpus. An open-access online application to search the database was developed, and is available at www.speechneurolab.ca/syllabo . In this article, we present a brief overview of the corpus, as well as the syllable and phone databases, and we discuss their practical applications in various fields of research, including cognitive neuroscience, psycholinguistics, neurolinguistics, experimental psychology, phonetics, and phonology. Nonacademic practical applications are also discussed, including uses in speech-language pathology.
With the Developmental Lexicon Project (DeveL), we present a large-scale study that was conducted to collect data on visual word recognition in German across the lifespan. A total of 800 children from Grades 1 to 6, as well as two groups of younger and older adults, participated in the study and completed a lexical decision and a naming task. We provide a database for 1,152 German words, comprising behavioral data from seven different stages of reading development, along with sublexical and lexical characteristics for all stimuli. The present article describes our motivation for this project, explains the methods we used to collect the data, and reports analyses on the reliability of our results. In addition, we explored developmental changes in three marker effects in psycholinguistic research: word length, word frequency, and orthographic similarity. The database is available online.
Semantic feature norms (e.g., STIMULUS: car → RESPONSE: <has four wheels>) are commonly used in cognitive psychology to look into salient aspects of given concepts. Semantic features are typically collected in experimental settings and then manually annotated by the researchers into feature types (e.g., perceptual features, taxonomic features, etc.) by means of content analyses—that is, by using taxonomies of feature types and having independent coders perform the annotation task. However, the ways in which such content analyses are typically performed and reported are not consistent across the literature. This constitutes a serious methodological problem that might undermine the theoretical claims based on such annotations. In this study, we first offer a review of some of the released datasets of annotated semantic feature norms and the related taxonomies used for content analysis. We then provide theoretical and methodological insights in relation to the content analysis methodology. Finally, we apply content analysis to a new dataset of semantic features and show how the method should be applied in order to deliver reliable annotations and replicable coding schemes. We tackle the following issues: (1) taxonomy structure, (2) the description of categories, (3) coder training, and (4) sustainability of the coding scheme—that is, comparison of the annotations provided by trained versus novice coders. The outcomes of the project are threefold: We provide methodological guidelines for semantic feature classification; we provide a revised and adapted taxonomy that can (arguably) be applied to both concrete and abstract concepts; and we provide a dataset of annotated semantic feature norms.
As human activity and interaction increasingly take place online, the digital residues of these activities provide a valuable window into a range of psychological and social processes. A great deal of progress has been made toward utilizing these opportunities; however, the complexity of managing and analyzing the quantities of data currently available has limited both the types of analysis used and the number of researchers able to make use of these data. Although fields such as computer science have developed a range of techniques and methods for handling these difficulties, making use of those tools has often required specialized knowledge and programming experience. The Text Analysis, Crawling, and Interpretation Tool (TACIT) is designed to bridge this gap by providing an intuitive tool and interface for making use of state-of-the-art methods in text analysis and large-scale data management. Furthermore, TACIT is implemented as an open, extensible, plugin-driven architecture, which will allow other researchers to extend and expand these capabilities as new methods become available.
The main purpose of this study was to report age-based subjective age-of-acquisition (AoA) norms for 600 Turkish words. A total of 115 children, 100 young adults, 115 middle-aged adults, and 127 older adults provided AoA estimates for 600 words on a 7-point scale. The intraclass correlations suggested high reliability, and the AoA estimates were highly correlated across the four age groups. Children gave earlier AoA estimates than the three adult groups; this was true for high-frequency as well as low-frequency words. In addition to the means and standard deviations of the AoA estimates, we report word frequency, concreteness, and imageability ratings, as well as word length measures (numbers of syllables and letters), for the 600 words as supplemental materials. The present ratings represent a potentially useful database for researchers working on lexical processing as well as other aspects of cognitive processing, such as autobiographical memory.
This article presents K-SPAN (Korean Surface Phonetics and Neighborhoods), a database of surface phonetic forms and several measures of phonological neighborhood density for 63,836 Korean words. Currently publicly available Korean corpora are limited by the fact that they only provide orthographic representations in Hangeul, which is problematic since phonetic forms in Korean cannot be reliably predicted from orthographic forms. We describe the method used to derive the surface phonetic forms from a publicly available orthographic corpus of Korean, and report on several statistics calculated using this database; namely, segment unigram frequencies, which are compared to previously reported results, along with segment-based and syllable-based neighborhood density statistics for three types of representation: an "orthographic" form, which is a quasi-phonological representation, a "conservative" form, which maintains all known contrasts, and a "modern" form, which represents the pronunciation of contemporary Seoul Korean. These representations are rendered in an ASCII-encoded scheme, which allows users to query the corpus without having to read Korean orthography, and permits the calculation of a wide range of phonological measures.
Emotionally charged pictorial materials are frequently used in phobia research, but no existing standardized picture database is dedicated to the study of different phobias. The present work describes the results of two independent studies through which we sought to develop and validate this type of database-a Set of Fear Inducing Pictures (SFIP). In Study 1, 270 fear-relevant and 130 neutral stimuli were rated for fear, arousal, and valence by four groups of participants; small-animal (N = 34), blood/injection (N = 26), social-fearful (N = 35), and nonfearful participants (N = 22). The results from Study 1 were employed to develop the final version of the SFIP, which includes fear-relevant images of social exposure (N = 40), blood/injection (N = 80), spiders/bugs (N = 80), and angry faces (N = 30), as well as 726 neutral photographs. In Study 2, we aimed to validate the SFIP in a sample of spider, blood/injection, social-fearful, and control individuals (N = 66). The fear-relevant images were rated as being more unpleasant and led to greater fear and arousal in fearful than in nonfearful individuals. The fear images differentiated between the three fear groups in the expected directions. Overall, the present findings provide evidence for the high validity of the SFIP and confirm that the set may be successfully used in phobia research.
This article presents subjective rating norms for a new set of Stills And Videos of facial Expressions-the SAVE database. Twenty nonprofessional models were filmed while posing in three different facial expressions (smile, neutral, and frown). After each pose, the models completed the PANAS questionnaire, and reported more positive affect after smiling and more negative affect after frowning. From the shooting material, stills and 5 s and 10 s videos were edited (total stimulus set = 180). A different sample of 120 participants evaluated the stimuli for attractiveness, arousal, clarity, genuineness, familiarity, intensity, valence, and similarity. Overall, facial expression had a main effect in all of the evaluated dimensions, with smiling models obtaining the highest ratings. Frowning expressions were perceived as being more arousing, clearer, and more intense, but also as more negative than neutral expressions. Stimulus presentation format only influenced the ratings of attractiveness, familiarity, genuineness, and intensity. The attractiveness and familiarity ratings increased with longer exposure times, whereas genuineness decreased. The ratings in the several dimensions were correlated. The subjective norms of facial stimuli presented in this article have potential applications to the work of researchers in several research domains. From our database, researchers may choose the most adequate stimulus presentation format for a particular experiment, select and manipulate the dimensions of interest, and control for the remaining dimensions. The full stimulus set and descriptive results (means, standard deviations, and confidence intervals) for each stimulus per dimension are provided as supplementary material.
Emotions are highly influential to many psychological processes. Indeed, research employing emotional stimuli is rapidly escalating across the field of psychology. However, challenges remain regarding discrete evocation of frequently co-elicited emotions such as amusement and happiness, or anger and disgust. Further, as much contemporary work in emotion employs college students, we sought to additionally evaluate the efficacy of film clips to discretely elicit these more challenging emotions in a young adult population using an online medium. The internet is an important tool for investigating responses to emotional stimuli, but validations of emotionally evocative film clips across laboratory and web-based settings are limited in the literature. An additional obstacle is identifying stimuli amidst the numerous film clip validation studies. During our investigation, we recognized the lack of a categorical database to facilitate rapid identification of useful film clips for individual researchers' unique investigations. Consequently, here we also sought to produce the first compilation of such stimuli into an accessible and comprehensive catalog. We based our catalog upon prior work as well as our own, and identified 24 articles and 295 film clips from four decades of research. We present information on the validation of these clips in addition to our own research validating six clips using online administration settings. The results of our search in the literature and our own study are presented in tables designed to facilitate and improve a selection of highly valid film stimuli for future research.
The Affective Norms for Polish Short Texts (ANPST) dataset (Imbir, 2016d) is a list of 718 affective sentence stimuli with known affective properties with respect to subjectively perceived valence, arousal, dominance, origin, subjective significance, and source. This article examines the reliability of the ANPST and the impact of population type and sex on affective ratings. The ANPST dataset was introduced to provide a recognized method of eliciting affective states with linguistic stimuli more complex than single words and that included contextual information and thus are less ambiguous in interpretation than single word. Analysis of the properties of the ANPST dataset showed that norms collected are reliable in terms of split-half estimation and that the distributions of ratings are similar to those obtained in other affective norms studies. The pattern of correlations was the same as that found in analysis of an affective norms dataset for words based on the same six variables. Female psychology students' valence ratings were also more polarized than those of their female student peers studying other subjects, but arousal ratings were only higher for negative words. Differences also appeared for all other measured dimensions. Women's valence ratings were found to be more polarized and arousal ratings were higher than those made by men, and differences were also present for dominance, origin, and subjective significance. The ANPST is the first Polish language list of sentence stimuli and could easily be adapted for other languages and cultures.
FANchild (French Affective Norms for Children) provides norms of valence and arousal for a large corpus of French words (N = 720) rated by 908 French children and adolescents (ages 7, 9, 11, and 13). The ratings were made using the Self-Assessment Manikin (Lang, 1980). Because it combines evaluations of arousal and valence and includes ratings provided by 7-, 9-, 11-, and 13-year-olds, this database complements and extends existing French-language databases. Good response reliability was observed in each of the four age groups. Despite a significant level of consensus, we found age differences in both the valence and arousal ratings: Seven- and 9-year-old children gave higher mean valence and arousal ratings than did the other age groups. Moreover, the tendency to judge words positively (i.e., positive bias) decreased with age. This age- and sex-related database will enable French-speaking researchers to study how the emotional character of words influences their cognitive processing, and how this influence evolves with age. FANchild is available at https://www.researchgate.net/profile/Catherine{\_}Monnier/contributions.
Psycholinguistic research has been advanced by the development of word recognition megastudies. For instance, the English Lexicon Project (Balota et al., 2007) provides researchers with access to naming and lexical-decision latencies for over 40,000 words. In the present work, we extended the megastudy approach to a task that emphasizes semantic processing. Using a concrete/abstract semantic decision (i.e., does the word refer to something concrete or abstract?), we collected decision latencies and accuracy rates for 10,000 English words. The stimuli were concrete and abstract words selected from Brysbaert, Warriner, and Kuperman's (2013) comprehensive list of concreteness ratings. In total, 321 participants provided responses to 1,000 words each. Whereas semantic effects tend to be quite modest in naming and lexical decision studies, analyses of the concrete/abstract semantic decision responses show that a substantial proportion of variance can be explained by semantic variables. The item-level and trial-level data will be useful for other researchers interested in the semantic processing of concrete and abstract words
ASL-LEX is a lexical database that catalogues information about nearly 1,000 signs in American Sign Language (ASL). It includes the following information: subjective frequency ratings from 25–31 deaf signers, iconicity ratings from 21–37 hearing non-signers, videoclip duration, sign length (onset and offset), grammatical class, and whether the sign is initialized, a fingerspelled loan sign, or a compound. Information about English translations is available for a subset of signs (e.g., alternate translations, translation consistency). In addition, phonological properties (sign type, selected fingers, flexion, major and minor location, and movement) were coded and used to generate sub-lexical frequency and neighborhood density estimates. ASL-LEX is intended for use by researchers, educators, and students who are interested in the properties of the ASL lexicon. An interactive website where the database can be browsed and downloaded is available at http://asl-lex.org.
The use of the corpus becomes essential in the development of applications based on natural language processing (NLP). In Ecuador, these applications are incompatible because in each region use words outside the context of Spanish. This article presents the development of a corpus compatible with Ecuadorian natural language words. We applied a identification algorithm to take advantage of local literature and power a new data base. The corpus mounted is verified by a quantitative and qualitative comparison with an open access corpus. The result is the first corpus in this country with high scalability and great versatility. {\textcopyright} 2017 IEEE.
For most of the Uralic languages, there is a lack of systematically collected, consequently transcribed and morphologically annotated text corpora. This paper sums up the steps, the preliminary results and the future directions of building a linguistic corpus of some Uralic languages, namely Tundra Nenets, Udmurt, Synya Khanty, and Surgut Khanty. The experiences of building a corpus containing both old and modern, and written and oral data samples are discussed. Principles concerning data collection strategies of languages with different level of vitality and endangerment are discussed. Methodologies and challenges of data processing, and the levels of linguistic annotation are also described in detail.
The Remote Associates Test (RAT) has been used to measure creativity, however few repositories or standardizations of test items exist, like the normative data on 144 items provided by Bowden and Jung-Beeman. comRAT is a computational solver which has been used to solve the compound RAT in linguistic and visual forms, showing correlation to human performance over the normative data provided by Bowden and Jung-Beeman. This paper describes using a variant of comRAT, comRAT-G, to generate and construct a repository of compound RAT items for use in the cognitive psychology and cognitive modelling community. Around 17 million compound Remote Associates Test items are created from nouns alone, aiming to provide control over (i) frequency of occurrence of query items, (ii) answer items, (iii) the probability of coming up with an answer, (iv) keeping one or more query items constant and (v) keeping the answer constant. Queries produced by comRAT-G are evaluated in a study in comparison with queries from the normative dataset of Bowden and Jung-Beeman, showing that comRAT-G queries are similar to the established query set.
Words are widely used as stimuli in cognitive research. Because of their complexity, using words requires strict control of their objective (lexical and sublexical) and subjective properties. In this work, we present the Minho Word Pool (MWP), a dataset that provides normative values of imageability, concreteness, and subjective frequency for 3,800 (European) Portuguese words-three subjective measures that, in spite of being used extensively in research, have been scarce for Portuguese. Data were collected with 2,357 college students who were native speakers of European Portuguese. The participants rated 100 words drawn randomly from the full set for each of the three subjective indices, using a Web survey procedure (via a URL link). Analyses comparing the MWP ratings with those obtained for the same words from other national and international databases showed that the MWP norms are reliable and valid, thus providing researchers with a useful tool to support research in all neuroscientific areas using verbal stimuli. The MWP norms can be downloaded along with this article or from http://p-pal.di.uminho.pt/about/databases .
An idiom is classically defined as a formulaic sequence whose meaning is comprised of more than the sum of its parts. For this reason, idioms pose a unique problem for models of sentence processing, as researchers must take into account how idioms vary and along what dimensions, as these factors can modulate the ease with which an idiomatic interpretation can be activated. In order to help ensure external validity and comparability across studies, idiom research benefits from the availability of publicly available resources reporting ratings from a large number of native speakers. Resources such as the one outlined in the current paper facilitate opportunities for consensus across studies on idiom processing and help to further our goals as a research community. To this end, descriptive norms were obtained for 870 American English idioms from 2,100 participants along five dimensions: familiarity, meaningfulness, literal plausibility, global decomposability, and predictability. Idiom familiarity and meaningfulness strongly correlated with one another, whereas familiarity and meaningfulness were positively correlated with both global decomposability and predictability. Correlations with previous norming studies are also discussed.
In the present study, we collected valence, arousal, concreteness, familiarity, imageability, and context availabili-ty ratings for a total of 1,100 Chinese words. The ratings for all variables were collected with 9-point Likert scales. We tested the reliability of the present database by comparing it to the extant Chinese Affective Word System, and performed split-half correlations for all six variables. We then evaluated the relationships between all variables. Regarding the affective variables, we found a typical quadratic relation between va-lence and arousal, in line with previous findings. Likewise, significant correlations were found between the semantic var-iables. Importantly, we explored the relationships between ratings for the affective variables (i.e., valence and arousal) and concreteness ratings, suggesting that valence and arousal ratings can predict concreteness ratings. This database of af-fective norms will be a valuable source of information for emotion research that makes use of Chinese words, and will enable researchers to use highly controlled Chinese verbal stimuli to more reliably investigate the relation between cog-nition and emotion.
textcopyright} 2016, The Author(s). In this article, we introduce HelexKids, an online written-word database for Greek-speaking children in primary education (Grades 1 to 6). The database is organized on a grade-by-grade basis, and on a cumulative basis by combining Grade 1 with Grades 2 to 6. It provides values for Zipf, frequency per million, dispersion, estimated word frequency per million, standard word frequency, contextual diversity, orthographic Levenshtein distance, and lemma frequency. These values are derived from 116 textbooks used in primary education in Greece and Cyprus, producing a total of 68,692 different word types. HelexKids was developed to assist researchers in studying language development, educators in selecting age-appropriate items for teaching, as well as writers and authors of educational books for Greek/Cypriot children. The database is open access and can be searched online at www.helexkids.org.
This study presents a normative database of Spanish restricted length word stems that provides useful information for the selection of stimuli in memory experiments with Word Stem Completion (WSC) tasks. The database includes indices relative to stems (total baseline completion, priming baseline completion, priming, number of completions, ratio between given and deleted letters, and syllabic structure), and indices relative to characteristics of the words used to obtain the stems (frequency, familiarity, number of meanings, length, number of syllables, arousal, and valence). A WSC task was performed by 515 participants to calculate priming and baseline indices. An Exploratory Factor Analysis showed that these indices are grouped in four factors: perceptual, lexical, emotional, and response competition. Stepwise regression analyses performed with these factors showed that the lexical, response competition, and perceptual factors predict priming baseline completion, while only the lexical factor predicts priming. The model that best explains the relationship between priming and priming baseline completion was a cubic model, and the optimum baseline values for achieving priming were between .31 and .36. These norms can be downloaded as Supplemental Materials for this article from https://nuvol.uv.es/owncloud/index.php/s/hpj9by1qbENdjfj .
Words that have been learned early in life are responded to faster than words that have been acquired later. Subjective ratings of acquisition ages have been successfully employed to study the effect of age of acquisition (AoA). Although a large number of norms exist in many languages, fewer are available for German. Therefore, subjective AoA ratings for 3,259 German words were collected online, includ-ing 2,363 nouns and 473 verbs. These words were presented in lists of 140 words, and participants rated the age in years at which they had first learned each word. A split-half correla-tion testified to a high internal reliability. There were also high correlations with rated AoA values for subsets of the items that had been collected in previous studies, in both German and English. Age and gender were found to influence the ratings very weakly, in that older and male participants tended to give slightly higher age ratings. Education, multilingualism, and frequent usage of languages other than German did not exert an influence on the rating values. These new ratings will ex-tend the currently existing norms available for language and reading research across languages and will provide re-searchers with a wider choice of word stimuli. The ratings are available expressed in two measurements: age in years, and AoA rated on a 7-point Likert scale.
? 2016 Psychonomic Society, Inc.Semantic feature production norms provide many quantitative measures of different feature and concept variables that are necessary to solve some debates surrounding the nature of the organization, both normal and pathological, of semantic memory. Despite the current existence of norms for different languages, there are still no published norms in Spanish. This article presents a new set of norms collected from 810 participants for 400 living and nonliving concepts among Spanish speakers. These norms consist of empirical collections of features that participants used to describe the concepts. Four files were elaborated: a concept?feature file, a concept?concept matrix, a feature?feature matrix, and a significantly correlated features file. We expect that these norms will be useful for researchers in the fields of experimental psychology, neuropsychology, and psycholinguistics.
We present Chinese translation norms for 1429 English words. Chinese-English bilinguals (N=28) were asked to provide the first Chinese translation that came to mind for 1429 English words. The results revealed that 71{\%} of the English words received more than one correct translation indicating the large amount of translation ambiguity when translating from English to Chinese. The relationship between translation ambiguity and word frequency, concreteness and language proficiency was investigated. Although the significant correlations were not strong, results revealed that English word frequency was positively correlated with the number of alternative translations, whereas English word concreteness was negatively correlated with the number of translations. Importantly, regression analyses showed that the number of Chinese translations was predicted by word frequency and concreteness. Furthermore, an interaction between these predictors revealed that the number of translations was more affected by word frequency for more concrete words than for less concrete words. In addition, mixed-effects modelling showed that word frequency, concreteness and English language proficiency were all significant predictors of whether or not a dominant translation was provided. Finally, correlations between the word frequencies of English words and their Chinese dominant translations were higher for translation-unambiguous pairs than for translation-ambiguous pairs. The translation norms are made available in a database together with lexical information about the words, which will be a useful resource for researchers investigating Chinese-English bilingual language processing.
This article presents the Danish NOMCO Corpus, an annotated multimodal collection of video-recorded first acquaintance conversations between Danish speakers. The annotation includes speech transcription including word boundaries, and formal as well as functional coding of gestural behaviours, specifically head movements, facial expressions, and body posture. The corpus has served as the empirical basis for a number of studies of communication phenomena related to turn management, feedback exchange, information packaging and the expression of emotional attitudes. We describe the annotation scheme, procedure, and annotation results. We then summarise a number of studies conducted on the corpus. The corpus is available for research and teaching purposes through the authors of this article.
We present a collection of parallel treebanks that have been automatically aligned on both the terminal and the non-terminal constituent level for use in syntax-based machine translation. We describe how they were constructed and applied to a syntax- and example-based machine translation system called Parse and Corpus-Based Machine Translation (PaCo-MT). For the language pair Dutch to English, we present non-terminal alignment evaluation scores for a variety of tree alignment approaches. Finally, based on the parallel treebanks created by these approaches, we evaluate the MT system itself and compare the scores with those of Moses, a current state-of-the-art statistical MT system, when trained on the same data.
To simplify the problem of studying how people learn natural language, researchers use the artificial grammar learning (AGL) task. In this task, participants study letter strings constructed according to the rules of an artificial grammar and subsequently attempt to discriminate grammatical from ungrammatical test strings. Although the data from these experiments are usually analyzed by comparing the mean discrimination performance between experimental conditions, this practice discards information about the individual items and participants that could otherwise help uncover the particular features of strings associated with grammaticality judgments. However, feature analysis is tedious to compute, often complicated, and ill-defined in the literature. Moreover, the data violate the assumption of independence underlying standard linear regression models, leading to Type I error inflation. To solve these problems, we present AGSuite, a free Shiny application for researchers studying AGL. The suite's intuitive Web-based user interface allows researchers to generate strings from a database of published grammars, compute feature measures (e.g., Levenshtein distance) for each letter string, and conduct a feature analysis on the strings using linear mixed effects (LME) analyses. The LME analysis solves the inflation of Type I errors that afflicts more common methods of repeated measures regression analysis. Finally, the software can generate a number of graphical representations of the data to support an accurate interpretation of results. We hope the ease and availability of these tools will encourage researchers to take full advantage of item-level variance in their datasets in the study of AGL. We moreover discuss the broader applicability of the tools for researchers looking to conduct feature analysis in any field.
Words are frequently used as stimuli in cognitive psychology experiments, for example, in recognition memory studies. In these experiments, it is often desirable to control for the words' psycholinguistic properties because differences in such properties across experi-mental conditions might introduce undesirable confounds. In order to avoid confounds, stud-ies typically check to see if various affective and lexico-semantic properties are matched across experimental conditions, and so databases that contain values for these properties are needed. While word ratings for these variables exist in English and other European lan-guages, ratings for Chinese words are not comprehensive. In particular, while ratings for sin-gle characters exist, ratings for two-character words—which often have different meanings than their constituent characters, are scarce. In this study, ratings for 292 two-character Chi-nese nouns were obtained from Cantonese speakers in Hong Kong. Affective variables, including valence and arousal, and lexico-semantic variables, including familiarity, concrete-ness, and imageability, were rated in the study. The words were selected from a film subtitle database containing word frequency information that could be extracted and listed along-side the resulting ratings. Overall, the subjective ratings showed good reliability across all rated dimensions, as well as good reliability within and between the different groups of par-ticipants who each rated a subset of the words. Moreover, several well-established relation-ships between the variables found consistently in other languages were also observed in this study, demonstrating that the ratings are valid. The resulting word database can be used in studies where control for the above psycholinguistic variables is critical to the research design. PLOS ONE | https://doi.org/10.1371/journal.pone.0174569 March 27, 2017 1 / 16 a1111111111 a1111111111 a1111111111 a1111111111 a1111111111 OPEN ACCESS Citation: Yee LTS (2017) Valence, arousal, familiarity, concreteness, and imageability ratings for 292 two-character Chinese nouns in Cantonese speakers in Hong Kong. PLoS ONE 12(3):
Picture databases are commonly used in experimental work on various aspects of emotion processing. However, existing standardized facial databases, typically used to explore emotion recognition, can be augmented with more contextual information for studying emotion and social perception. Moreover, the perception of social engagement, i.e., the degree of interaction or engagement inferred between the people in target pictures, has not been measured. In this paper, we describe the development of a database comprising 203 black-and-white line drawings depicting people within various situational contexts, and normed on perceived emotional valence, intensity, and social engagement, a new construct. Analyses of ratings collected from 62 young adults (30 females, 32 males; mean age 22 years) revealed the typical quadratic relationship between valence and intensity, i.e., stimuli that are more emotionally charged, whether positively or negatively valenced, are more intense than emotionally-neutral stimuli. Moreover, the results showed significant linear and quadratic relationships between valence and social engagement ratings, indicating that emotionally-charged social scenes were perceived as more engaging than emotionally-neutral social scenes. This new database will facilitate investigations of how people perceive and interpret social and emotional information in everyday interactions, and is offered as a resource to experimenters involved in social and/or emotional processing research.
Words are considered semantically ambiguous if they have more than one meaning and can be used in multiple contexts. A number of recent studies have provided objective ambiguity measures by using a corpus-based approach and have demonstrated ambiguity advantages in both naming and lexical decision tasks. Although the predictive power of objective ambiguity measures has been examined in several alphabetic language systems, the effects in logographic languages remain unclear. Moreover, most ambiguity measures do not explicitly address how the various contexts associated with a given word relate to each other. To explore these issues, we computed the contextual diversity (Adelman, Brown, {\&} Quesada, Psychological Science, 17; 814-823, 2006) and semantic ambiguity (Hoffman, Lambon Ralph, {\&} Rogers, Behavior Research Methods, 45; 718-730, 2013) of traditional Chinese single-character words based on the Academia Sinica Balanced Corpus, where contextual diversity was used to evaluate the present semantic space. We then derived a novel ambiguity measure, namely semantic variability, by computing the distance properties of the distinct clusters grouped by the contexts that contained a given word. We demonstrated that semantic variability was superior to semantic diversity in accounting for the variance in naming response times, suggesting that considering the substructure of the various contexts associated with a given word can provide a relatively fine scale of ambiguity information for a word. All of the context and ambiguity measures for 2,418 Chinese single-character words are provided as supplementary materials.
Corpora are essential resources for language studies, as well as for training statistical natural language processing systems. Although very large English corpora have been built, only relatively small corpora are available for many varieties of English. National top-level domains (e.g., .au, .ca) could be exploited to automatically build web corpora, but it is unclear whether such corpora would reflect the corresponding national varieties of English; i.e., would a web corpus built from the .ca domain correspond to Canadian English? In this article we build web corpora from national top-level domains corresponding to countries in which English is widely spoken. We then carry out statistical analyses of these corpora in terms of keywords, measures of corpus comparison based on the Chi-square test and spelling variants, and the frequencies of words known to be marked in particular varieties of English. We find evidence that the web corpora indeed reflect the corresponding national varieties of English. We then demonstrate, through a case study on the analysis of Canadianisms, that these corpora could be valuable lexicographical resources.
The extent to which processing words involves breaking them down into smaller units or morphemes or is the result of an interactive activation of other units, such as meanings, letters, and sounds (e.g., dis-agree-ment vs. disagreement), is currently under debate. Disentangling morphology from phonology and semantics is often a methodological challenge, because orthogonal manipulations are difficult to achieve (e.g., semantically unrelated words are often phonologically related: casual-casualty and, vice versa, sign-signal). The present norms provide a morphological classification of 3,263 suffixed derived words from two widely spoken languages: English (2,204 words) and Spanish (1,059 words). Morphologically complex words were sorted into four categories according to the nature of their relationship with the base word: phonologically transparent (friend-friendly), phonologically opaque (child-children), semantically transparent (habit-habitual), and semantically opaque (event-eventual). In addition, ratings were gathered for age of acquisition, imageability, and semantic distance (i.e., the extent to which the meaning of the complex derived form could be drawn from the meaning of its base constituents). The norms were completed by adding values for word frequency; word length in number of phonemes, letters, and syllables; lexical similarity, as measured by the number of neighbors; and morphological family size. A series of comparative analyses from the collated ratings for the base and derived words were also carried out. The results are discussed in relation to recent findings.
The free text notes typed by physicians during patient consultations contain valuable information for the study of disease and treatment. These notes are difficult to process by existing natural language analysis tools since they are highly telegraphic (omitting many words), and contain many spelling mistakes, inconsistencies in punctuation, and non-standard word order. To support information extraction and classification tasks over such text, we describe a de-identified corpus of free text notes, a shallow syntactic and named entity annotation scheme for this kind of text, and an approach to training domain specialists with no linguistic background to annotate the text. Finally, we present a statistical chunking system for such clinical text with a stable learning rate and good accuracy, indicating that the manual annotation is consistent and that the annotation scheme is tractable for machine learning.
We present a new set of subjective age-of-acquisition (AoA) ratings for 299 words (158 nouns, 141 verbs) in 25 languages from five language families (Afro-Asiatic: Semitic languages; Altaic: one Turkic language: Indo-European: Baltic, Celtic, Germanic, Hellenic, Slavic, and Romance languages; Niger-Congo: one Bantu language; Uralic: Finnic and Ugric languages). Adult native speakers reported the age at which they had learned each word. We present a comparison of the AoA ratings across all languages by contrasting them in pairs. This comparison shows a consistency in the orders of ratings across the 25 languages. The data were then analyzed (1) to ascertain how the demographic characteristics of the participants influenced AoA estimations and (2) to assess differences caused by the exact form of the target question (when did you learn vs. when do children learn this word); (3) to compare the ratings obtained in our study to those of previous studies; and (4) to assess the validity of our study by comparison with quasi-objective AoA norms derived from the MacArthur-Bates Communicative Development Inventories (MB-CDI). All 299 words were judged as being acquired early (mostly before the age of 6 years). AoA ratings were associated with the raters' social or language status, but not with the raters' age or education. Parents reported words as being learned earlier, and bilinguals reported learning them later. Estimations of the age at which children learn the words revealed significantly lower ratings of AoA. Finally, comparisons with previous AoA and MB-CDI norms support the validity of the present estimations. Our AoA ratings are available for research or other purposes.
Evidence based medicine (EBM) urges the medical doctor to incorporate the latest available clinical evidence at point of care. A major stumbling block in the practice of EBM is the difficulty to keep up to date with the clinical advances. In this paper we describe a corpus designed for the development and testing of text processing tools for EBM, in particular for tasks related to the extraction and summarisation of answers and corresponding evidence related to a clinical query. The corpus is based on material from the Clinical Inquiries section of The Journal of Family Practice. It was gathered and annotated by a combination of automated information extraction, crowdsourcing tasks, and manual annotation. It has been used for the original summarisation task for which it was designed, as well as for other related tasks such as the appraisal of clinical evidence and the clustering of the results. The corpus is available at SourceForge (http://sourceforge.net/projects/ebmsumcorpus/).
Lexical frequency is one of the strongest predictors of word processing time. The frequencies are often calculated from book-based corpora, or more recently from subtitle-based corpora. We present new frequencies based on Twitter, blog posts, or newspapers for 66 languages. We show that these frequencies predict lexical decision reaction times similar to the already existing frequencies, or even better than them. These new frequencies are freely available and may be downloaded from http://worldlex.lexique.org .
Color has the ability to influence a variety of human behaviors, such as object recognition, the identification of facial expressions, and the ability to categorize stimuli as positive or negative. Researchers have started to examine the relationship between emotional words and colors, and the findings have revealed that brightness is often associated with positive emotional words and darkness with negative emotional words (e.g., Meier, Robinson, {\&} Clore, Psychological Science, 15, 82-87, 2004). In addition, words such as anger and failure seem to be inherently associated with the color red (e.g., Kuhbandner {\&} Pekrun). The purpose of the present study was to construct norms for positive and negative emotion and emotion-laden words and their color associations. Participants were asked to provide the first color that came to mind for a set of 160 emotional items. The results revealed that the color RED was most commonly associated with negative emotion and emotion-laden words, whereas YELLOW and WHITE were associated with positive emotion and emotion-laden words, respectively. The present work provides researchers with a large database to aid in stimulus construction and selection.
A corpus of Polish speech, which has been collected for the purpose of automatic speech recognition (ASR) and text-to-speech (TTS) systems applications, is presented. The corpus consists of several groups of recordings: read sentences, spoken commands, a phonetically balanced TTS training corpus, telephonic speech and others. In summary duration of recordings is above 25 h. Number of unique speakers amounts to 166. The majority of them being in an age group of 20–35 and one third of them being female. Analysis of unique word occurrence frequency in relation to larger text resources has been concluded. From them, most commonly appearing words have been found and presented. The corpus was used as training data for the ASR system. Results of cross-validation training and testing the SARMATA ASR system using our corpus have shown that phrase recognition rate is 91.9 %. The corpus was additionally evaluated in comparative test against the CORPORA corpus, which had shown major increase in phrase recognition rate in favour of our corpus.
Textual analysis has been applied to various fields, such as discourse analysis, corpus studies, text leveling, and automated essay evaluation. Several tools have been devel-oped for analyzing texts written in alphabetic languages such as English and Spanish. However, currently there is no tool available for analyzing Chinese-language texts. This article introduces a tool for the automated analysis of simplified and traditional Chinese texts, called the Chinese Readability Index Explorer (CRIE). Composed of four subsystems and incorporating 82 multilevel linguistic features, CRIE is able to conduct the major tasks of segmentation, syntactic parsing, and feature extraction. Furthermore, the integration of linguis-tic features with machine learning models enables CRIE to provide leveling and diagnostic information for texts in lan-guage arts, texts for learning Chinese as a foreign language, and texts with domain knowledge. The usage and validation of the functions provided by CRIE are also introduced.
Many studies have shown that how words are processed in a variety of language-related tasks is affected by their age of acquisition (AoA). Most AoA norms have been collected for nouns, a fact that limits the extent to which verb stimuli can be adequately manipulated and controlled in empirical studies. With the aim of increasing the number of verbs with AoA values in Spanish, 900 college students were recruited to provide subjective estimates for a total of 4,640 infinitive and reflexive forms. An AoA score for each verb was obtained by averaging the responses of the participants, and these norms were included, together with additional quantitative information (standard deviations, ranges, and z scores), in a database that can be downloaded with this article as supplemental materials.
Our understanding of the mental lexicon, the way meaning is extracted from word forms, is almost entirely built on data from spoken languages. While there is much work demonstrating that in many ways the linguistic structure and psychological mechanisms for processing signed language and spoken language processing are the same, less is known about the signed language mental lexicon. In this dissertation, I examine the structure of the American Sign Language mental lexicon, and the ways meaning can be extracted from the manual/visual signal. In the third chapter of this dissertation I ask whether a single cognitive architecture might explain diverse behavioral patterns in signed and spoken language. Chen and Mirman (2012) presented a computational model of word processing that unified opposite effects of neighborhood density in speech production, perception, and written word recognition. Carreiras et al. (2008) demonstrate that neighborhood density effects in Spanish Sign Language (LSE) also vary depending on whether the neighbors share the same handshape or location. We present a spreading activation architecture that borrows the principles proposed by Chen and Mirman (2012), and show that if this architecture is elaborated to incorporate relatively minor facts about either 1) the time course of sign perception or 2) the frequency of sub-lexical units in sign languages, it produces data that match the experimental findings from sign languages. This work serves as a proof of concept that a single cognitive architecture could underlie both sign and word recognition. In the second chapter I present ASL-LEX, a lexical database for ASL that catalogues more than forty properties about almost 1,000 signs. The database includes, for example, information about each sign's iconicity, phonological make-up, and neighborhood density. I use this information to better understand the structure of the ASL lexicon, the distribution of each of these properties, and the relationships between these properties. This lexical database is the largest and most comprehensive database of ASL, and can be used by researchers to develop experiments and by educators to identify and support vocabulary development. In the fourth chapter, I use ASL-LEX to develop a tightly-controlled study of sign perception. I ask whether neighborhood density and sub-lexical frequency play a role in sign perception, and if the mechanisms of sign perception are affected by early language experience. Eighty deaf participants with varying early language backgrounds completed a lexical decision task. I find that neighborhood density inhibits sign perception in people with low early ASL exposure, but has no effect in people with high early ASL exposure. Location frequency inhibits sign perception in all people, but the effect is stronger in people with low early ASL exposure. This suggests that impoverished access to ASL early in life has lasting consequences for sign perception. (PsycINFO Database Record (c) 2016 APA, all rights reserved)
The design of experimental tasks in psychology and linguistics requires using stimulus with properties and characteristics in standardized values. This allows predicting with higher accuracy the impact of the stimulus presentation. The lexical associative norms are instruments that determine the strength of association between two concepts. The most common method to construct these norms is to take a free response from a presentation of a cue word. The main goal of this study was to construct lexical associative norms of 407 Spanish words. 800 students from Ciudad de Córdoba, Argentina, participated in the study. Quantitative analyses were performed taking into account the number of valid answers, blank and non valid answers, and number of associates per item. A qualitative classification was performed according to the strength of association. Additionally, it is presented a group of psycholinguistic indexes for a better description of the items used. Correlation analysis demonstrated a strong and negative relation between the frequency of first and second associations and the number of associations per item. This study pretends to be highly useful in research in psychology and linguistic where it is required consulting the norms presented to the design of evaluation instruments. (PsycINFO Database Record (c) 2018 APA, all rights reserved)
In many research domains, researchers have employed gradually morphing pictures to study perception under ambiguity. Despite their inherent utility, only a limited number of stimulus sets are available, and those sets vary substantially in quality and perceptual complexity. Here we present normative data for 40 morphing picture series. In all sets, line drawings of pictures of common objects are morphed over 15 iterations into a completely different object. Objects are either morphed from an animate to an inanimate object (or vice versa) or morphed within the animate and inanimate object categories. These pictures, together with the normative naming data presented here, will be of value for research on a diverse range of questions, from perceptual processing to decision making.
textcopyright} 2015, Psychonomic Society, Inc. Relative meaning frequency is a critical factor to consider in studies of semantic ambiguity. In this work, we examined how this measure may change across the European and Rioplatense dialects of Spanish, as well as how the overall distributional properties differ between Spanish and English, using a computer-assisted norming approach based on dictionary definitions (Armstrong, Tokowicz, {\&} Plaut, 2012). The results showed that the two dialects differ considerably in terms of the relative meaning frequencies of their constituent homonyms, and that the overall distributions of relative frequencies vary considerably across languages, as well. These results highlight the need for localized norms to design powerful studies of semantic ambiguity and suggest that dialectal differences may be responsible for some discrepant effects related to homonymy. In quantifying the reliability of the norms, we also established that as few as seven ratings are needed to converge on a highly stable set of ratings. This approach is therefore a very practical means of acquiring essential data in studies of semantic ambiguity, relative to past approaches, such as those based on the classification of free associates. The norms also present new possibilities for studying semantic ambiguity effects within and between populations who speak one or more languages. The norms and associated software are available for download at http://edom.cnbc.cmu.edu/ or http://www.bcbl.eu/databases/edom/.
Faces impart exhaustive information about their bearers, and are widely used as stimuli in psychological research. Yet many extant facial stimulus sets have sub- stantially less detail than faces encountered in real life. In this paper, we describe a new database of facial stimuli, the Multi-Racial Mega-Resolution database (MR2). The MR2 includes 74 extremely high resolution images of European, African, and East Asian faces. This database provides a high-quality, diverse, naturalistic, and well-controlled facial image set for use in research. The MR2 is available under a Creative Commons license, and may be accessed online.
Humans appear to rely on spatial mappings to describe and represent concepts. In particular, conceptual cueing refers to the effect whereby after reading or hearing a particular word, the location of observers' visual attention in space can be systematically shifted in a particular direction. For example, words such as "sun" and "happy" orient attention upwards, whereas words such as "basement" and "bitter" orient attention downwards. This area of research has garnered much interest, particularly within the embodied cognition framework, for its potential to enhance our understanding of the interaction between abstract cognitive processes such as language and basic visual processes such as attention and stimulus processing. To date, however, this area has relied on subjective classification criteria to determine whether words ought to be classified as having a meaning that implies "up" or "down." The present study, therefore, provides a set of 498 items that have each been systematically rated by over 90 participants, providing refined, continuous measures of the extent to which people associate given words with particular spatial dimensions. The resulting database provides an objective means to aid item-selection for future research in this area.
Normative databases for pictorial stimuli are widely used in research on language processing in order to control for a number of psycholinguistic variables in the selected stimuli. Such resources are lacking for Arabic and its dialectal varieties. In the present study, we aimed to provide Tunisian Arabic (TA) normative data for 348 line drawings taken from Cycowicz, Friedman, Rothstein, and Snodgrass (1997), which include Snodgrass and Vanderwart's (1980) 260 pictures. Norms were collected for the following psycholinguistic variables: name agreement, familiarity, subjective frequency, and imageability. Word length data (in numbers of phonemes and syllables) are also listed in the database. We investigated the effects of these variables on word reading in TA. We found that word length and frequency were the best predictors of word-reading latencies in TA. Name agreement was also a significant predictor of word-reading latencies. A particularly interesting finding was that the semantic variables, imageability and familiarity, affected word-reading latencies in TA. Thus, it would seem that TA readers rely on semantics even when reading individual Arabic words that are transparent in terms of orthography-to-phonology mappings. This database represents a precious and much-needed psycholinguistic resource for researchers investigating language processing in Arabic-speaking populations.
In the speeded word fragment completion task, participants have to complete fragments such as tom{\_}to as quickly and accurately as possible. Previous work has shown that this paradigm can successfully capture subtle priming effects (Heyman, De Deyne, Hutchison, {\&} Storms Behavior Research Methods, 47, 580-606, 2015). In addition, it has several advantages over the widely used lexical decision task. That is, the speeded word fragment completion task is more efficient, more engaging, and easier. Given its potential, we conducted a study to gather speeded word fragment completion norms. The goal of this megastudy was twofold. On the one hand, it provides a rich database of over 8,000 stimuli, which can, for instance, be used in future research to equate stimuli on baseline response times. On the other hand, the aim was to gain insight into the underlying processes of the speeded word fragment completion task. To this end, item-level regression and mixed-effects analyses were performed on the response latencies using 23 predictor variables. Since all items were selected from the Dutch Lexicon Project (Keuleers, Diependaele, {\&} Brysbaert Frontiers in Psychology, 1, 174, 2010), we ran the same analyses on lexical decision latencies to compare the two tasks. Overall, the results revealed many similarities, but also some remarkable differences, which are discussed. We propose that both tasks are complementary when examining visual word recognition. The article ends with a discussion of potential process models of the speeded word fragment completion task.
In the present study, we introduce affective norms for a new set of Spanish words, the Madrid Affective Database for Spanish (MADS), that were scored on two emotional dimensions (valence and arousal) and on five discrete emotional categories (happiness, anger, sadness, fear, and disgust), as well as on concreteness, by 660 Spanish native speakers. Measures of several objective psycholinguistic variables—grammatical class, word frequency, number of letters, and number of syllables—for the words are also included. We observed high split-half reliabilities for every emotional variable and a strong quadratic relationship between valence and arousal. Additional analyses revealed several associations between the affective dimensions and discrete emotions, as well as with some psycholinguistic variables. This new corpus complements and extends prior databases in Spanish and allows for designing new experiments investigating the influence of affective content in language processing under both dimensional and discrete theoretical conceptions of emotion. These norms can be downloaded as supplemental materials for this article from www.dropbox.com/s/o6dpw3irk6utfhy/Hinojosa{\%}20et{\%}20al{\_}Supplementary{\%}20materials.xlsx?dl=0.
Houses have often been used as comparison stimuli in face-processing studies because of the many attributes they share with faces (e.g., distinct members of a basic category, consistent internal features, mono-orientation, and relative familiarity). Despite this, no large, well-controlled databases of photographs of houses that have been developed for research use currently exist. To address this gap, we photographed 100 houses and carefully edited these images. We then asked 41 undergraduate students (18 to 31 years of age) to rate each house on three dimensions: typicality, likeability, and face-likeness. The ratings had a high degree of face validity, and analyses revealed a significant positive correlation between typicality and likeability. We anticipate that this stimulus set (i.e., the DalHouses) and the associated ratings will prove useful to face-processing researchers by minimizing the effort required to acquire stimuli and allowing for easier replication and extension of studies. The photographs of all 100 houses and their ratings data can be obtained at http://dx.doi.org/10.6084/m9.figshare.1279430.
The Nencki Affective Picture System (NAPS; Marchewka, {\.{Z}}urawski, Jednor{\'{o}}g, {\&} Grabowska, Behavior Research Methods, 2014) is a standardized set of 1,356 realistic, high-quality photographs divided into five categories (people, faces, animals, objects, and landscapes). NAPS has been primarily standardized along the affective dimensions of valence, arousal, and approach-avoidance, yet the characteristics of discrete emotions expressed by the images have not been investigated thus far. The aim of the present study was to collect normative ratings according to categorical models of emotions. A subset of 510 images from the original NAPS set was selected in order to proportionally cover the whole dimensional affective space. Among these, using three available classification methods, we identified images eliciting distinguishable discrete emotions. We introduce the basic-emotion normative ratings for the Nencki Affective Picture System (NAPS BE), which will allow researchers to control and manipulate stimulus properties specifically for their experimental questions of interest. The NAPS BE system is freely accessible to the scientific community for noncommercial use as supplementary materials to this article.
The most important forms of idioms in Chinese, chengyus (CYs), have a fixed length of four Chinese characters. Most CYs are joined structures of two, two-character words—subject–verb units (SVs), verb–object units (VOs), structures of modification (SMs), or verb–verb units—or of four, one-character words. Both the first and second pairs of words in a four-word CY form an SV, a VO, or an SM. In the present study, normative measures were obtained for knowledge, familiarity, subjective frequency, age of acquisition, predictability, literality, and compositionality for 350 CYs, and the influences of the CYs' syntactic structures on the descriptive norms were analyzed. Consistent with previous studies, all of the norms yielded a high reliability, and there were strong correlations between knowledge, familiarity, subjective frequency, and age of acquisition, and between familiarity and predictability. Unlike in previous studies (e.g., Libben {\&} Titone in Memory {\&} Cognition, 36, 1103–1121, 2008), however, we observed a strong correlation between literality and compositionality. In general, the results seem to support a hybrid view of idiom representation and comprehension. According to the evaluation scores, we further concluded that CYs consisting of just one SM are less likely to be decomposable than those with a VOVO composition, and also less likely to be recognized through their constituent words, or to be familiar to, known by, or encountered by users. CYs with an SMSM composition are less likely than VOVO CYs to be decomposable or to be known or encountered by users. Experimental studies should investigate how a CY's syntactic structure influences its representation and comprehension.
Recently, genre collection and automatic genre identification for the web has attracted much attention. However, currently there is no genre-annotated corpus of web pages where inter-annotator reliability has been established, i.e. the corpora are either not tested for inter-annotator reliability or exhibit low inter-coder agreement. Annotation has also mostly been carried out by a small number of experts, leading to concerns with regard to scalability of these annotation efforts and transferability of the schemes to annotators outside these small expert groups. In this paper, we tackle these problems by using crowd-sourcing for genre annotation, leading to the Leeds Web Genre Corpus—the first web corpus which is, demonstrably reliably annotated for genre and which can be easily and cost-effectively expanded using naive annotators. We also show that the corpus is source and topic diverse.
Knowledge of thematic relations is an area of increased interest in semantic memory research because it is crucial to many cognitive processes. One methodological issue that researchers face is how to identify pairs of thematically related concepts that are well-established in semantic memory for most people. In this article, we review existing methods of assessing thematic relatedness and provide thematic relatedness production norming data for 100 object concepts. In addition, 1,174 related concept pairs obtained from the production norms were classified as reflecting one of the five subtypes of relations: attributive, argument, coordinate, locative, and temporal. The database and methodology will be useful for researchers interested in the effects of thematic knowledge on language processing, analogical reasoning, similarity judgments, and memory. These data will also benefit researchers interested in investigating potential processing differences among the five types of semantic relations.
This article presents subjective rating norms for a new set of 600 symbols, depicting various contents (e.g., transportation, technology, and leisure activities) that can be used by researchers in different fields. Symbols were evaluated for aesthetic appeal, familiarity, visual complexity, concreteness, valence, arousal, and meaningfulness. The normative data were obtained from 388 participants, and no gender differences were found. Descriptive results (means, standard deviations, and confidence intervals) for each symbol in each dimension are presented. Overall, the dimensions were highly correlated. Additionally, participants were asked to briefly describe the meaning of each symbol. The results indicate that the present symbol set is varied, allowing for the selection of exemplars with different levels on the seven examined dimensions. This set of symbols constitutes a tool with potential for research in different areas. The database with all of the symbols is available as supplemental materials.
ABSTRACTIn brain and behaviour, gustation, and olfaction are closely linked to emotional processing. This paper shows that similarly, words associated with taste and smell, such as “pungent” and “delicious”, are on average more emotionally valenced than words associated with the other senses, such as “beige” (visual) and “echoing” (auditory). Moreover, taste and smell words occur more frequently in emotionally valenced phrases, for example, “fragrant” modifies more emotionally valenced nouns (“fragrant kiss”) than the visual adjective “yellow” (“yellow house”). It is argued that taste and smell words form an affectively loaded part of the English lexicon. Taste and smell words are also shown to be more emotionally flexible in that words such as “sweet” can be combined with both good and bad nouns (“sweet delight” versus “sweet disaster”), much more so than is the case for sensory words for the other modalities. The paper discusses implications for theories of embodied language understanding.
Change blindness has been a topic of interest in cognitive sciences for decades. Change detection experiments are frequently used for studying various research topics such as attention and perception. However, creating change detection stimuli is tedious and there is no open repository of such stimuli using natural scenes. We introduce the Change Blindness (CB) Database with object changes in 130 colored images of natural indoor scenes. The size and eccentricity are provided for all the changes as well as reaction time data from a baseline experiment. In addition, we have two specialized satellite databases that are subsets of the 130 images. In one set, changes are seen in rooms or in mirrors in those rooms (Mirror Change Database). In the other, changes occur in a room or out a window (Window Change Database). Both the sets have controlled background, change size, and eccentricity. The CB Database is intended to provide researchers with a stimulus set of natural scenes with defined stimulus parameters that can be used for a wide range of experiments. The CB Database can be found at http://search.bwh.harvard.edu/new/CBDatabase.html .
Research in metaphor processing has made extensive use of the normed metaphor database created by Katz, Paivio, Marschark, {\&} Clark (Metaphor and Symbolic Activity, 3, 191–214, 1988). Because of the plasticity of figurative language, we conducted a renorming of selected metaphors from the database on a new student population. Correlations between Katz et al.'s and the present data showed that the pattern of responses has remained highly consistent across time and populations. The consistency of the normative ratings allows us to be confident in future research that will use the Katz et al. collection.
The EU-Emotion Stimulus Set is a newly developed collection of dynamic multimodal emotion and mental state representations. A total of 20 emotions and mental states are represented through facial expressions, vocal expressions, body gestures and contextual social scenes. This emotion set is portrayed by a multi-ethnic group of child and adult actors. Here we present the validation results, as well as participant ratings of the emotional valence, arousal and intensity of the visual stimuli from this emotion stimulus set. The EU-Emotion Stimulus Set is available for use by the scientific community and the validation data are provided as a supplement available for download.
Studies of semantic variables (e.g., concreteness) and affective variables (i.e., valence and arousal) have traditionally tended to run in different directions. However, in recent years there has been growing interest in studying the relationship, as well as the potential overlaps, between the two. This article describes a database that provides subjective ratings for 1,400 Spanish words for valence, arousal, concreteness, imageability, context availability, and familiarity. Data were collected online through a process involving 826 university students. The results showed a high interrater reliability for all of the variables examined, as well as high correlations between our affective and semantic values and norms currently available in other Spanish databases. Regarding the affective variables, the typical quadratic correlation between valence and arousal ratings was obtained. Likewise, significant correlations were found between the lexico-semantic variables. Importantly, we obtained moderate negative correlations between emotionality and both concreteness and imageability. This is in line with the claim that abstract words have more affective associations than concrete ones (Kousta, Vigliocco, Vinson, Andrews, {\&} Del Campo, 2011). The present Spanish database is suitable for experimental research into the effects of both affective properties and lexico-semantic variables on word processing and memory.
Many experimental research designs require images of novel objects. Here we introduce the Novel Object and Unusual Name (NOUN) Database. This database contains 64 primary novel object images and additional novel exemplars for ten basic- and nine global-level object categories. The objects' novelty was confirmed by both self-report and a lack of consensus on questions that required participants to name and identify the objects. We also found that object novelty correlated with qualifying naming responses pertaining to the objects' colors. The results from a similarity sorting task (and a subsequent multidimensional scaling analysis on the similarity ratings) demonstrated that the objects are complex and distinct entities that vary along several featural dimensions beyond simply shape and color. A final experiment confirmed that additional item exemplars comprised both sub- and superordinate categories. These images may be useful in a variety of settings, particularly for developmental psychology and other research in the language, categorization, perception, visual memory, and related domains.
The present study provides norms for Deese/Roediger-McDermott (DRM) lists that were used to create false memories in native speakers of Italian. The word lists reported in this article are based on the DRM lists that have been used extensively to examine illusory memories in English speakers (Deese in Journal of Experimental Psychology, 58, 17-22, 1959; Roediger {\&} McDermott in Journal of Experimental Psychology: Learning, Memory, {\&} Cognition, 21, 803-814, 1995). We translated the 24 critical lures from 24 English DRM lists and created semantically associated Italian word lists that were then normed with native Italian speakers. Overall, the participants recalled 63{\%} of the list items and 22{\%} of the critical lures with the word lists developed. In addition, 56{\%} of the list items and 82{\%} of the critical lures were recognized by the participants. The present study provides a set of Italian lists that can be used by researchers interested in evaluating false memories in Italian-speaking participants.
We present a corpus-based prosodic analysis with the aim of uncovering the relationship between dialogue acts, personality and prosody in view to providing guidelines for the ECA Greta’s text-to-speech system. The corpus used is the SEMAINE corpus, featuring four different personalities, further annotated for dialogue acts and prosodic features. In order to show the importance of the choice of dialogue act taxonomy, two different taxonomies were used, the first corresponding to Searle’s taxonomy of speech acts and the second, inspired by Bunt’s DIT++, including a division of directive acts into finer categories. Our results show that finer-grained distinctions are important when choosing a taxonomy. We also show with some preliminary results that the prosodic correlates of dialogue acts are not always as cited in the literature and prove more complex and variable. By studying the realisation of different directive acts, we also observe differences in the communicative strategies of the ECA depending on personality, in view to providing input to a speech system.
False-memory illusions have been widely studied using the Deese/Roediger-McDermott paradigm (DRM). In this paradigm, words semantically related to a single nonpresented critical word are studied. In a later memory test, critical words are often falsely recalled and recognized. The present normative study was conducted to measure the theme identifiability of 60 associative word lists in Spanish that include six words (e.g., stove, coat, blanket, scarf, chill, and bonnet) that are simultaneously associated with three critical words (e.g., HEAT, COLD, and WINTER; Beato {\&} D{\'{i}}ez, Psicothema, 26, 457-463, 2011). Different levels of backward associative strength were used in the construction of the DRM lists. In addition, we used two types of instructions to obtain theme identifiability. In the without-explanation condition, traditional instructions were used, requesting participants to write the theme list. In the with-explanation condition, the false-memory effect and how the lists were built were explained, and an example of a DRM list and critical words was shown. Participants then had to discover the critical words. The results showed that all lists produced theme identifiability. Moreover, some lists had a higher theme identifiability rate (e.g., 61 {\%} for the critical words LOVE, BOYFRIEND, COUPLE) than others (e.g., 24 {\%} for CITY, PLACE, VILLAGE). After comparing the theme identifiabilities in the different conditions, the results indicated higher theme identifiability when the false-memory effect was explained than without such an explanation. Overall, these new normative data provide a useful tool for those experiments that, for example, aim to analyze the wide differences observed in false memory with DRM lists and the role of theme identifiability.
The LSE-Sign database is a free online tool for selecting Spanish Sign Language stimulus materials to be used in experiments. It contains 2,400 individual signs taken from a recent standardized LSE dictionary, and a further 2,700 related nonsigns. Each entry is coded for a wide range of grammatical, phonological, and articulatory information, including handshape, location, movement, and non-manual elements. The database is accessible via a graphically based search facility which is highly flexible both in terms of the search options available and the way the results are displayed. LSE-Sign is available at the following website: http://www.bcbl.eu/databases/lse/.
This study introduces the Tool for the Automatic Analysis of Cohesion (TAACO), a freely available text analysis tool that is easy to use, works on most operating systems (Windows, Mac, and Linux), is housed on a user's hard drive (rather than having an Internet interface), allows for the batch processing of text files, and incorporates over 150 classic and recently developed indices related to text cohesion. The study validates TAACO by investigating how its indices related to local, global, and overall text cohesion can predict expert judgments of text coherence and essay quality. The findings of this study provide predictive validation of TAACO and support the notion that expert judgments of text coherence and quality are either negatively correlated or not predicted by local and overall text cohesion indices, but are positively predicted by global indices of cohesion. Combined, these findings provide supporting evidence that coherence for expert raters is a property of global cohesion and not of local cohesion, and that expert ratings of text quality are positively related to global cohesion.
Rebus puzzles and compound remote associate problems have been successfully used to study problem solving. These problems are physically compact, often can be solved within short time limits, and have unambiguous solutions, and English versions have been normed for solving rates and levels of difficulty. Many studies on problem solving with sudden insight have taken advantage of these features in paradigms that require many quick solutions (e.g., solution priming, visual hemifield presentations, electroencephalography, fMRI, and eyetracking). In order to promote this vein of research in Italy, as well, we created and tested Italian versions of both of these tests. The data collected across three studies yielded a pool of 88 rebus puzzles and 122 compound remote associate problems within a moderate range of difficulty. This article provides both sets of problems with their normative data, for use in future research.
In the present study, we report naming latencies and norms for 327 photos of objects in Dutch. We provide norms for eight psycholinguistic variables: age of acquisition, familiarity, imageability, image agreement, objective and subjective visual complexity, word frequency, word length in syllables and letters, and name agreement. Furthermore, multiple regression analyses revealed that the significant predictors of photo-naming latencies were name agreement, word frequency, imageability, and image agreement. The naming latencies, norms, and stimuli are provided as supplemental materials.
Despite flourishing research on the relationship between emotion and literal language, and despite the pervasiveness of figurative expressions in communication, the role of figurative language in conveying affect has been underinvestigated. This study provides affective and psycholinguistic norms for 619 German idiomatic expressions and explores the relationships between affective and psycholinguistic idiom properties. German native speakers rated each idiom for emotional valence, arousal, familiarity, semantic transparency, figurativeness, and concreteness. They also described the figurative meaning of each idiom and rated how confident they were about the attributed meaning. The results showed that idioms rated high in valence were also rated high in arousal. Negative idioms were rated as more arousing than positive ones, in line with results from single words. Furthermore, arousal correlated positively with figurativeness (supporting the idea that figurative expressions are more emotionally engaging than literal expressions) and with concreteness and semantic transparency. This suggests that idioms may convey a more direct reference to sensory representations, mediated by the meanings of their constituting words. Arousal correlated positively with familiarity. In addition, positive idioms were rated as more familiar than negative idioms. Finally, idioms without a literal counterpart were rated as more emotionally valenced and arousing than idioms with a literal counterpart. Although the meanings of ambiguous idioms were less correctly defined than those of unambiguous idioms, ambiguous idioms were rated as more concrete than unambiguous ones. We also discuss the relationships between the various psycholinguistic variables characterizing idioms, with reference to the literature on idiom structure and processing.
Automatic syntactic analysis of a corpus requires detailed lexical and morphological information that cannot always be harvested from traditional dictionaries. Therefore the development of a treebank presents an opportunity to simultaneously enrich the lexicon. In building NorGramBank, we use an incremental parsebanking approach, in which a corpus is parsed and disambiguated, and after improvements to the grammar and the lexicon, reparsed. In this context we have implemented a text preprocessing interface where annotators can enter unknown words or missing lexical information either before parsing or during disambiguation. The information added to the lexicon in this way may be of great interest both to lexicographers and to other language technology efforts.
This paper presents a historical Arabic corpus named HAC. At this early embryonic stage of the project, we report about the design, the architecture and some of the experiments which we have conducted on HAC. The corpus, and accordingly the search results, will be represented using a primary XML exchange format. This will serve as an intermediate exchange tool within the project and will allow the user to process the results offline using some external tools. HAC is made up of Classical Arabic texts that cover 1600 years of language use; the Quranic text, Modern Standard Arabic texts, as well as a variety of monolingual Arabic dictionaries. The development of this historical corpus assists linguists and Arabic language learners to effectively explore, understand, and discover interesting knowledge hidden in millions of instances of language use. We used techniques from the field of natural language processing to process the data and a graph-based representation for the corpus. We provided researchers with an export facility to render further linguistic analysis possible.
This article presents the Spanish Iarg-AnCora corpus (400 k-words, 13,883 sentences) annotated with the implicit arguments of deverbal nominalizations (18,397 occurrences). We describe the methodology used to create it, focusing on the annotation scheme and criteria adopted. The corpus was manually annotated and an interannotator agreement test was conducted (81 % observed agreement) in order to ensure the reliability of the final resource. The annotation of implicit arguments results in an important gain in argument and thematic role coverage (128 % on average). It is the first corpus annotated with implicit arguments for the Spanish language with a wide coverage that is freely available. This corpus can subsequently be used by machine learning-based semantic role labeling systems, and for the linguistic analysis of implicit arguments grounded on real data. Semantic analyzers are essential components of current language technology applications, which need to obtain a deeper understanding of the text in order to make inferences at the highest level to obtain qualitative improvements in the results.
The Active Listening Corpus (ALICO) is a multimodal data set of spontaneous dyadic conversations in German with diverse speech and gestural annotations of both dialogue partners. The annotations consist of short feedback expression transcriptions with corresponding communicative function interpretations as well as segmentations of interpausal units, words, rhythmic prominence intervals and vowel-to-vowel intervals. Additionally, ALICO contains head gesture annotations of both interlocutors. The corpus contributes to research on spontaneous human–human interaction, on functional relations between modalities, and timing variability in dialogue. It also provides data that differentiates between distracted and attentive listeners. We describe the main characteristics of the corpus and briefly present the most important results obtained from analyses in recent years.
This chapter discusses free-association responses to the primary purposes and other responses selected from the Palermo–Jenkins norms. A useful supplement to free-association norms is the additional determination of associations to responses that are elicited on the original test. These supplemental norms increase the number of association hierarchies available to the investigator and also provide information concerning the independent probabilities of chains of words. Free-association responses allow the independent manipulation of associative directionality of pairs of words, for example, A and B, that is, it becomes possible to choose word pairs on the basis of either the A-B or B-A associative strength. Listing of the associative probabilities of other words, given in response to B, can greatly increase the size of the pool of associative triads, that is, A-B-C chains. The original norms can be used to discover A-B-C word chains or word pairs varying in degree of bidirectionality. However, such a procedure identifies only a limited number of usable word pairs or chains.
In this study, we aimed to provide a large-scale set of psycholinguistic norms for 3,314 traditional Chinese characters, along with their naming reaction times (RTs), collected from 140 Chinese speakers. The lexical and semantic variables in the database include frequency, regularity, familiarity, consistency, number of strokes, homophone density, semantic ambiguity rating, phonetic combinability, semantic combinability, and the number of disyllabic compound words formed by a character. Multiple regression analyses were conducted to examine the predictive powers of these variables for the naming RTs. The results demonstrated that these variables could account for a significant portion of variance (55.8{\%}) in the naming RTs. An additional multiple regression analysis was conducted to demonstrate the effects of consistency and character frequency. Overall, the regression results were consistent with the findings of previous studies on Chinese character naming. This database should be useful for research into Chinese language processing, Chinese education, or cross-linguistic comparisons. The database can be accessed via an online inquiry system (http://ball.ling.sinica.edu.tw/namingdatabase/index.html).
Improvised acting is a viable technique to study expressive human communication and to shed light into actors’ creativity. The USC CreativeIT database provides a novel, freely-available multimodal resource for the study of theatrical improvisation and rich expressive human behavior (speech and body language) in dyadic interactions. The theoretical design of the database is based on the well-established improvisation technique of Active Analysis in order to provide naturally induced affective and expressive, goal-driven interactions. This database contains dyadic theatrical improvisations performed by 16 actors, providing detailed full body motion capture data and audio data of each participant in an interaction. The carefully engineered data collection, the improvisation design to elicit natural emotions and expressive speech and body language, as well as the well-developed annotation processes provide a gateway to study and model various aspects of theatrical performance, expressive behaviors and human communication and interaction.
This contribution aims to establish a set of validated vocal Italian pseudowords that convey three emotional tones (angry, happy, and neutral) for prosodic emotional processing research. We elaborated the materials by following a series of specific steps. First, we tested the valence of a set of written pseudowords generated by specific software. Two Italian actors (male and female) then recorded the resulting subset of linguistically legal and neutral pseudowords in three emotional tones. Finally, on the basis of the results of independent ratings of emotional intensity, we selected a set of 30 audio stimuli expressed in each of the three different emotions. Acoustic analyses indicated that the prosodic indexes of fundamental frequency, vocal intensity, and speech rate anchored individual perceptions of the emotions expressed. Finally, the acoustic profile of the set of emotional stimuli confirmed previous findings. The happy tone stimuli showed high f0 values, high intensity, high pitch variability, and a faster speech rate. The angry tone stimuli were also characterized by high f0 and intensity, but by relatively smaller pitch variability and a lower speech rate. This last profile echoes the description of "cold anger." This new set of prosodic emotion stimuli will constitute a useful resource for future research that requires emotional prosody materials. It could be used both for Italian and for cross-language studies.
Language identification, as the task of determining the language a given text is written in, has progressed substantially in recent decades. However, three main issues remain still unresolved: (1) distinction of similar languages, (2) detection of multilingualism in a single document, and (3) identifying the language of short texts. In this paper, we describe our work on the development of a benchmark to encourage further research in these three directions, set forth an evaluation framework suitable for the task, and make a dataset of annotated tweets publicly available for research purposes. We also describe the shared task we organized to validate and assess the evaluation framework and dataset with systems submitted by seven different participants, and analyze the performance of these systems. The evaluation of the results submitted by the participants of the shared task helped us shed some light on the shortcomings of state-of-the-art language identification systems, and gives insight into the extent to which the brevity, multilingualism, and language similarity found in texts exacerbate the performance of language identifiers. Our dataset with nearly 35,000 tweets and the evaluation framework provide researchers and practitioners with suitable resources to further study the aforementioned issues on language identification within a common setting that enables to compare results with one another.
The majority of the world’s languages have little to no NLP resources or tools. This is due to a lack of training data (“resources”) over which tools, such as taggers or parsers, can be trained. In recent years, there have been increasing efforts to apply NLP methods to a much broader swath of the world’s languages. In many cases this involves bootstrapping the learning process with enriched or partially enriched resources. We propose that Interlinear Glossed Text (IGT), a very common form of annotated data used in the field of linguistics, has great potential for bootstrapping NLP tools for resource-poor languages. Although IGT is generally very richly annotated, and can be enriched even further (e.g., through structural projection), much of the content is not easily consumable by machines since it remains “trapped” in linguistic scholarly documents and in human readable form. In this paper, we describe the expansion of the ODIN resource—a database containing many thousands of instances of IGT for over a thousand languages. We enrich the original IGT data by adding word alignment and syntactic structure. To make the data in ODIN more readily consumable by tool developers and NLP researchers, we adopt and extend a new XML format for IGT, called Xigt. We also develop two packages for manipulating IGT data: one, INTENT, enriches raw IGT automatically, and the other, XigtEdit, is a graphical IGT editor.
The Chinese Proposition Bank (CPB) is a corpus annotated with semantic roles for the arguments of verbal and nominalized predicates. The semantic roles for the core arguments are defined in a predicate-specific manner. That is, a set of semantic roles, numerically identified, are defined for each sense of a predicate lemma and recorded in a valency lexicon called frame files. The predicate-specific manner in which the semantic roles are defined reduces the cognitive burden on the annotators since they only need to internalize a few roles at a time and this has contributed to the consistency in annotation. It was also a sensible approach given the contentious issue of how many semantic roles are needed if one were to adopt of set of global semantic roles that apply to all predicates. A downside of this approach, however, is that the predicate-specific roles may not be consistent across predicates, and this inconsistency has a negative impact on training automatic systems. Given the progress that has been made in defining semantic roles in the last decade or so, time is ripe for adopting a set of general semantic roles. In this article, we describe our effort to “re-annotate” the CPB with a set of “global” semantic roles that are predicate-independent and investigate their impact on automatic semantic role labeling systems. When defining these global semantic roles, we strive to make them compatible with a recently published ISO standards on the annotation of semantic roles (ISO 24617-4:2014 SemAF-SR) while taking the linguistic characteristics of the Chinese language into account. We show that in spite of the much larger number of global semantic roles, the accuracy of an off-the-shelf semantic role labeling system retrained on the data re-annotated with global semantic roles is comparable to that trained on the data set with the original predicate-specific semantic roles. We also argue that the re-annotated data set, together with the original data, provides the user with more flexibility when using the corpus.
Sentiment analysis aims to extract the sentiment polarity of given segment of text. Polarity resources that indicate the sentiment polarity of words are commonly used in different approaches. While English is the richest language in regard to having such resources, the majority of other languages, including Turkish, lack polarity resources. In this work we present the first comprehensive Turkish polarity resource, SentiTurkNet, where three polarity scores are assigned to each synset in the Turkish WordNet, indicating its positivity, negativity, and objectivity (neutrality) levels. Our method is general and applicable to other languages. Evaluation results for Turkish show that the polarity scores obtained through this method are more accurate compared to those obtained through direct translation (mapping) from SentiWordNet.
This article describes a new software tool called RadicalLocator that can be used to automatically identify (e.g., for visual inspection) individual target radicals (i.e., groups of strokes) in written Chinese characters. We first briefly clarify why this software is useful for research purposes and discuss the factors that make this pattern recognition task so difficult. We then describe how the software can be downloaded and installed, and used to identify the radicals in characters for the purposes of, for example, selecting materials for psycholinguistic experiments. Finally, we discuss several known limitations of the software and heuristics for addressing them.
We collected subjective frequency, age-of-acquisition, and imageability norms for 319 acronyms from French adults. Objective printed frequency, bigram frequency, and lengths in letters, phonemes, and syllables, as well as orthographic neighbors, were computed. The time taken to read acronyms aloud was also recorded. Correlational analyses indicated that the relations between the psycholinguistic variables were similar to those usually found for common words (e.g., highly imageable acronyms were more frequent and learned earlier in life than less imageable acronyms), but were generally weaker in the former than in the latter. Linear mixed-model analyses performed on the reading latencies revealed that the main determinants were the voicing feature of initial phonemes, the type of pronunciation of the acronyms (ambiguous vs. unambiguous, typical vs. atypical characteristics), length (number of letters and number of syllables), together with bigram frequency, printed frequency, and imageability. Both objective frequency and imageability interacted reliably with the ambiguous typical and ambiguous atypical properties. Accuracy was predicted by the number of letters and by imageability factors: More errors occurred on longer than on shorter acronyms, and also more errors on less imageable than on more imageable acronyms. The theoretical and methodological implications of the findings for the understanding of acronym reading are discussed. The entire set of norms and the acronym reading times (and accuracy scores), together with the acronym definitions, are provided as supplemental materials.
Subjective estimations of age of acquisition (AoA) for a large pool of Spanish words were collected from college students in Spain. The average score for each word (based on 50 individual responses, on a scale from 1 to 11) was taken as an AoA indicator, and normative values for a total of 7,039 single words are provided as supplemental materials. Beyond its intrinsic value as a standalone corpus, the largest of its kind for Spanish, the value of the database is enhanced by the fact that it contains most of the words that are currently included in other normative studies, allowing for a more complete characterization of the lexical stimuli that are usually employed in studies with Spanish-speaking participants. The norms are available for downloading as supplemental materials with this article.
We describe the creation of a massively parallel corpus based on 100 translations of the Bible. We discuss some of the difficulties in acquiring and processing the raw material as well as the potential of the Bible as a corpus for natural language processing. Finally we present a statistical analysis of the corpora collected and a detailed comparison between the English translation and other English corpora.
Human faces are fundamentally dynamic, but experimental investigations of face perception have traditionally relied on static images of faces. Although naturalistic videos of actors have been used with success in some contexts, much research in neuroscience and psychophysics demands carefully controlled stimuli. In this article, we describe a novel set of computer-generated, dynamic face stimuli. These grayscale faces are tightly controlled for low- and high-level visual properties. All faces are standardized in terms of size, luminance, location, and the size of facial features. Each face begins with a neutral pose and transitions to an expression over the course of 30 frames. Altogether, 222 stimuli were created, spanning three different categories of movement: (1) an affective movement (fearful face), (2) a neutral movement (close-lipped, puffed cheeks with open eyes), and (3) a biologically impossible movement (upward dislocation of eyes and mouth). To determine whether early brain responses sensitive to low-level visual features differed between the expressions, we measured the occipital P100 event-related potential, which is known to reflect differences in early stages of visual processing, and the N170, which reflects structural encoding of faces. We found no differences between the faces at the P100, indicating that different face categories were well matched on low-level image properties. This database provides researchers with a well-controlled set of dynamic faces, controlled for low-level image characteristics, that are applicable to a range of research questions in social perception.
Gestures are commonly used together with spoken language in human communication. One major limitation of gesture investigations in the existing literature lies in the fact that the coding of forms and functions of gestures has not been clearly differentiated. This paper first described a recently developed Database of Speech and GEsture (DoSaGE) based on independent annotation of gesture forms and functions among 119 neurologically unimpaired right-handed native speakers of Cantonese (divided into three age and two education levels), and presented findings of an investigation examining how gesture use was related to age and linguistic performance. Consideration of these two factors, for which normative data are currently very limited or lacking in the literature, is relevant and necessary when one evaluates gesture employment among individuals with and without language impairment. Three speech tasks, including monologue of a personally important event, sequential description, and story-telling, were used for elicitation. The EUDICO Linguistic ANnotator (ELAN) software was used to independently annotate each participant's linguistic information of the transcript, forms of gestures used, and the function for each gesture. About one-third of the subjects did not use any co-verbal gestures. While the majority of gestures were non-content-carrying, which functioned mainly for reinforcing speech intonation or controlling speech flow, the content-carrying ones were used to enhance speech content. Furthermore, individuals who are younger or linguistically more proficient tended to use fewer gestures, suggesting that normal speakers gesture differently as a function of age and linguistic performance.
Lexvo.org brings information about languages, words, and other linguistic entities to the Web of Linked Data. It defines URIs for terms, languages, scripts, and characters, which are not only highly interconnected but also linked to a variety of resources on the Web. Additionally, new datasets are being published to contribute to the emerging Linked Data Cloud of Language-Related information.
In the present article, we introduce the Nencki Affective Word List (NAWL), created in order to provide researchers with a database of 2,902 Polish words, including nouns, verbs, and adjectives, with ratings of emotional valence, arousal, and imageability. Measures of several objective psycholinguistic features of the words (frequency, grammatical class, and number of letters) are also controlled. The database is a Polish adaptation of the Berlin Affective Word List-Reloaded (BAWL-R; V{\~{o}} et al., Behavior Research Methods 41:534-538, 2009), commonly used to investigate the affective properties of German words. Affective normative ratings were collected from 266 Polish participants (136 women and 130 men). The emotional ratings and psycholinguistic indexes provided by NAWL can be used by researchers to better control the verbal materials they apply and to adjust them to specific experimental questions or issues of interest. The NAWL is freely accessible to the scientific community for noncommercial use as supplementary material to this article.
We present a database of 858 German words from the semantic fields of authority and community, which represent core dimensions of human sociality. The words were selected on the basis of co-occurrence profiles of representative keywords for these semantic fields. All words were rated along five dimensions, each measured by a bipolar semantic-differential scale: Besides the classic dimensions of affective meaning (valence, arousal, and potency), we collected ratings of authority and community with newly developed scales. The results from cluster, correlational, and multiple regression analyses on the rating data suggest a robust negativity bias for authority valuation among German raters recruited via university mailing lists, whereas community ratings appear to be rather unrelated to the well-established affective dimensions. Furthermore, our data involve a strong overall negative correlation-rather than the classical U-shaped distribution-between valence and arousal for socially relevant concepts. Our database provides a valuable resource for research questions at the intersection of cognitive neuroscience and social psychology. It can be downloaded as supplemental materials with this article.
Learner corpora consist of texts produced by non-native speakers. In addition to these texts, some learner corpora also contain error annotations, which can reveal common errors made by language learners, and provide training material for automatic error correction. We present a novel type of error-annotated learner corpus containing sequences of revised essay drafts written by non-native speakers of English. Sentences in these drafts are annotated with comments by language tutors, and are aligned to sentences in subsequent drafts. We describe the compilation process of our corpus, present its encoding in TEI XML, and report agreement levels on the error annotations. Further, we demonstrate the potential of the corpus to facilitate research on textual revision in L2 writing, by conducting a case study on verb tenses using ANNIS, a corpus search and visualization platform.
Research on the emotional, cognitive, and social determinants of moral judgment has surged in recent years. The development of moral foundations theory (MFT) has played an important role, demonstrating the breadth of morality. Moral psychology has responded by investigating how different domains of moral judgment are shaped by a variety of psychological factors. Yet, the discipline lacks a validated set of moral violations that span the moral domain, creating a barrier to investigating influences on judgment and how their neural bases might vary across the moral domain. In this paper, we aim to fill this gap by developing and validating a large set of moral foundations vignettes (MFVs). Each vignette depicts a behavior violating a particular moral foundation and not others. The vignettes are controlled on many dimensions including syntactic structure and complexity making them suitable for neuroimaging research. We demonstrate the validity of our vignettes by examining respondents' classifications of moral violations, conducting exploratory and confirmatory factor analysis, and demonstrating the correspondence between the extracted factors and existing measures of the moral foundations. We expect that the MFVs will be beneficial for a wide variety of behavioral and neuroimaging investigations of moral cognition.
Pictures are often used in studies on memory, perception, and language; normative data are thus needed for such visual stimuli. In the present study, we aimed to obtain normative data for a set of 272 black-and-white pictures from middle-aged and elderly Persian speakers. A total of 206 volunteers were divided into two groups: a middle-aged (40-59 years old) group and an elderly (60 years old and over) group. The groups had similar characteristics in terms of education. Norms for every picture were developed to provide measures of name agreement, image agreement, conceptual familiarity, age of acquisition, and visual complexity. The results revealed that all of these measures vary with age, except for conceptual familiarity.
This article presents valence/pleasantness, activity/arousal, power/dominance, origin, subjective significance, and source-of-experience norms for 1,586 Polish words (primarily nouns), adapted from the Affective Norms for English Words list (1,040 words) and from my own previous research (546 words), regarding the duality-of-mind approach for emotion formation. This is a first attempt at creating affective norms for Polish words. The norms are based on ratings by a total of 1,670 college students (852 females and 818 males) from different Warsaw universities and academies, studying various disciplines in equal proportions (humanities, engineering, and social and natural sciences) using a 9-point Likert Self-Assessment Manikin scale. Each participant assessed 240 words on six different scales (40 words per scale) using a paper-and-pencil group survey procedure. These affective norms for Polish words are a valid and useful tool that will allow researchers to use standard, well-known verbal materials comparable to the materials used in other languages (English, German, Portuguese, Spanish, French, Dutch, etc.). The normative values of the Polish adaptation of affective norms are included in the online supplemental materials for this article.
Since the work of Taft and Forster (1976), a growing literature has examined how English compound words are recognized and organized in the mental lexicon. Much of this research has focused on whether compound words are decomposed during recognition by manipulating the word frequencies of their lexemes. However, many variables may impact morphological processing, including relational semantic variables such as semantic transparency, as well as additional form-related and semantic variables. In the present study, ratings were collected on 629 English compound words for six variables [familiarity, age of acquisition (AoA), semantic transparency, lexeme meaning dominance (LMD), imageability, and sensory experience ratings (SER)]. All of the compound words selected for this study are contained within the English Lexicon Project (Balota et al., 2007), which made it possible to use a regression approach to examine the predictive power of these variables for lexical decision and word naming performance. Analyses indicated that familiarity, AoA, imageability, and SER were all significant predictors of both lexical decision and word naming performance when they were added separately to a model containing the length and frequency of the compounds, as well as the lexeme frequencies. In addition, rated semantic transparency also predicted lexical decision performance. The database of English compound words should be beneficial to word recognition researchers who are interested in selecting items for experiments on compound words, and it will also allow researchers to conduct further analyses using the available data combined with word recognition times included in the English Lexicon Project.
Knowledge of specific characteristics of verbal material is imperative in cognitive research, and this need calls for periodical updating of normative data. With this aim, and considering that the most recent Spanish-language category norms for adults date back to more than 30 years ago, and that they do not include some very common categories, a new normative study was conducted. In this study, production data for exemplars in the 56 categories of Battig and Montague (Journal of Experimental Psychology, 80, 1-46, 1969) were collected from a pool of 284 young adults who were native speakers of Spanish using an exemplar production task. With the goal of providing a useful tool for cognitive research to be conducted with Spanish-speaking samples, indices of frequency, rank, and lexical availability for the exemplars of each category are provided in a computerized database. The norms described are available for downloading as supplemental material with this article.
This paper intends to present a machine readable Romanian language pronunciation dictionary called NaviRo. The dictionary contains 138,500 unique words from the DexOnline dictionary together with their phonetic transcriptions in speech assessment method phonetic alphabet. The development of the pronunciation dictionary and the performed validation tests are also described in the paper. NaviRo pronunciation dictionary is freely available on the project website (http://users.utcluj.ro/~jdomokos/naviro) in plain text, Hidden Markov Model Toolkit and Festival speech synthesis system dictionary format. There are also available for download the used grapheme and phoneme sets and the audio samples for the used phonemes. The use of these resources is completely unrestricted for any research purposes in order to speed up Romanian language speech technology research.
To investigate the differences in communicative activities by the same interlocutors in Japanese (their L1) and in English (their L2), an 8-h multimodal corpus of multiparty conversations was collected. Three subjects participated in each conversational group, and they had conversations on free-flowing and goal-oriented topics in Japanese and in English. Their utterances, eye gazes, and gestures were recorded with microphones, eye trackers, and video cameras. The utterances and eye gazes were manually annotated. Their utterances were transcribed, and the transcriptions of each participant were aligned with those of the others along the time axis. Quantitative analyses were made to compare the communicative activities caused by the differences in conversational languages, the conversation types, and the levels of language expertise in L2. The results reveal different utterance characteristics and gaze patterns that reflect the differences in difficulty felt by the participants in each conversational condition. Both total and average durations of utterances were shorter in their L2 than in their L1 conversations. Differences in eye gazes were mainly found in those toward the information senders: Speakers were gazed at more in their second-language than in their native-language conversations. Our findings on the characteristics of conversations in the second language suggest possible directions for future research in psychology, cognitive science, and human–computer interaction technologies.
The paper describes the combined results of several projects which constitute a basic language resource infrastructure for printed historical Slovene. The IMP language resources consist of a digital library, an annotated corpus and a lexicon, which are interlinked and uniformly encoded following the Text Encoding Initiative Guidelines. The library holds about 650 units (mostly complete books) consisting of facsimiles with 45,000 pages as well as hand-corrected and structured transcriptions. The hand-annotated corpus has 300,000 tokens, where each word is tagged with its modernised word form, lemma, part-of-speech and, in cases of archaic words, its nearest contemporary equivalents. This information was extracted into the lexicon, which also covers an extended target-annotated corpus, resulting in 20,000 lemmas (of these 4,000 archaic) with 50,000 modern word forms and 70,000 attested forms. We have also developed a program to modernise, tag and lemmatise historical Slovene, and annotated the digital library with it, producing an automatically annotated corpus of 15 million words. To serve the humanities, the digital library and lexicon are available for reading and browsing on the web and the corpora via a concordancer. For language technology research and development the resources are available in source TEI XML under the Creative Commons Attribution licence. The paper presents the IMP resources, available from http://nl.ijs.si/imp/, the process of their compilation, encoding and dissemination, and concludes with directions for future research.
The present study introduces the first Spanish database with normative ratings of semantic similarity for 185 word triplets. Each word triplet is constituted by a target word (e.g., guisante [pea]) and two semantically related and nonassociatively related words: a word highly related in meaning to the target (e.g., jud{\'{i}}a [bean]), and a word less related in meaning to the target (e.g., patata [potato]). The degree of meaning similarity was assessed by 332 participants by using a semantic similarity rating task on a 9-point scale. Pairs having a value of semantic similarity ranging from 5 to 9 were classified as being more semantically related, whereas those with values ranging from 2 to 4.99 were considered as being less semantically related. The relative distance between the two pairs for the same target ranged from 0.48 to 5.07 points. Mean comparisons revealed that participants rated the more similar words as being significantly more similar in meaning to the target word than were the less similar words. In addition to the semantic similarity norms, values of concreteness and familiarity of each word in a triplet are provided. The present database can be a very useful tool for scientists interested in designing experiments to examine the role of semantics in language processing. Since the variable of semantic similarity includes a wide range of values, it can be used as either a continuous or a dichotomous variable. The full database is available in the supplementary materials.
We present SUBTLEX-PL, Polish word frequencies based on movie subtitles. In two lexical decision experiments, we compare the new measures with frequency estimates derived from another Polish text corpus that includes predominantly written materials. We show that the frequencies derived from the two corpora perform best in predicting human performance in a lexical decision task if used in a complementary way. Our results suggest that the two corpora may have unequal potential for explaining human performance for words in different frequency ranges and that corpora based on written materials severely overestimate frequencies for formal words. We discuss some of the implications of these findings for future studies comparing different frequency estimates. In addition to frequencies for word forms, SUBTLEX-PL includes measures of contextual diversity, part-of-speech-specific word frequencies, frequencies of associated lemmas, and word bigrams, providing researchers with necessary tools for conducting psycholinguistic research in Polish. The database is freely available for research purposes and may be downloaded from the authors' university Web site at http://crr.ugent.be/subtlex-pl .
For 84 unique topic-vehicle pairs (e.g., knowledge-power), participants produced associated properties for the topics (e.g., knowledge), vehicles (e.g., power), metaphors (knowledge is power), and similes (knowledge is like power). For these properties, we also obtained frequency, saliency, and connotativeness scores (i.e., how much the properties deviated from the denotative or literal meaning). In addition, we examined whether expression type (metaphor vs. simile) impacted the interpretations produced. We found that metaphors activated more salient properties than did similes, but the connotativeness levels for metaphor and simile salient properties were similar. Also, the two types of expressions did not differ across a wide range of measures collected: aptness, conventionality, familiarity, and interpretive diversity scores. Combined with the property lists, these interpretation norms constitute a thorough collection of data about metaphors and similes, employing the same topic-vehicle words, which can be used in psycholinguistic and cognitive neuroscience studies to investigate how the two types of expressions are represented and processed. These norms should be especially useful for studies that examine the online processing and interpretation of metaphors and similes, as well as for studies examining how properties related to metaphors and similes affect the interpretations produced.
Film clips are widely utilized to elicit emotion in a variety of research studies. Normative ratings for scenes selected for these purposes support the idea that selected clips correspond to the intended target emotion, but studies reporting normative ratings are limited. Using an ethnically diverse sample of college undergraduates, selected clips were rated for intensity, discreteness, valence, and arousal. Variables hypothesized to affect the perception of stimuli (i.e., gender, race-ethnicity, and familiarity) were also examined. Our analyses generally indicated that males reacted strongly to positively valenced film clips, whereas females reacted more strongly to negatively valenced film clips. Caucasian participants tended to react more strongly to the film clips, and we found some variation by race-ethnicity across target emotions. Finally, familiarity with the films tended to produce higher ratings for positively valenced film clips, and lower ratings for negatively valenced film clips. These findings provide normative ratings for a useful set of film clips for the study of emotion, and they underscore factors to be considered in research that utilizes scenes from film for emotion elicitation.
We collected sensory experience ratings (SERs) for 1,659 French words in adults. Sensory experience for words is a recently introduced variable that corresponds to the degree to which words elicit sensory and perceptual experiences (Juhasz {\&} Yap Behavior Research Methods, 45, 160-168, 2013; Juhasz, Yap, Dicke, Taylor, {\&} Gullick Quarterly Journal of Experimental Psychology, 64, 1683-1691, 2011). The relationships of the sensory experience norms with other psycholinguistic variables (e.g., imageability and age of acquisition) were analyzed. We also investigated the degree to which SER predicted performance in visual word recognition tasks (lexical decision, word naming, and progressive demasking). The analyses indicated that SER reliably predicted response times in lexical decision, but not in word naming or progressive demasking. The findings are discussed in relation to the status of SER, the role of semantic code activation in visual word recognition, and the embodied view of cognition.
We present a syntactic parser of (transcripts of) spoken Hebrew: a dependency parser of the Hebrew CHILDES database. CHILDES is a corpus of child–adult linguistic interactions. Its Hebrew section has recently been morphologically analyzed and disambiguated, paving the way for syntactic annotation. This paper describes a novel annotation scheme of dependency relations reflecting constructions of child and child-directed Hebrew utterances. A subset of the corpus was annotated with dependency relations according to this scheme, and was used to train two parsers (MaltParser and MEGRASP) with which the rest of the data were parsed. The adequacy of the annotation scheme to the CHILDES data is established through numerous evaluation scenarios. The paper also discusses different annotation approaches to several linguistic phenomena, as well as the contribution of morphological features to the accuracy of parsing.
[Correction Notice: An Erratum for this article was reported in Vol 47(1) of Behavior Research Methods (see record 2015-07948-022). In Table 3, in the column "Lexical Content of the Sentences" in the "Incongruent" row, all five words (“Anger, Disgust, Fear, Happiness, Sadness”) should be replaced by “Neutral”.] Research on emotional speech often requires valid stimuli for assessing perceived emotion through prosody and lexical content. To date, no comprehensive emotional speech database for Persian is officially available. The present article reports the process of designing, compiling, and evaluating a comprehensive emotional speech database for colloquial Persian. The database contains a set of 90 validated novel Persian sentences classified in five basic emotional categories (anger, disgust, fear, happiness, and sadness), as well as a neutral category. These sentences were validated in two experiments by a group of 1,126 native Persian speakers. The sentences were articulated by two native Persian speakers (one male, one female) in three conditions: (1) congruent (emotional lexical content articulated in a congruent emotional voice), (2) incongruent (neutral sentences articulated in an emotional voice), and (3) baseline (all emotional and neutral sentences articulated in neutral voice). The speech materials comprise about 470 sentences. The validity of the database was evaluated by a group of 34 native speakers in a perception test. Utterances recognized better than five times chance performance (71.4 {\%}) were regarded as valid portrayals of the target emotions. Acoustic analysis of the valid emotional utterances revealed differences in pitch, intensity, and duration, attributes that may help listeners to correctly classify the intended emotion. The database is designed to be used as a reliable material source (for both text and speech) in future cross-cultural or cross-linguistic studies of emotional speech, and it is available for academic research purposes free of charge. To access the database, please contact the first author. (PsycINFO Database Record (c) 2015 APA, all rights reserved)(journal abstract)
In this paper we present a language-independent, fully modular and automatic approach to bootstrap a wordnet for a new language by recycling different types of already existing language resources, such as machine-readable dictionaries, parallel corpora, and Wikipedia. The approach, which we apply here to Slovene, takes into account monosemous and polysemous words, general and specialised vocabulary as well as simple and multi-word lexemes. The extracted words are then assigned one or several synset ids, based on a classifier that relies on several features including distributional similarity. Finally, we identify and remove highly dubious (literal, synset) pairs, based on simple distributional information extracted from a large corpus in an unsupervised way. Automatic, manual and task-based evaluations show that the resulting resource, the latest version of the Slovene wordnet, is already a valuable source of lexico-semantic information.
Treebanks, especially the Penn treebank for natural language processing (NLP) in English, play an essential role in both research into and the application of NLP. However, many languages still lack treebanks and building a treebank can be very complicated and difficult. This work has a twofold objective. Firstly, to share our results in constructing a large Vietnamese treebank (VTB) with three levels of annotation including word segmentation, part-of-speech tagging, and syntactic analysis. Major steps in the treebank construction process are described with particular regard to specific Vietnamese properties such as lack of word delimiter and isolation. Those properties make sentences highly syntactically ambiguous, and therefore it is difficult to ensure a high level of agreement among annotators. Various studies of Vietnamese syntax were employed not only to define annotations but also to systematically deal with ambiguities. Annotators were supported by automatic labelling tools, which are based on statistical machine learning methods, for sentence pre-processing and a tree editor for supporting manual annotation. As a result, an annotation agreement of around 90 % was achieved. Our second objective is to present our method for automatically finding errors and inconsistencies in treebank corpora and its application to the construction of the VTB. This method employs the Shannon entropy measure in a manner that the more reduced entropy the more corrected errors in a treebank. The method ranks error candidates by using a scoring function based on conditional entropy. Our experiments showed that this method detected high-error-density subsets of original error candidate sets, and that the corpus entropy was significantly reduced after error correction. The size of these subsets was only about one third of the whole set, while these subsets contained 80–90 % of the total errors. This method can also be applied to languages similar to Vietnamese.
The language used in social media is often characterized by the abundance of informal and non-standard writing. The normalization of this non-standard language can be crucial to facilitate the subsequent textual processing and to consequently help boost the performance of natural language processing tools applied to social media text. In this paper we present a benchmark for lexical normalization of social media posts, specifically for tweets in Spanish language. We describe the tweet normalization challenge we organized recently, analyze the performance achieved by the different systems submitted to the challenge, and delve into the characteristics of systems to identify the features that were useful. The organization of this challenge has led to the production of a benchmark for lexical normalization of social media, including an evaluation framework, as well as an annotated corpus of Spanish tweets—TweetNorm_es—, which we make publicly available. The creation of this benchmark and the evaluation has brought to light the types of words that submitted systems did best with, and posits the main shortcomings to be addressed in future work.
Compared with English, Arabic is a poorly-resourced language within the field of corpus linguistics. A lack of sufficient data and research has negatively affected Arabic corpus-based researchers and natural language processing practitioners. Although a number of Arabic corpora have been developed in recent years, the overall situation has improved little. The aim of this paper is twofold. First, it reviews 14 Arabic corpora categorized by their designated purpose, target language, mode of text, size, text date, location, text type/medium, text domain, representativeness, and balance. The review also describes the availability of the reviewed corpora, the presence of tokenization, lemmatization and tagging, and whether there are any tools available to search and explore them. Second, it introduces the King Abdulaziz City for Science and Technology (KACST) Arabic corpus, which was designed and created to overcome the limitations of existing Arabic corpora. The KACST Arabic corpus is a large and diverse Arabic corpus with clearly defined design criteria. It is carefully sampled, and its contents are classified based on time, region, medium, domain, and topic, and it can be searched and explored using these classifications. The KACST Arabic corpus comprises more than 700 million words from the pre-Islamic era to the present day (a period covering more than 1,500 years), collected from 10 diverse mediums. Each text has been further classified more specifically into domains and topics. The KACST Arabic corpus is freely available to explore on the Internet (http://www.kacstac.org.sa) using a variety of tools.
A corpus of French tales is presented. Its two parts, a text corpus and a speech corpus, were designed for studying the relationships between the textual structures of tales and speech prosody, with the targeted application of an expressive text-to-speech synthesis system embedded in a humanoid robot. The 89-tale text corpus, and the 12-tale speech corpus were annotated using a common tale description framework. Lexical level annotations include extended definitions of enumerations, time, place and person named entities, as well as part of speech tags. Supra-lexical level annotations include the segmentation of tales into a sequence of episodes, the localization and attribution of direct quotations, together with tale protagonists co-references. Annotation distributions and inter-annotator agreement were analyzed. The largest coverage and strongest agreement were observed for person named entities, characters’ direct quotations, and their associated coreference chains. Speech corpus annotations were extended to allow the analysis of the relations between tale linguistic information and prosodic properties observed in associated speech. Word and phoneme boundaries were inferred through semi-automatic procedures, resulting in linguistic annotations aligned with the speech signal. Intonation stylization models were used to ease the visual and statistical analysis of tale’s prosody. Additional meta-information is provided with the speech corpus, allowing describing tale characters according to their gender, age, size, valence and kind. The corpora described in this article are publicly available through the European Language Resources Association catalog.
This paper introduces the South East Asia Mandarin–English corpus, a 63-h spontaneous Mandarin–English code-switching transcribed speech corpus suitable for LVCSR and language change detection/identification research. The corpus is recorded under unscripted interview and conversational settings from 157 Singaporean and Malaysian speakers who spoke a mixture of Mandarin and English within a single sentence. About 82 % of the transcribed utterances are intra-sentential code-switching speech and the corpus will be release by LDC in 2015. This paper presents an analysis of the code-switching statistics of the corpus, such as the duration of monolingual segments and the frequency of language turns in code-switch utterances. We also summarize the development effort, details such as the processing time for transcription, validation and language boundary labelling. Lastly, we present textual analyses of code-switch segments examining the word length of monolingual segments in code-switch utterances and the most common single word and two-word phrase of such segments.
This paper presents the IULA Spanish LSP Treebank, an open-source treebank of over 40,000 sentences, developed in the framework of the European project METANET4U. The IULA Spanish LSP Treebank is the first technical corpus of Spanish annotated at surface syntactic level, following the dependency grammar theory. We present the method we used to create the resource and the linguistic annotations that the treebank provides, using examples and comparing with similar resources. We also provide the statistics of the treebank and the evaluation results.
The paper presents the Chinese Discourse TreeBank, a corpus annotated with Penn Discourse TreeBank style discourse relations that take the form of a predicate taking two arguments. We first characterize the syntactic and statistical distributions of Chinese discourse connectives as well as the role of Chinese punctuation marks in discourse annotation, and then describe how we design our annotation strategy procedure based on this characterization. The Chinese-specific features of our annotation strategy include annotating explicit and implicit discourse relations in one single pass, defining the argument labels on semantic, rather than syntactic, grounds, as well as annotating the semantic type of implicit discourse relations directly. We also introduce a flat, 11-valued semantic type classification scheme for discourse relations. We finally demonstrate the feasibility of our approach with evaluation results.
The role of objects' motor affordances in cognition is a topic that has gained in popularity over the last decades. However, few studies exist that have normed the different motor dimensions of the objects; this limits researchers regarding usable stimuli, as well as comparability between studies. In the present study, we normed a set of 560 objects on four motor dimensions: the ease with which they can be grasped, moved, and pantomimed and the number of actions they afford. We then examined whether these four dimensions predict objects' naming latency. We believe that these norms will allow researchers interested in the role of motor affordances to have a better control over the dimensions they want to manipulate.
The present article introduces a Russian-language database of 375 action pictures and associated verbs with normative data. The pictures were normed for name agreement, conceptual familiarity, and subjective visual complexity, and measures of age of acquisition, imageability, and image agreement were collected for the verbs. Values of objective visual complexity, as well as information about verb frequency, length, argument structure, instrumentality, and name relation, are also provided. Correlations between these parameters are presented, along with a comparative analysis of the Russian name agreement norms and those collected in other languages. The full set of pictorial stimuli and the obtained norms may be freely downloaded from http://neuroling.ru/en/db.htm for use in research and for clinical purposes.
Although many facial and vocal databases are available for research, very few of them have controlled the range of attractiveness of the stimuli that they offer. To fill this gap, we created the GEneva Faces and Voices (GEFAV) database, providing standardized faces (static and dynamic neutral, smiling) and voices (speaking sentences, vowels) of young European adults. A total of 61 women and 50 men 18-35 years old agreed to be part of the GEFAV stimuli, and two rating studies involving 285 participants provided evaluations of the facial and vocal samples. The final set of stimuli was satisfactory in terms of attractiveness range (wide and rather symmetrical distribution over the attractiveness continuum) and the reliability of the ratings (high consistency between the two rating studies, high interrater agreement in the final rating study). Moreover, the database showed an adequate validity, since a series of findings described by earlier research on human attractiveness were confirmed-namely, that facial and vocal attractiveness are predicted by femininity and health in women, and by masculinity, dominance, and trustworthiness in men. In future studies, the GEFAV stimuli may be used intact or transformed, individually or in multimodal combinations, to investigate a wide range of mechanisms, such as the behavioral, neuropsychological, and neurophysiological processes involved in social cognition.
We present the Finnish PropBank, a resource for semantic role labeling (SRL) of Finnish based on the Turku Dependency Treebank whose syntax is annotated in the well-known Stanford Dependency (SD) scheme. The contribution of this paper consists of the lexicon of the verbs and their arguments present in the treebank, as well as the predicate-argument annotation of all verb occurrences in the treebank text. We demonstrate that the annotation is of high quality, that the SD scheme is highly compatible with PropBank annotation, and further that the additional dependencies present in the Turku Dependency Treebank are clearly beneficial for PropBank annotation. Further, we also use the PropBank to provide a strong baseline for automated Finnish SRL using a machine learning SRL system developed for the SemEval’14 shared task on broad-coverage semantic dependency parsing. The PropBank as well as the SRL system are available under a free license at http://bionlp.utu.fi/.
This article introduces childLex, an online database of German read by children. childLex is based on a corpus of children's books and comprises 10 million words that were syntactically annotated and lemmatized. childLex reports linguistic norms for lexical, superlexical, and sublexical variables in three different age groups: 6-8 (grades 1-2), 9-10 (grades 3-4), and 11-12 years (grades 5-6). Here, we describe how childLex was collected and analyzed. In addition, we provide information about the distributions of word frequency, word length, and orthographic neighborhood size, as well as their intercorrelations. Finally, we explain how childLex can be accessed using a Web interface.
Research on the multimodal aspects of interactional language use requires high-quality multimodal resources. In contrast to the vast amount of available written language corpora and collections of transcribed spoken language, truly multimodal corpora including visual as well as auditory data are scarce. In this paper, we first discuss a few notable exceptions that do provide high-quality and multiple-angle video recordings of face-to-face conversations. We then present a new multimodal corpus design that adds two dimensions to the existing resources. First, the recording set-up was designed in such a way as to have a full view of the dialogue partners’ gestural behaviour, including hand gestures, facial expressions and body posture. Second, by recording the participant perspective and behaviour during conversation, using head-mounted scene cameras and eye-trackers, we obtained a 3D landscape of the conversation, with detailed production information (scene camera and sound) and indices of cognitive processing (eye movements for gaze analysis) for both participants. In its current form, the resulting InSight Interaction Corpus consists of 15 recorded face-to-face interactions of 20 min each, of which five have been transcribed and annotated for a range of linguistic and gestural features, using the ELAN multimodal annotation tool.
Researchers studying a range of psychological phenomena (e.g., theory of mind, emotion, stereotyping and prejudice, interpersonal attraction, etc.) sometimes employ photographs of people as stimuli. In this paper, we introduce the Chicago Face Database, a free resource consisting of 158 high-resolution, standardized photographs of Black and White males and females between the ages of 18 and 40 years and extensive data about these targets. In Study 1, we report pre-testing of these faces, which includes both subjective norming data and objective physical measurements of the images included in the database. In Study 2 we surveyed psychology researchers to assess the suitability of these targets for research purposes and explored factors that were associated with researchers' judgments of suitability. Instructions are outlined for those interested in obtaining access to the stimulus set and accompanying ratings and measures.
All words have properties linked to form, meaning and usage patterns which influence how easily they are accessed from the mental lexicon in language production, perception and comprehension. Examples of such properties are imageability, phonological and morphological complexity, word class, argument structure, frequency of use and age of acquisition. Due to linguistic and cultural variation the properties and the values associated with them differ across languages. Hence, for research as well as clinical purposes, language specific information on lexical properties is needed. To meet this need, an electronically searchable lexical database with more than 1600 Norwegian words coded for more than 12 different properties has been established. This article presents the content and structure of the database as well as the search options available in the interface. Finally, it briefly describes some of the ways in which the database can be used in research, clinical practice and teaching. (PsycINFO Database Record (c) 2016 APA, all rights reserved)
We examined the potential advantage of the lexical databases using subtitles and present SUBTLEX-PT, a new lexical database for 132,710 Portuguese words obtained from a 78 million corpus based on film and television series subtitles, offering word frequency and contextual diversity measures. Additionally we validated SUBTLEX-PT with a lexical decision study involving 1920 Portuguese words (and 1920 nonwords) with different lengths in letters (M = 6.89, SD = 2.10) and syllables (M = 2.99, SD = 0.94). Multiple regression analyses on latency and accuracy data were conducted to compare the proportion of variance explained by the Portuguese subtitle word frequency measures with that accounted by the recent written-word frequency database (Procura-PALavras; P-PAL; Soares, Iriarte, et al., 2014). As its international counterparts, SUBTLEX-PT explains approximately 15% more of the variance in the lexical decision performance of young adults than the P-PAL database. Moreover, in line with recent studies, contextual diversity accounted for approximately 2% more of the variance in participants' reading performance than the raw frequency counts obtained from subtitles. SUBTLEX-PT is freely available for research purposes (at http://p-pal.di.uminho.pt/about/databases). (PsycINFO Database Record (c) 2018 APA, all rights reserved)
We present a collection of association norms for 246 German depictable compound nouns and their constituents, comprising 58,652 association tokens distributed over 26,004 stimulus–associate pair types. Analyses of the data revealed that participants mainly provided noun associates, followed by adjective and verb associates. In corpus analyses, co-occurrence values for compounds and their associates were below those for nouns in general and their associates. The semantic relations between compound stimuli and their associates were more often co-hyponymy and hypernymy and less often hyponymy than for associations to nouns in general. Finally, we found a moderate correlation between the overlap of the associations to compounds and their constituents and the degree of semantic transparency. These data represent a collection of associations to German compound nouns and their constituents that constitute a valuable resource concerning the lexical semantic properties of the compound stimuli and the semantic relations between the stimuli and their associates. More specifically, the norms can be used for stimulus selection, hypothesis testing, and further research on morphologically complex words. The norms are available in text format (utf-8 encoding) as supplemental materials.
This article presents a new corpus of 820 words pertaining to 14 semantic categories, 7 natural (animals, body parts, insects, flowers, fruits, trees, and vegetables) and 7 man-made (buildings, clothing, furniture, kitchen utensils, musical instruments, tools, and vehicles); each word in the database was collected empirically in a previous exemplar generation study. In the present study, 152 Spanish speakers provided data for four psycholinguistic variables known to affect lexical-semantic processing in both neurologically intact and brain-damaged participants: age of acquisition, familiarity, manipulability, and typicality. Furthermore, we collected lexical frequency data derived from Internet search hits, plus three additional Spanish lexical frequency indexes. Word length, number of syllables, and the proportion of respondents citing the exemplar as a category member-which can be useful as an additional measure of typicality-are also provided. Reliability and validity indexes showed that our items display characteristics similar to those of other corpora. Overall, this new corpus of words provides a useful tool for scientists engaged in cognitive- and neuroscience-based research focused on examining language, memory, and object processing. The full set of norms can be downloaded from www.psychonomic.org/archive.
This article presents the NeoHelp visual stimulus set created to facilitate investigation of need-of-help recognition with clinical and normative populations of different ages, including children. Need-of-help recognition is one aspect of socioemotional development and a necessary precondition for active helping. The NeoHelp consists of picture pairs showing everyday situations: The first item in a pair depicts a child needing help to achieve a goal; the second one shows the child achieving the goal. Pictures of birds in analogue situations are also included. These control stimuli enable implementation of a human-animal categorization task which serves to separate behavioral correlates specific to need-of-help recognition from general differentiation processes. It is a concern in experimental research to ensure that results do not relate to systematic perceptual differences when comparing responses to categories of different content. Therefore, we not only derived the NeoHelp-pictures within a pair from one another by altering as little as possible, but also assessed their perceptual similarity empirically. We show that NeoHelp-picture pairs are very similar regarding low-level perceptual properties across content categories. We obtained data from 60 children in a broad age range (4 to 13 years) for three different paradigms, in order to assess whether the intended categorization and differentiation could be observed reliably in a normative population. Our results demonstrate that children can differentiate the pictures' content regarding both need-of-help category as well as species as intended in spite of the high perceptual similarities. We provide standard response characteristics (hit rates and response times) that are useful for future selection of stimuli and comparison of results across studies. We show that task requirements coherently determine which aspects of the pictures influence response characteristics. Thus, we present NeoHelp, the first open-access standardized visual stimuli set for investigation of need-of-help recognition and invite researchers to use and extend it.
The Chinese language has more native speakers than any other language, but research on the reading of Chinese characters is still not as well-developed as it is for the reading of words in alphabetic languages. Two areas notably lacking are the paucity of megastudies in Chinese and the relatively infrequent use of the lexical decision paradigm to investigate single-character recognition. The Chinese Lexicon Project, described in this article, is a database of lexical decision latencies for 2,500 Chinese single characters in simplified script, collected from a sample of native mainland Chinese (Mandarin) speakers (N = 35). This resource will provide a valuable adjunct to influential mega-databases, such as the English, French, and Dutch Lexicon Projects. Using two separate analyses, some advantages associated with megastudies are exemplified. These include the selection of the strongest measure to represent Chinese character frequency (Cai {\&} Brysbaert's (PLoS ONE 5(6): e10729, 2010) subtitle contextual diversity frequency count), and the conducting of virtual studies to replicate and clarify existing findings. The unique morpho-syllabic nature of the Chinese writing system makes it a valuable case study for functional language contrasts. Moreover, this is the first publicly available large-scale repository of behavioral responses pertaining to Chinese language processing (the behavioral dataset is attached to this article, as a supplemental file available for download). For these reasons, the data should be of substantial interest to psychologists, linguists, and other researchers.
In order to explore the role of the main psycholinguistic variables on visual word recognition, several mega-studies have been conducted in English in recent years. Nevertheless, because the effects of these variables depend on the regularity of the orthographic system, studies must also be done in other languages with different characteristics. The goal of this work was to conduct a lexical decision study in Spanish, a language with a shallow orthography and a high number of words. The influence of psycholinguistic variables on latencies corresponding to 2,765 words was assessed by means of linear mixed-effects modeling. The results show that some variables, such as frequency or age of acquisition, have significant effects on reaction times regardless of the type of words used. Other variables, such as orthographic neighborhood or imageability, were significant only in specific groups of words. Our results highlight the importance of taking into account the peculiarities of each spelling system in the development of reading models.
The present study provides affective norms for a large corpus of French words (N = 1,031) that were rated on emotional valence and emotional arousal by 469 French young adults. Ratings were made using the Self-Assessment Manikin (Lang, 1980). By combining evaluations of valence and arousal, and including ratings provided by male and female young adults, this database complements and extends existing French-language databases. The response reliability for the two affective dimensions was good, and the consistency between the present and previous ratings was high. We found a strong quadratic relationship between the valence and arousal ratings. Perceptions of the affective content of a word were partly linked to sex. This new affective database (FAN) will enable French-speaking researchers to select suitable materials for studies of how the character of affective words influences their cognitive processing. FAN is available as an online supplement downloadable with this article.
Planning, predicting, reasoning, and acting often depend crucially on the correct encoding and application of knowledge concerning the temporal and causal ordering of events. Yet no pictorial stimulus set is optimized for investigating the processing of temporal and causal order information. We introduce a novel stimulus set of 265 black-and-white line drawings depicting a diverse array of recognizable events. Most of the images in the stimulus set (N = 222) share a thematic or conceptual association with one other image in the set, and the stimuli were created and extensively normed such that the image pairs vary in the degrees to which they share a causal, ordered relation with one another. The stimuli were standardized in a series of normative tasks, including concept/noun/verb agreement, perceived frequency, visual similarity, and indexes of three features of causal associations between events (i.e., temporal proximity, exclusivity, and priority). Both younger adults (ages 18-30 years) and older adults (ages 60-80 years) contributed normative data, allowing for broad applications of the stimuli to the study of normal and age-related changes in the encoding, retention, and retrieval of information regarding temporal and causal order. Complete normative data sets are available in the online supplemental materials, and the full stimulus set is available by contacting the first author.
We present the KELLY project and its work on developing monolingual and bilingual word lists for language learning, using corpus methods, for nine languages and thirty-six language pairs. We describe the method and discuss the many challenges encountered. We have loaded the data into an online database to make it accessible for anyone to explore and we present our own first explorations of it. The focus of the paper is thus twofold, covering pedagogical and methodological aspects of the lists’ construction, and linguistic aspects of the by-product of the project, the KELLY database.
Emotion expression in human-human interaction takes place via various types of information, including body motion. Research on the perceptual-cognitive mechanisms underlying the processing of natural emotional body language can benefit greatly from datasets of natural emotional body expressions that facilitate stimulus manipulation and analysis. The existing databases have so far focused on few emotion categories which display predominantly prototypical, exaggerated emotion expressions. Moreover, many of these databases consist of video recordings which limit the ability to manipulate and analyse the physical properties of these stimuli. We present a new database consisting of a large set (over 1400) of natural emotional body expressions typical of monologues. To achieve close-to-natural emotional body expressions, amateur actors were narrating coherent stories while their body movements were recorded with motion capture technology. The resulting 3-dimensional motion data recorded at a high frame rate (120 frames per second) provides fine-grained information about body movements and allows the manipulation of movement on a body joint basis. For each expression it gives the positions and orientations in space of 23 body joints for every frame. We report the results of physical motion properties analysis and of an emotion categorisation study. The reactions of observers from the emotion categorisation study are included in the database. Moreover, we recorded the intended emotion expression for each motion sequence from the actor to allow for investigations regarding the link between intended and perceived emotions. The motion sequences along with the accompanying information are made available in a searchable MPI Emotional Body Expression Database. We hope that this database will enable researchers to study expression and perception of naturally occurring emotional body expressions in greater depth.
In this study, we present the normative values of the adaptation of the International Affective Digitized Sounds (IADS-2; Bradley {\&} Lang, 2007a) for European Portuguese (EP). The IADS-2 is a standardized database of 167 naturally occurring sounds that is widely used in the study of emotions. The sounds were rated by 300 college students who were native speakers of EP, in the three affective dimensions of valence, arousal, and dominance, by using the Self-Assessment Manikin (SAM). The aims of this adaptation were threefold: (1) to provide researchers with standardized and normatively rated affective sounds to be used with an EP population; (2) to investigate sex and cultural differences in the ratings of affective dimensions of auditory stimuli between EP and the American (Bradley {\&} Lang, 2007a) and Spanish (Fern{\'{a}}ndez-Abascal et al., Psicothema 20:104-113 2008; Redondo, Fraga, Padr{\'{o}}n, {\&} Pi{\~{n}}eiro, Behavior Research Methods 40:784-790 2008) standardizations; and (3) to promote research on auditory affective processing in Portugal. Our results indicated that the IADS-2 is a valid and useful database of digitized sounds for the study of emotions in a Portuguese context, allowing for comparisons of its results with those of other international studies that have used the same database for stimulus selection. The normative values of the EP adaptation of the IADS-2 database can be downloaded along with the online version of this article.
We present a verb–complement dictionary of Modern Hebrew, automatically extracted from text corpora. Carefully examining a large set of examples, we defined ten types of verb complements that cover the vast majority of the occurrences of verb complements in the corpora. We explored several collocation measures as indicators of the strength of the association between the verb and its complement. We then used these measures to automatically extract verb complements from corpora. The result is a wide-coverage, accurate dictionary that lists not only the likely complements for each verb, but also the likelihood of each complement. We evaluated the quality of the extracted dictionary both intrinsically and extrinsically. Intrinsically, we showed high precision and recall on randomly (but systematically) selected verbs. Extrinsically, we showed that using the extracted information is beneficial for two applications, prepositional phrase attachment disambiguation and Arabic-to-Hebrew machine translation.
Through this study, we aimed to validate a new tool for inducing moods in experimental contexts. Five audio stories with sad, joyful, frightening, erotic, or neutral content were presented to 60 participants (33 women, 27 men) in a within-subjects design, each for about 10 min. Participants were asked (1) to report their moods before and after listening to each story, (2) to assess the emotional content of the excerpts on various emotional scales, and (3) to rate their level of projection into the stories. The results confirmed our a priori emotional classification. The emotional stories were effective in inducing the desired mood, with no difference found between male and female participants. These stories therefore constitute a valuable corpus for inducing moods in French-speaking participants, and they are made freely available for use in scientific research.
The index of productive syntax (IPSyn; Scarborough (Applied Psycholinguistics 11:1-22, 1990) is a measure of syntactic development in child language that has been used in research and clinical settings to investigate the grammatical development of various groups of children. However, IPSyn is mostly calculated manually, which is an extremely laborious process. In this article, we describe the AC-IPSyn system, which automatically calculates the IPSyn score for child language transcripts using natural language processing techniques. Our results show that the AC-IPSyn system performs at levels comparable to scores computed manually. The AC-IPSyn system can be downloaded from www.hlt.utdallas.edu/{\~{}}nisa/ipsyn.html .
Most efforts at automatically creating multilingual lexicons require input lexical resources with rich content (e.g. semantic networks, domain codes, semantic categories) or large corpora. Such material is often unavailable and difficult to construct for under-resourced languages. In some cases, particularly for some ethnic languages, even unannotated corpora are still in the process of collection. We show how multilingual lexicons with under-resourced languages can be constructed using simple bilingual translation lists, which are more readily available. The prototype multilingual lexicon developed comprise six member languages: English, Malay, Chinese, French, Thai and Iban, the last of which is an under-resourced language in Borneo. Quick evaluations showed that 91.2 % of 500 random multilingual entries in the generated lexicon require minimal or no human correction.
In this paper, we present the final version of a publicly available treebank of Finnish, the Turku Dependency Treebank. The treebank contains 204,399 tokens (15,126 sentences) from 10 different text sources and has been manually annotated in a Finnish-specific version of the well-known Stanford Dependency scheme. The morphological analyses of the treebank have been assigned using a novel machine learning method to disambiguate readings given by an existing tool. As the second main contribution, we present the first open source Finnish dependency parser, trained on the newly introduced treebank. The parser achieves a labeled attachment score of 81 %. The treebank data as well as the parsing pipeline are available under an open license at http://bionlp.utu.fi/.
We present a database of high-definition (HD) videos for the study of traits inferred from whole-body actions. Twenty-nine actors (19 female) were filmed performing different actions—walking, picking up a box, putting down a box, jumping, sitting down, and standing and acting—while conveying different traits, including four emotions (anger, fear, happiness, sadness), untrustworthiness, and neutral, where no specific trait was conveyed. For the actions conveying the four emotions and untrustworthiness, the actions were filmed multiple times, with the actor conveying the traits with different levels of intensity. In total, we made 2,783 action videos (in both two-dimensional and three-dimensional format), each lasting 7 s with a frame rate of 50 fps. All videos were filmed in a green-screen studio in order to isolate the action information from all contextual detail and to provide a flexible stimulus set for future use. In order to validate the traits conveyed by each action, we asked participants to rate each of the actions corresponding to the trait that the actor portrayed in the two-dimensional videos. To provide a useful database of stimuli of multiple actions conveying multiple traits, each video name contains information on the gender of the actor, the action executed, the trait conveyed, and the rating of its perceived intensity. All videos can be downloaded free at the following address: http://www-users.york.ac.uk/{\~{}}neb506/databases.html. We discuss potential uses for the database in the analysis of the perception of whole-body actions.
In the present study, we present normative ratings of free association for 139 European Portuguese (EP) words among 7- to 8-, 9- to 10-, and 11- to 12-year-old children attending the 3rd, 5th, and 7th grades of elementary and middle school in Portugal. For each word, five indices are presented: (a) the percentage of associates, (b) the strength of the first associate, (c) the strength of the second associate, (d) the distance between the first and second associates, and (e) the percentage of idiosyncratic responses. Additionally, grade-level frequency values for each word from the ESCOLEX database (Soares et al., in press) are also provided. As expected, the results revealed developmental changes in the knowledge organization of the children, which occurred at the ages of 9–10 (5th grade) and remained stable in the 11- to 12-year-old children (7th grade). Specifically, we observed a decrease in the percentages of associates and idiosyncratic responses, as well as an increase in the strengths of the first and second associates from the 3rd to the 5th grade. Moreover, a comparative analysis with the previous work of Carneiro, Albuquerque, Fernandez, and Esteves (2004) on EP and Macizo, G{\'{o}}mez-Ariza, and Bajo (2000) on Spanish, for the subsets of common words (16 and 58, respectively), showed that the present norms fit well with previous EP data, but differ from the Spanish data.
Concreteness ratings are presented for 37,058 English words and 2,896 two-word expressions (such as zebra crossing and zoom in), obtained from over 4,000 participants by means of a norming study using Internet crowdsourcing for data collection. Although the instructions stressed that the assessment of word concreteness would be based on experiences involving all senses and motor responses, a comparison with the existing concreteness norms indicates that participants, as before, largely focused on visual and haptic experiences. The reported data set is a subset of a comprehensive list of English lemmas and contains all lemmas known by at least 85 {\%} of the raters. It can be used in future research as a reference list of generally known English lemmas.
The need for data about the acquisition of Czech by non-native learners prompted the compilation of the first learner corpus of Czech. After introducing its basic design and parameters, including a multi-tier manual annotation scheme and error taxonomy, we focus on the more technical aspects: the transcription of hand-written source texts, process of annotation, and options for exploiting the result, together with tools used for these tasks and decisions behind the choices. To support or even substitute manual annotation we assign some error tags automatically and use automatic annotation tools (tagger, spell checker).
Starting in 2006, the European Commission’s Joint Research Centre and other European Union organisations have made available a number of large-scale highly-multilingual parallel language resources. In this article, we give a comparative overview of these resources and we explain the specific nature of each of them. This article provides answers to a number of question, including: What are these linguistic resources? What is the difference between them? Why were they originally created and why was the data released publicly? What can they be used for and what are the limitations of their usability? What are the text types, subject domains and languages covered? How to avoid overlapping document sets? How do they compare regarding the formatting and the translation alignment? What are their usage conditions? What other types of multilingual linguistic resources does the EU have? This article thus aims to clarify what the similarities and differences between the various resources are and what they can be used for. It will also serve as a reference publication for those resources, for which a more detailed description has been lacking so far (EAC-TM, ECDC-TM and DGT-Acquis).
The balanced corpus of contemporary written Japanese (BCCWJ) is Japan’s first 100 million words balanced corpus. It consists of three subcorpora (publication subcorpus, library subcorpus, and special-purpose subcorpus) and covers a wide range of text registers including books in general, magazines, newspapers, governmental white papers, best-selling books, an internet bulletin-board, a blog, school textbooks, minutes of the national diet, publicity newsletters of local governments, laws, and poetry verses. A random sampling technique is utilized whenever possible in order to maximize the representativeness of the corpus. The corpus is annotated in terms of dual POS analysis, document structure, and bibliographical information. The BCCWJ is currently accessible in three different ways including Chunagon a web-based interface to the dual POS analysis data. Lastly, results of some pilot evaluation of the corpus with respect to the textual diversity are reported. The analyses include POS distribution, word-class distribution, entropy of orthography, sentence length, and variation of the adjective predicate. High textual diversity is observed in all these analyses.
Virtually no valid materials are available to evaluate confrontation naming in Spanish-English bilingual adults in the U.S. In a recent study, a large group of young Spanish-English bilingual adults were evaluated on An Object and Action Naming Battery (Edmonds {\&} Donovan in Journal of Speech, Language, and Hearing Research 55:359-381, 2012). Rasch analyses of the responses resulted in evidence for the content and construct validity of the retained items. However, the scope of that study did not allow for extensive examination of individual item characteristics, group analyses of participants, or the provision of testing and scoring materials or raw data, thereby limiting the ability of researchers to administer the test to Spanish-English bilinguals and to score the items with confidence. In this study, we present the in-depth information described above on the basis of further analyses, including (1) online searchable spreadsheets with extensive empirical (e.g., accuracy and name agreeability) and psycholinguistic item statistics; (2) answer sheets and instructions for scoring and interpreting the responses to the Rasch items; (3) tables of alternative correct responses for English and Spanish; (4) ability strata determined for all naming conditions (English and Spanish nouns and verbs); and (5) comparisons of accuracy across proficiency groups (i.e., Spanish dominant, English dominant, and balanced). These data indicate that the Rasch items from An Object and Action Naming Battery are valid and sensitive for the evaluation of naming in young Spanish-English bilingual adults. Additional information based on participant responses for all of the items on the battery can provide researchers with valuable information to aid in stimulus development and response interpretation for experimental studies in this population.
We present the German adaptation of the Affective Norms for English Words (ANEW; Bradley {\&} Lang in Technical Report No. C-1. Gainsville: University of Florida, Center for Research in Psychophysiology). A total of 1,003 Words-German translations of the ANEW material-were rated on a total of six dimensions: The classic ratings of valence, arousal, and dominance (as in the ANEW corpus) were extended with additional arousal ratings using a slightly different scale (see BAWL: V{\~{o}} et al. in Behavior Research Methods 41: 531-538, 2009; V{\~{o}}, Jacobs, {\&} Conrad in Behavior Research Methods 38: 606-609, 2006), along with ratings of imageability and potency. Measures of several objective psycholinguistic variables (different types of word frequency counts, grammatical class, number of letters, number of syllables, and number of orthographic neighbors) for the words were also added, so as to further facilitate the use of this new database in psycholinguistic research. These norms can be downloaded as supplemental materials with this article.
Defining the specific role of the factors that affect metaphor processing is a fundamental step for fully understanding figurative language comprehension, either in discourse and conversation or in reading poems and novels. This study extends the currently available materials on everyday metaphorical expressions by providing the first dataset of metaphors extracted from literary texts and scored for the major psycholinguistic variables, considering also the effect of context. A set of 115 Italian literary metaphors presented in isolation (Experiment 1) and a subset of 65 literary metaphors embedded in their original texts (Experiment 2) were rated on several dimensions (word and phrase frequency, readability, cloze probability, familiarity, concreteness, difficulty and meaningfulness). Overall, literary metaphors scored around medium-low values on all dimensions in both experiments. Collected data were subjected to correlation analysis, which showed the presence of a strong cluster of variables-mainly familiarity, difficulty, and meaningfulness-when literary metaphor were presented in isolation. A weaker cluster was observed when literary metaphors were presented in the original contexts, with familiarity no longer correlating with meaningfulness. Context manipulation influenced familiarity, concreteness and difficulty ratings, which were lower in context than out of context, while meaningfulness increased. Throughout the different dimensions, the literary context seems to promote a global interpretative activity that enhances the open-endedness of the metaphor as a semantic structure constantly open to all possible interpretations intended by the author and driven by the text. This dataset will be useful for the design of future experimental studies both on literary metaphor and on the role of context in figurative meaning, combining ecological validity and aesthetic aspects of language.
We collected norms on the gender stereotypicality of an extensive list of role nouns in Czech, English, French, German, Italian, Norwegian, and Slovak, to be used as a basis for the selection of stimulus materials in future studies. We present a Web-based tool (available at https://www.unifr.ch/lcg/ ) that we developed to collect these norms and that we expect to be useful for other researchers, as well. In essence, we provide (a) gender stereotypicality norms across a number of languages and (b) a tool to facilitate cross-language as well as cross-cultural comparisons when researchers are interested in the investigation of the impact of stereotypicality on the processing of role nouns.
The present study provides Dutch norms for age of acquisition, familiarity, imageability, image agreement, visual complexity, word frequency, and word length (in syllables) for 124 line drawings of actions. Ratings were obtained from 117 Dutch participants. Word frequency was determined on the basis of the SUBTLEX-NL corpus (Keuleers, Brysbaert, {\&} New, Behavior Research Methods, 42, 643-650, 2010). For 104 of the pictures, naming latencies and name agreement were determined in a separate naming experiment with 74 native speakers of Dutch. The Dutch norms closely corresponded to the norms for British English. Multiple regression analysis showed that age of acquisition, imageability, image agreement, visual complexity, and name agreement were significant predictors of naming latencies, whereas word frequency and word length were not. Combined with the results of a principal-component analysis, these findings suggest that variables influencing the processes of conceptual preparation and lexical selection affect latencies more strongly than do variables influencing word-form encoding.
The Database for Spoken German (Datenbank f{\"{u}}r Gesprochenes Deutsch, DGD2, http://dgd.ids-mannheim.de) is the central platform for publishing and disseminating spoken language corpora from the Archive of Spoken German (Archiv f{\"{u}}r Gesprochenes Deutsch, AGD, http://agd.ids-mannheim.de) at the Institute for the German Language in Mannheim. The corpora contained in the DGD2 come from a variety of sources, some of them in-house projects, some of them external projects. Most of the corpora were originally intended either for research into the (dialectal) variation of German or for studies in conversation analysis and related fields. The AGD has taken over the task of permanently archiving these resources and making them available for reuse to the research community. To date, the DGD2 offers access to 19 different corpora, totalling around 9000 speech events, 2500 hours of audio recordings or 8 million transcribed words. This paper gives an overview of the data made available via the DGD2, of the technical basis for its implementation, and of the most important functionalities it offers. The paper concludes with information about the users of the database and future plans for its development.
Selecting appropriate stimuli to induce emotional states is essential in affective research. Only a few standardized affective stimulus databases have been created for auditory, language, and visual materials. Numerous studies have extensively employed these databases using both behavioral and neuroimaging methods. However, some limitations of the existing databases have recently been reported, including limited numbers of stimuli in specific categories or poor picture quality of the visual stimuli. In the present article, we introduce the Nencki Affective Picture System (NAPS), which consists of 1,356 realistic, high-quality photographs that are divided into five categories (people, faces, animals, objects, and landscapes). Affective ratings were collected from 204 mostly European participants. The pictures were rated according to the valence, arousal, and approach-avoidance dimensions using computerized bipolar semantic slider scales. Normative ratings for the categories are presented for each dimension. Validation of the ratings was obtained by comparing them to ratings generated using the Self-Assessment Manikin and the International Affective Picture System. In addition, the physical properties of the photographs are reported, including luminance, contrast, and entropy. The new database, with accompanying ratings and image parameters, allows researchers to select a variety of visual stimulus materials specific to their experimental questions of interest. The NAPS system is freely accessible to the scientific community for noncommercial use by request at http://naps.nencki.gov.pl .
Reading involves a process of matching an orthographic input with stored representations in lexical memory. The masked priming paradigm has become a standard tool for investigating this process. Use of existing results from this paradigm can be limited by the precision of the data and the need for cross-experiment comparisons that lack normal experimental controls. Here, we present a single, large, high-precision, multicondition experiment to address these problems. Over 1,000 participants from 14 sites responded to 840 trials involving 28 different types of orthographically related primes (e.g., castfe-CASTLE) in a lexical decision task, as well as completing measures of spelling and vocabulary. The data were indeed highly sensitive to differences between conditions: After correction for multiple comparisons, prime type condition differences of 2.90 ms and above reached significance at the 5{\%} level. This article presents the method of data collection and preliminary findings from these data, which included replications of the most widely agreed-upon differences between prime types, further evidence for systematic individual differences in susceptibility to priming, and new evidence regarding lexical properties associated with a target word's susceptibility to priming. These analyses will form a basis for the use of these data in quantitative model fitting and evaluation and for future exploration of these data that will inform and motivate new experiments.
The paper describes a corpus of texts produced by non-native speakers of Czech. We discuss its annotation scheme, consisting of three interlinked tiers, designed to handle a wide range of error types present in the input. Each tier corrects different types of errors; links between the tiers allow capturing errors in word order and complex discontinuous expressions. Errors are not only corrected, but also classified. The annotation scheme is tested on a data set including approx. 175,000 words with fair inter-annotator agreement results. We also explore the possibility of applying automated linguistic annotation tools (taggers, spell checkers and grammar checkers) to the learner text to support or even substitute manual annotation.
Modern paraphrase research would benefit from large corpora with detailed annotations. However, currently these corpora are still thin on the ground. In this paper, we describe the development of such a corpus for Dutch, which takes the form of a parallel monolingual treebank consisting of over 2 million tokens and covering various text genres, including both parallel and comparable text. This publicly available corpus is richly annotated with alignments between syntactic nodes, which are also classified using five different semantic similarity relations. A quarter of the corpus is manually annotated, and this informs the development of an automatic tree aligner used to annotate the remainder of the corpus. We argue that this corpus is the first of this size and kind, and offers great potential for paraphrasing research.
This paper describes the creation of a fine-grained named entity annotation scheme and corpus for Dutch, and experiments on automatic main type and subtype named entity recognition. We give an overview of existing named entity annotation schemes, and motivate our own, which describes six main types (persons, organizations, locations, products, events and miscellaneous named entities) and finer-grained information on subtypes and metonymic usage. This was applied to a one-million-word subset of the Dutch SoNaR reference corpus. The classifier for main type named entities achieves a micro-averaged F-score of 84.91 %, and is publicly available, along with the corpus and annotations.
Automatic methods for wordnet development in languages other than English generally exploit information found in Princeton WordNet (PWN) and translations extracted from parallel corpora. A common approach consists in preserving the structure of PWN and transferring its content in new languages using alignments, possibly combined with information extracted from multilingual semantic resources. Even if the role of PWN remains central in this process, these automatic methods offer an alternative to the manual elaboration of new wordnets. However, their limited coverage has a strong impact on that of the resulting resources. Following this line of research, we apply a cross-lingual word sense disambiguation method to wordnet development. Our approach exploits the output of a data-driven sense induction method that generates sense clusters in new languages, similar to wordnet synsets, by identifying word senses and relations in parallel corpora. We apply our cross-lingual word sense disambiguation method to the task of enriching a French wordnet resource, the WOLF, and show how it can be efficiently used for increasing its coverage. Although our experiments involve the English–French language pair, the proposed methodology is general enough to be applied to the development of wordnet resources in other languages for which parallel corpora are available. Finally, we show how the disambiguation output can serve to reduce the granularity of new wordnets and the degree of polysemy present in PWN.
Dependency grammar is considered appropriate for many Indian languages. In this paper, we present a study of the dependency relations in Bangla language. We have categorized these relations in three different levels, namely intrachunk relations, interchunk relations and interclause relations. Each of these levels is further categorized and an annotation scheme has been developed. Both syntactic and semantic features have been taken into consideration for describing the relations. In our scheme, there are 63 such syntactico–semantic relations. We have verified the scheme by tagging a corpus of 4167 Bangla sentences to create a treebank (KGPBenTreebank).
Word processing studies increasingly make use of regression analyses based on large numbers of stimuli (the so-called megastudy approach) rather than experimental designs based on small factorial designs. This requires the availability of word features for many words. Following similar studies in English, we present and validate ratings of age of acquisition and concreteness for 30,000 Dutch words. These include nearly all lemmas language researchers are likely to be interested in. The ratings are freely available for research purposes.
We present HamleDT—a HArmonized Multi-LanguagE Dependency Treebank. HamleDT is a compilation of existing dependency treebanks (or dependency conversions of other treebanks), transformed so that they all conform to the same annotation style. In the present article, we provide a thorough investigation and discussion of a number of phenomena that are comparable across languages, though their annotation in treebanks often differs. We claim that transformation procedures can be designed to automatically identify most such phenomena and convert them to a unified annotation style. This unification is beneficial both to comparative corpus linguistics and to machine learning of syntactic parsing.
The learner translation corpus developed at the School of Translation and Interpreting of Pompeu Fabra University in Barcelona is a web-searchable resource created for pedagogical and research purposes. It comprises a multiple translation corpus (English–Catalan) featuring automatic linguistic annotation and manual error annotation, complemented with an interface for monolingual or bilingual querying of the data. The corpus can be used to identify common errors in the students’ work and to analyse their patterns of language use. It provides easy access to error samples and to multiple versions of the same source text sequence to be used as learning materials in various courses in the translator-training university curriculum.
A wordnet is an important tool for developing natural language processing applications for a language. However, most wordnets are handcrafted by experts, which limits their growth. In this article, we propose an automatic approach to create wordnets by exploiting textual resources, dubbed ECO. After extracting semantic relation instances, identified by discriminating textual patterns, ECO discovers synonymy clusters, used as synsets, and attaches the remaining relations to suitable synsets. Besides introducing each step of ECO, we report on how it was implemented to create Onto.PT, a public lexical ontology for Portuguese. Onto.PT is the result of the automatic exploitation of Portuguese dictionaries and thesauri, and it aims to minimise the main limitations of existing Portuguese lexical knowledge bases.
The processing of human and nonhuman concepts (e.g., agreeable vs. edible) during basic comprehension and reasoning tasks has become a major topic of scientific inquiry. To ensure that the experimental effects obtained from such studies reflect the hypothesised semantic distinction, potential confounds such as psycholinguistic and/or lexical properties of the exact stimuli chosen need to be addressed. In the current study, normative data of such properties were obtained for a series of 875 French adjectives by asking 8 groups of 20 participants to each rate all words on one dimension of theoretical interest. The collected ratings indicate the extent to which each adjective evokes a sensory experience (concreteness), captures an enduring attribute (temporal stability), refers to a visible characteristic (visibility), denotes a neutral or an affectively laden concept (valence), signifies an attribute of low or high intensity, is familiar to the reader and can be used to describe people and/or inanimate entities such as objects. In addition, for each item its exact grammatical class (adjective vs. past participle adjective), length (i.e., number of letters, number of syllables), and word frequency was retrieved from the lexique3 corpus. The resulting database enables researchers to consider pivotal psycholinguistic and lexical properties when selecting human and nonhuman stimuli for future research.
Theories of the representation and processing of concepts have been greatly enhanced by models based on information available in semantic property norms. This information relates both to the identity of the features produced in the norms and to their statistical properties. In this article, we introduce a new and large set of property norms that are designed to be a more flexible tool to meet the demands of many different disciplines interested in conceptual knowledge representation, from cognitive psychology to computational linguistics. As well as providing all features listed by 2 or more participants, we also show the considerable linguistic variation that underlies each normalized feature label and the number of participants who generated each variant. Our norms are highly comparable with the largest extant set (McRae, Cree, Seidenberg, {\&} McNorgan, 2005) in terms of the number and distribution of features. In addition, we show how the norms give rise to a coherent category structure. We provide these norms in the hope that the greater detail available in the Centre for Speech, Language and the Brain norms should further promote the development of models of conceptual knowledge. The norms can be downloaded at www.csl.psychol.cam.ac.uk/propertynorms.
Formal and semantic overlap across languages plays an important role in bilingual language processing systems. In the present study, Japanese (first language; L1)–English (second language; L2) bilinguals rated 193 Japanese–English word pairs, including cognates and noncognates, in terms of phonological and semantic similarity. We show that the degree of cross-linguistic overlap varies, such that words can be more or less “cognate,” in terms of their phonological and semantic overlap. Bilinguals also translated these words in both directions (L1–L2 and L2–L1), providing a measure of translation equivalency. Notably, we reveal for the first time that Japanese–English cognates are “special,” in the sense that they are usually translated using one English term (e.g., コール /kooru/ is always translated as “call”), but the English word is translated into a greater variety of Japanese words. This difference in translation equivalency likely extends to other nonetymologically related, different-script languages in which cognates are all loanwords (e.g., Korean–English). Norming data were also collected for L1 age of acquisition, L1 concreteness, and L2 familiarity, because such information had been unavailable for the item set. Additional information on L1/L2 word frequency, L1/L2 number of senses, and L1/L2 word length and number of syllables is also provided. Finally, correlations and characteristics of the cognate and noncognate items are detailed, so as to provide a complete overview of the lexical and semantic characteristics of the stimuli. This creates a comprehensive bilingual data set for these different-script languages and should be of use in bilingual word recognition and spoken language research.
In this article, we present the first open-access lexical database that provides phonological representations for 120,000 Italian word forms. Each of these also includes syllable boundaries and stress markings and a comprehensive range of lexical statistics. Using data derived from this lexicon, we have also generated a set of derived databases and provided estimates of positional frequency use for Italian phonemes, syllables, syllable onsets and codas, and character and phoneme bigrams. These databases are freely available from phonitalia.org. This article describes the methods, content, and summarizing statistics for these databases. In a first application of this database, we also demonstrate how the distribution of phonological substitution errors made by Italian aphasic patients is related to phoneme frequency.
We developed affective norms for 1,121 Italian words in order to provide researchers with a highly controlled tool for the study of verbal processing. This database was developed from translations of the 1,034 English words present in the Affective Norms for English Words (ANEW; Bradley {\&} Lang, 1999) and from words taken from Italian semantic norms (Montefinese, Ambrosini, Fairfield, {\&} Mammarella, Behavior Research Methods, 45, 440–461, 2013). Participants evaluated valence, arousal, and dominance using the Self-Assessment Manikin (SAM) in a Web survey procedure. Participants also provided evaluations of three subjective psycholinguistic indexes (familiarity, imageability, and concreteness), and five objective psycholinguistic indexes (e.g., word frequency) were also included in the resulting database in order to further characterize the Italian words. We obtained a typical quadratic relation between valence and arousal, in line with previous findings. We also tested the reliability of the present ANEW adaptation for Italian by comparing it to previous affective databases and performing split-half correlations for each variable. We found high split-half correlations within our sample and high correlations between our ratings and those of previous studies, confirming the validity of the adaptation of ANEW for Italian. This database of affective norms provides a tool for future research about the effects of emotion on human cognition.
This paper introduces the Corpus of Advanced Learner Finnish (LAS2), one of the existing corpora of learner Finnish. The corpus was started at the University of Turku in 2007, and the initial motivation for its collection was to make it possible to deal with novel linguistic challenges posed by academic immigration and to contribute to corpus linguistics, Finnish linguistics and the study of second language acquisition. This paper describes the typological standpoint of the LAS2, its position with respect to other corpora of learner Finnish, the compilation criteria, the annotation applied and the workflow implemented. The corpus consists of three subcorpora of written academic texts of non-native speakers of Finnish. The subcorpora are 1) texts for examination purposes, 2) texts for publishing and graduating purposes, and 3) texts for studying and learning purposes. The informants either study or work in Finnish within academia in Finland. When available, the data has been collected longitudinally. A reference corpus for each subcorpus written by native speakers has also been compiled. Three query tools designed within the framework of the LAS2 are also introduced. These tools enable queries based on any combinations of the linguistic annotation. They can also be used to analyse the typical inner or cotextual variation of any user-specified linguistic node or to create frequency lists of multiword units defined at any level of the annotation. The queries can be limited to a user-specified subset of the data.
Naturalistic learner productions are an important empirical resource for SLA research. Some pioneering works have produced valuable second language (L2) resources supporting SLA research.1 One common limitation of these resources is the absence of individual longitudinal data for numerous speakers with different backgrounds across the proficiency spectrum, which is vital for understanding
We present word frequencies based on subtitles of British television programmes. We show that the SUBTLEX-UK word frequencies explain more of the variance in the lexical decision times of the British Lexicon Project than the word frequencies based on the British National Corpus and the SUBTLEX-US frequencies. In addition to the word form frequencies, we also present measures of contextual diversity part-of-speech specific word frequencies, word frequencies in children programmes, and word bigram frequencies, giving researchers of British English access to the full range of norms recently made available for other languages. Finally, we introduce a new measure of word frequency, the Zipf scale, which we hope will stop the current misunderstandings of the word frequency effect.
In this article, we introduce ESCOLEX, the first European Portuguese children's lexical database with grade-level-adjusted word frequency statistics. Computed from a 3.2-million-word corpus, ESCOLEX provides 48,381 word forms extracted from 171 elementary and middle school textbooks for 6- to 11-year-old children attending the first six grades in the Portuguese educational system. Like other children's grade-level databases (e.g., Carroll, Davies, {\&} Richman, 1971; Corral, Ferrero, {\&} Goikoetxea, Behavior Research Methods, 41, 1009–1017, 2009; L{\'{e}}t{\'{e}}, Sprenger-Charolles, {\&} Col{\'{e}}, Behavior Research Methods, Instruments, {\&} Computers, 36, 156–166, 2004; Zeno, Ivens, Millard, Duvvuri, 1995), ESCOLEX provides four frequency indices for each grade: overall word frequency (F), index of dispersion across the selected textbooks (D), estimated frequency per million words (U), and standard frequency index (SFI). It also provides a new measure, contextual diversity (CD). In addition, the number of letters in the word and its part(s) of speech, number of syllables, syllable structure, and adult frequencies taken from P-PAL (a European Portuguese corpus-based lexical database; Soares, Comesa{\~{n}}a, Iriarte, Almeida, Sim{\~{o}}es, Costa, {\ldots}, Machado, 2010; Soares, Iriarte, Almeida, Sim{\~{o}}es, Costa, Fran{\c{c}}a, {\ldots}, Comesa{\~{n}}a, in press) are provided. ESCOLEX will be a useful tool both for researchers interested in language processing and development and for professionals in need of verbal materials adjusted to children's developmental stages. ESCOLEX can be downloaded along with this article or from http://p-pal.di.uminho.pt/about/databases.
In the present study, normative data in Turkish are presented for the 260 color versions of the original Snodgrass and Vanderwart (1980) picture set for the first time. Norms are reported for name and image agreement, age of acquisition (AoA), visual complexity, and conceptual familiarity, together with written word frequency, and numbers of letters and syllables. We collected data from 277 native Turkish adults in a variety of tasks. The results indicated that, whilst several measures displayed language-specific variation, we also reported what seem to be language-independent-that is, universal-measures that show a systematic relationship across several languages. The implications of the reported measures in the domain of psycholinguistic research in Turkish and for wider cross-linguistic comparisons are discussed.
Wordnets are large-scale lexical databases of related words and concepts, useful for language-aware software applications. They have recently been built for many languages by using various approaches. The Finnish wordnet, FinnWordNet (FiWN), was created by translating the more than 200,000 word senses in the English Princeton WordNet (PWN) 3.0 in 100 days. To ensure quality, they were translated by professional translators. The direct translation approach was based on the assumption that most synsets in PWN represent language-independent real-world concepts. Thus also the semantic relations between synsets were assumed mostly language-independent, so the structure of PWN could be reused as well. This approach allowed the creation of an extensive Finnish wordnet directly aligned with PWN and also provided us with a translation relation and thus a bilingual wordnet usable as a dictionary. In this paper, we address several concerns raised with regard to our approach, many of them for the first time. We evaluate the craftsmanship of the translators by checking the spelling and translation quality, the viability of the approach by assessing the synonym quality both on the lexeme and concept level, as well as the usefulness of the resulting lexical resource both for humans and in a language-technological task. We discovered no new problems compared with those already known in PWN. As a whole, the paper contributes to the scientific discourse on what it takes to create a very large wordnet. As a side-effect of the evaluation, we extended FiWN to contain 208,645 word senses in 120,449 synsets, effectively making version 2.0 of FiWN currently the largest wordnet in the world by these statistics.
Cognitive theories in visual attention and perception, categorization, and memory often critically rely on concepts of similarity among objects, and empirically require measures of “sameness” among their stimuli. For instance, a researcher may require similarity estimates among multiple exemplars of a target category in visual search, or targets and lures in recognition memory. Quantifying similarity, however, is challenging when everyday items are the desired stimulus set, particularly when researchers require several different pictures from the same category. In this article, we document a new multidimensional scaling database with similarity ratings for 240 categories, each containing color photographs of 16–17 exemplar objects. We collected similarity ratings using the spatial arrangement method. Reports include: the multidimensional scaling solutions for each category, up to five dimensions, stress and fit measures, coordinate locations for each stimulus, and two new classifications. For each picture, we categorized the item's prototypicality, indexed by its proximity to other items in the space. We also classified pairs of images along a continuum of similarity, by assessing the overall arrangement of each MDS space. These similarity ratings will be useful to any researcher that wishes to control the similarity of experimental stimuli according to an objective quantification of “sameness.”
Researchers have only recently started to take advantage of the developments in technology and communication for sharing data and documents. However, the exchange of experimental material has not taken advantage of this progress yet. In order to facilitate access to experimental material, the Bank of Standardized Stimuli (BOSS) project was created as a free standardized set of visual stimuli accessible to all researchers, through a normative database. The BOSS is currently the largest existing photo bank providing norms for more than 15 dimensions (e.g. familiarity, visual complexity, manipulability, etc.), making the BOSS an extremely useful research tool and a mean to homogenize scientific data worldwide. The first phase of the BOSS was completed in 2010, and contained 538 normative photos. The second phase of the BOSS project presented in this article, builds on the previous phase by adding 930 new normative photo stimuli. New categories of concepts were introduced, including animals, building infrastructures, body parts, and vehicles and the number of photos in other categories was increased. All new photos of the BOSS were normalized relative to their name, familiarity, visual complexity, object agreement, viewpoint agreement, and manipulability. The availability of these norms is a precious asset that should be considered for characterizing the stimuli as a function of the requirements of research and for controlling for potential confounding effects.
Given the importance of lexical frequency for psycholinguistic research and the lack of comprehensive frequency data for sign languages, we collected subjective estimates of lexical frequency for 432 signs in American Sign Language (ASL). Our participants were 59 deaf signers who first began to acquire ASL at ages ranging from birth to 14 years old and who had a minimum of 10 years of experience. Subjective frequency estimates were made on a scale ranging from 1 = rarely see the sign to 7 = always see the sign. The mean subjective frequency ratings for individual signs did not vary in relation to age of sign language exposure (AoLE), chronological age, or length of ASL experience. Nor did AoLE show significant effects on the response times (RTs) for making the ratings. However, RTs were highly correlated with mean frequency ratings. These results suggest that the distributions of subjective lexical frequencies are consistent across signers with varying AoLEs. The implications for research practice are that subjective frequency ratings from random samples of highly experienced deaf signers can provide a reasonable measure of lexical control in sign language experiments. The Appendix gives the mean and median subjective frequency ratings and the mean and median log(RT) of the ASL signs for the entire sample; the supplemental material gives these measures for the three AoLE groups: native, early, and late.
Short Message Service (SMS) messages are short messages sent from one person to another from their mobile phones. They represent a means of personal communication that is an important communicative artifact in our current digital era. As most existing studies have used private access to SMS corpora, comparative studies using the same raw SMS data have not been possible up to now. We describe our efforts to collect a public SMS corpus to address this problem. We use a battery of methodologies to collect the corpus, paying particular attention to privacy issues to address contributors’ concerns. Our live project collects new SMS message submissions, checks their quality, and adds valid messages. We release the resultant corpus as XML and as SQL dumps, along with monthly corpus statistics. We opportunistically collect as much metadata about the messages and their senders as possible, so as to enable different types of analyses. To date, we have collected more than 71,000 messages, focusing on English and Mandarin Chinese.
The impact-es diachronic corpus of historical Spanish compiles over one hundred books—containing approximately 8 million words—in addition to a complementary lexicon which links more than 10,000 lemmas with attestations of the different variants found in the documents. This textual corpus and the accompanying lexicon have been released under an open license (Creative Commons by-nc-sa) in order to permit their intensive exploitation in linguistic research. Approximately 7 % of the words in the corpus (a selection aimed at enhancing the coverage of the most frequent word forms) have been annotated with their lemma, part of speech, and modern equivalent. This paper describes the annotation criteria followed and the standards, based on the Text Encoding Initiative recommendations, used to represent the texts in digital form.
This article describes the creation and application of the Turk Bootstrap Word Sense Inventory for 397 frequent nouns, which is a publicly available resource for lexical substitution. This resource was acquired using Amazon Mechanical Turk. In a bootstrapping process with massive collaborative input, substitutions for target words in context are elicited and clustered by sense; then, more contexts are collected. Contexts that cannot be assigned to a current target word’s sense inventory re-enter the bootstrapping loop and get a supply of substitutions. This process yields a sense inventory with its granularity determined by substitutions as opposed to psychologically motivated concepts. It comes with a large number of sense-annotated target word contexts. Evaluation on data quality shows that the process is robust against noise from the crowd, produces a less fine-grained inventory than WordNet and provides a rich body of high precision substitution data at low cost. Using the data to train a system for lexical substitutions, we show that amount and quality of the data is sufficient for producing high quality substitutions automatically. In this system, co-occurrence cluster features are employed as a means to cheaply model topicality.
Semantic role labeling is traditionally viewed as a sentence-level task concerned with identifying semantic arguments that are overtly realized in a fairly local context (i.e., a clause or sentence). However, this local view potentially misses important information that can only be recovered if local argument structures are linked across sentence boundaries. One important link concerns semantic arguments that remain locally unrealized (null instantiations) but can be inferred from the context. In this paper, we report on the SemEval 2010 Task-10 on “Linking Events and Their Participants in Discourse”, that addressed this problem. We discuss the corpus that was created for this task, which contains annotations on multiple levels: predicate argument structure (FrameNet and PropBank), null instantiations, and coreference. We also provide an analysis of the task and its difficulties.
A widely agreed-upon feature of spoken word recognition is that multiple lexical candidates in memory are simultaneously activated in parallel when a listener hears a word, and that those candidates compete for recognition (Luce, Goldinger, Auer, {\&} Vitevitch, Perception 62:615-625, 2000; Luce {\&} Pisoni, Ear and Hearing 19:1-36, 1998; McClelland {\&} Elman, Cognitive Psychology 18:1-86, 1986). Because the presence of those competitors influences word recognition, much research has sought to quantify the processes of lexical competition. Metrics that quantify lexical competition continuously are more effective predictors of auditory and visual (lipread) spoken word recognition than are the categorical metrics traditionally used (Feld {\&} Sommers, Speech Communication 53:220-228, 2011; Strand {\&} Sommers, Journal of the Acoustical Society of America 130:1663-1672, 2011). A limitation of the continuous metrics is that they are somewhat computationally cumbersome and require access to existing speech databases. This article describes the Phi-square Lexical Competition Database (Phi-Lex): an online, searchable database that provides access to multiple metrics of auditory and visual (lipread) lexical competition for English words, available at www.juliastrand.com/phi-lex .
A corpus of 5,765 consonant-vowel-consonant sequences (CVCs) was compiled, and phonotactic probability and neighborhood density were computed for both child and adult corpora. This corpus of CVCs, provided as supplementary materials, was analyzed to address the following questions: (1) Do computations based on a child corpus differ from those based on an adult corpus? (2) Do the phonotactic probability and/or the neighborhood density of real words differ from those of nonwords? (3) Do phonotactic probability and/or neighborhood density differ across CVCs that vary in consonant age of acquisition? The results showed significant differences in phonotactic probability and neighborhood density for the child versus adult corpora, replicating prior findings. The impact of this difference on future studies will depend on the level of precision needed when specifying probability and density. In addition, significant and large differences in phonotactic probability and neighborhood density were detected between real words and nonwords, which may present methodological challenges for future research. Finally, CVCs composed of earlier-acquired sounds differed significantly in probability and density from those composed of later-acquired sounds, although this effect was relatively small and is less likely to present significant methodological challenges to future studies.
This paper introduces the first version of the Arabic Learner Corpus (ALC), which comprises a collection of texts written by learners of Arabic in Saudi Arabia. The corpus covers two types of students, non-native Arabic speakers (NNAS) learning Arabic as a second language (ASL) for academic purpose (AAP), and native Arabic speaking students (NAS) learning to improve their written Arabic. Both groups are males at pre- university level.
Despite their relatively low sampling factor, the freely available, randomly sampled status streams of Twitter are very useful sources of geographically embedded social network data. To statistically analyze the information Twitter provides via these streams, we have collected a year's worth of data and built a multi-terabyte relational database from it. The database is designed for fast data loading and to support a wide range of studies focusing on the statistics and geographic features of social networks, as well as on the linguistic analysis of tweets. In this paper we present the method of data collection, the database design, the data loading procedure and special treatment of geo-tagged and multi-lingual data. We also provide some SQL recipes for computing network statistics.
This paper describes the Atlante Sintattico d'Italia, Syntactic Atlas of Italy (ASIt) linguistic linked dataset. ASIt is a scientific project aiming to account for minimally different variants within a sample of closely related languages; it is part of the Edisyn network, the goal of which is to establish a European network of researchers in the area of language syntax that use similar standards with respect to methodology of data collection, data storage and annotation, data retrieval and cartography. In this context, ASIt is defined as a curated database which builds on dialectal data gathered during a twenty-year-long survey investigating the distribution of several grammatical phenomena across the dialects of Italy. Both the ASIt linguistic linked dataset and the Resource Description Framework Schema (RDF/S) on which it is based are publicly available and released with a Creative Commons license (CC BY-NC-SA 3.0). We report the characteristics of the data exposed by ASIt, the statistics about the evolution of the data in the last two years, and the possible usages of the dataset, such as the generation of linguistic maps. {\textcopyright} 2012 - IOS Press and the authors. All rights reserved.
The processes involved in past tense verb generation have been central to models of inflectional morphology. However, the empirical support for such models has often been based on studies of accuracy in past tense verb formation on a relatively small set of items. We present the first large-scale study of past tense inflection (the Past Tense Inflection Project, or PTIP) that affords response time, accuracy, and error analyses in the generation of the past tense form from the present tense form for over 2,000 verbs. In addition to standard lexical variables (such as word frequency, length, and orthographic and phonological neighborhood), we have also developed new measures of past tense neighborhood consistency and verb imageability for these stimuli, and via regression analyses we demonstrate the utility of these new measures in predicting past tense verb generation. The PTIP can be used to further evaluate existing models, to provide well controlled stimuli for new studies, and to uncover novel theoretical principles in past tense morphology.
The Nelson and Narens (Journal of Verbal Learning and Verbal Behavior 19:338-368, 1980) general knowledge norms have been valuable to researchers in many fields. However, much has changed over the 32 years since the 1980 norms. For example, in 1980, most people knew the answer to the question "What is the name of the Lone Ranger's Indian sidekick?" (answer: Tonto), whereas in 2012, few people know this answer. Thus, we updated the 1980 norms and expanded them by providing new measures. In particular, we report two new metacognitive measures (confidence judgments and peer judgments) and provide a detailed report of commission errors. Each of these measures will be valuable to researchers, and together they are likely to facilitate future research in a number of fields, such as research investigating memory illusions, metamemory processes, and error correction. The presence of substantial generational shifts from 1980 to 2012 necessitates the use of updated norms.
In this study, we report normative data by native Persian speakers for concept familiarity, age of acquisition (AoA), imageability, image agreement, name agreement, and visual complexity, as well as values for word frequency, word length, and naming latency for 200 of the colored Snodgrass and Vanderwart (Journal of Experimental Psy- chology: Human Learning and Memory 6:174-215, 1980) pictures created by Rossion and Pourtois (Perception 33:217-236, 2004). Using multiple regression analysis, we found independent effects of name agreement, image agree- ment, word frequency, and AoA on picture naming by native Persian speakers from Iran. We concluded that the psycholinguistic properties identified in studies of picture naming in many other languages also predict timed picture naming in Persian. Normativedatafor theratings and picture-naming latencies for the 200 Persian object nouns are provided as an Excel file in the Supplemental materials.
Nonverbal vocal expressions, such as laughter, sobbing, and screams, are an important source of emotional information in social interactions. However, the investigation of how we process these vocal cues entered the research agenda only recently. Here, we introduce a new corpus of nonverbal vocalizations, which we recorded and submitted to perceptual and acoustic validation. It consists of 121 sounds expressing four positive emotions (achievement/triumph, amusement, sensual pleasure, and relief) and four negative ones (anger, disgust, fear, and sadness), produced by two female and two male speakers. For perceptual validation, a forced choice task was used (n = 20), and ratings were collected for the eight emotions, valence, arousal, and authenticity (n = 20). We provide these data, detailed for each vocalization, for use by the research community. High recognition accuracy was found for all emotions (86 {\%}, on average), and the sounds were reliably rated as communicating the intended expressions. The vocalizations were measured for acoustic cues related to temporal aspects, intensity, fundamental frequency (f0), and voice quality. These cues alone provide sufficient information to discriminate between emotion categories, as indicated by statistical classification procedures; they are also predictors of listeners' emotion ratings, as indicated by multiple regression analyses. This set of stimuli seems a valuable addition to currently available expression corpora for research on emotion processing. It is suitable for behavioral and neuroscience research and might as well be used in clinical settings for the assessment of neurological and psychiatric patients. The corpus can be downloaded from Supplementary Materials.
The temporal characteristics of speech can be captured by examining the distributions of the durations of measurable speech components, namely speech segment durations and pause durations. However, several barriers prevent the easy analysis of pause durations: The first problem is that natural speech is noisy, and although recording contrived speech minimizes this problem, it also discards diagnostic information about cognitive processes inherent in the longer pauses associated with natural speech. The second issue concerns setting the distribution threshold, and consists of the problem of appropriately classifying pause segments as either short pauses reflecting articulation or long pauses reflecting cognitive processing, while minimizing the overall classification error rate. This article describes a fully automated system for determining the locations of speech-pause transitions and estimating the temporal parameters of both speech and pause distributions in natural speech. We use the properties of Gaussian mixture models at several stages of the analysis, in order to identify theoretical components of the data distributions, to classify speech components, to compute durations, and to calculate the relevant statistics.;
The Writing Pal is an intelligent tutoring system that provides writing strategy training. A large part of its artificial intelligence resides in the natural language processing algorithms to assess essay quality and guide feedback to students. Because writing is often highly nuanced and subjective, the development of these algorithms must consider a broad array of linguistic, rhetorical, and contextual features. This study assesses the potential for computational indices to predict human ratings of essay quality. Past studies have demonstrated that linguistic indices related to lexical diversity, word frequency, and syntactic complexity are significant predictors of human judgments of essay quality but that indices of cohesion are not. The present study extends prior work by including a larger data sample and an expanded set of indices to assess new lexical, syntactic, cohesion, rhetorical, and reading ease indices. Three models were assessed. The model reported by McNamara, Crossley, and McCarthy (Written Communication 27:57-86, 2010) including three indices of lexical diversity, word frequency, and syntactic complexity accounted for only 6{\%} of the variance in the larger data set. A regression model including the full set of indices examined in prior studies of writing predicted 38{\%} of the variance in human scores of essay quality with 91{\%} adjacent accuracy (i.e., within 1 point). A regression model that also included new indices related to rhetoric and cohesion predicted 44{\%} of the variance with 94{\%} adjacent accuracy. The new indices increased accuracy but, more importantly, afford the means to provide more meaningful feedback in the context of a writing tutoring system.
An increasing number of studies are investigating the cognitive processes underlying human-object interactions. For instance, several researchers have manipulated the type of grip associated with objects in order to study the role of the objects' motor affordances in cognition. The objective of the present study was to develop norms for the types of grip employed when grasping and using objects, with a set of 296 photographs of objects. On the basis of these ratings, we computed measures of agreement to evaluate the extent to which participants agreed about the grip used to interact with these objects. We also collected ratings on the dissimilarity between the grips employed for grasping and for using objects, as well as the number of actions that can typically be performed with the objects. Our results showed grip agreements of 67 {\%} for grasping and of 65 {\%} for using objects. Moreover, our pattern of correlations is highly consistent with the idea that the grips for grasping and using objects represent two different motor dimensions of the objects.
In this article, we describe the most extensive set of word associations collected to date. The database contains over 12,000 cue words for which more than 70,000 participants generated three responses in a multiple-response free association task. The goal of this study was (1) to create a semantic network that covers a large part of the human lexicon, (2) to investigate the implications of a multiple-response procedure by deriving a weighted directed network, and (3) to show how measures of centrality and relatedness derived from this network predict both lexical access in a lexical decision task and semantic relatedness in similarity judgment tasks. First, our results show that the multiple-response procedure results in a more heterogeneous set of responses, which lead to better predictions of lexical access and semantic relatedness than do single-response procedures. Second, the directed nature of the network leads to a decomposition of centrality that primarily depends on the number of incoming links or in-degree of each node, rather than its set size or number of outgoing links. Both studies indicate that adequate representation formats and sufficiently rich data derived from word associations represent a valuable type of information in both lexical and semantic processing.
The combining of individual concepts to form an emergent concept is a fundamental aspect of language, yet much less is known about it than about processing isolated words or sentences. To facilitate research on conceptual combination, we provide meaningfulness ratings for a large set of (2,160) noun-noun pairs. Half of these pairs (1,080) are reversed versions of the other half (e.g., SKI JACKET and JACKET SKI), to facilitate the comparison of successful and unsuccessful conceptual combination independently of constituent lexical items. The computer code used for obtaining these ratings through a Web interface is provided. To further enhance the usefulness of this resource, ancillary measures obtained from other sources are also provided for each pair. These measures include associate production norms, contextual relatedness in terms of latent semantic analysis distance, total number of letters, phrase-level usage frequency, and word-level usage frequency summed across the words in each pair. Results of correlation and regression analyses are also provided for a quantitative description of the stimulus set. A subset of these stimuli was used to identify neural correlates of successful conceptual combination Graves, Binder, Desai, Conant, {\&} Seidenberg, (NeuroImage 53:638-646, 2010). The stimuli can be used in other research and also provide benchmark data for evaluating the effectiveness of computational algorithms for predicting meaningfulness of noun-noun pairs.
The aim of the present study was to provide normative data for the Croatian language using 346 visually presented objects (Cycowicz, Friedman, Rothstein, {\&} Snodgrass Journal of Experimental Child Psychology 65:171-237, 1997; Roach, Schwartz, Martin, Grewal, {\&} Brecher Clinical Aphasiology 24:121-133, 1996; Snodgrass {\&} Vanderwart Journal of Experimental Psychology: Human Learning and Memory 6:174-215, 1980). Picture naming was standardized according to seven variables: naming latency, name agreement, familiarity, visual complexity, word length, number of syllables, and word frequency. The descriptive statistics and correlation pattern of the variables collected in the present study were consistent with normative studies in other languages. These normative data for pictorial stimuli named by young healthy Croatian native speakers will be useful in studies of perception, language, and memory, as well as for preoperative and intraoperative mapping of speech and language brain areas.
Indicators of letter visual similarity have been used for controlling the design of empirical and neuropsychological studies and for rigorously determining the factors that underlie reading ability and literacy acquisition. Additionally, these letter similarity/confusability matrices have been useful for studies examining more general aspects of human cognition, such as perception. Despite many letter visual-similarity matrices being available, they all have two serious limitations if they are to be used by researchers in the reading domain: (1) They have been constructed using atypical reading data obtained from speeded reading-aloud tasks and/or under degraded presentation conditions; (2) they only include letters from the English alphabet. Although some letter visual-similarity matrices have been constructed using data gathered from normal reading conditions, these either are based on old fonts, which may not resemble the letters found in modern print, or were never published. For the first time, this article presents a comprehensive letter visual-similarity/confusability matrix that has been constructed based on untimed responses to clearly presented upper- and lowercase letters that are present in many languages that use Latin-based alphabets, including Catalan, Dutch, English, French, Galician, German, Italian, Portuguese, and Spanish. Such a matrix will be useful for researchers interested in the processes underpinning reading and literacy acquisition.
The substantial increase of social networks and their combination with mobile devices make rigorous analysis of the outcomes of such system of paramount importance for intelligence gathering and decision making purposes. Since the introduction of Twitter system in 2006, tweeting emerged as an efficient open social network that attracted interest from various research/commercial and military communities. This paper investigates the current software architecture of Twitter system and put forward a new architecture dedicated for semantic and spatial analysis of Twitter data. Especially, Twitter Streaming API was used as a basis for tweet collection data stored in MySQL like database. While Lucene system together with WordNet lexical database linked to advanced natural language processing and PostGIS platform were used to ensure semantic and spatial analysis of the collected data. A functional diversity approach was implemented to enforce fault tolerance for the data collection part where its performances were evaluated through comparison with alternative approaches. The proposal enables the discovery of spatial patterns within geo-located Twitter and can provide the user or operator with useful unforeseen elements. {\textcopyright} 2012 Elsevier B.V. All rights reserved.
We present modality exclusivity norms for 400 randomly selected noun concepts, for which participants provided perceptual strength ratings across five sensory modalities (i.e., hearing, taste, touch, smell, and vision). A comparison with previous norms showed that noun concepts are more multimodal than adjective concepts, as nouns tend to subsume multiple adjectival property concepts (e.g., perceptual experience of the concept baby involves auditory, haptic, olfactory, and visual properties, and hence leads to multimodal perceptual strength). To show the value of these norms, we then used them to test a prediction of the sound symbolism hypothesis: Analysis revealed a systematic relationship between strength of perceptual experience in the referent concept and surface word form, such that distinctive perceptual experience tends to attract distinctive lexical labels. In other words, modality-specific norms of perceptual strength are useful for exploring not just the nature of grounded concepts, but also the nature of form-meaning relationships. These norms will be of benefit to those interested in the representational nature of concepts, the roles of perceptual information in word processing and in grounded cognition more generally, and the relationship between form and meaning in language development and evolution.
NIM is Web-based software developed to help experimenters with some of the usual tasks carried out in psycholinguistic studies. It allows the user to search for words according to several variables, such as length, matching substrings, lexical frequency, or part of speech, in English, Spanish, and Catalan. NIM also provides the user with the possibilities to obtain different word metrics, such as lexical frequency, length, and part of speech; to find intralanguage and cross-language lexical neighbors; and to get control words for critical stimuli. Regardless of the language used, the program also enables the user to get the orthographic similarity between word pairs and to identify repeated items in lists of experimental stimuli. NIM is free and is publicly available at http://psico.fcep.urv.cat/utilitats/nim/ .
Sensory experience ratings (SERs) reflect the extent to which a word evokes a sensory and/or perceptual experience in the mind of the reader. Juhasz, Yap, Dicke, Taylor, and Gullick (Quarterly Journal of Experimental Psychology 64:1683-1691, 2011) demonstrated that SERs predict a significant amount of variance in lexical-decision response times in two megastudies of lexical processing when a large number of established psycholinguistic variables are controlled for. Here we provide the SERs for the 2,857 monosyllabic words used in the Juhasz et al. study, as well as newly collected ratings on 3,000 disyllabic words. New analyses with the combined set of words confirmed that SERs predict a reliable amount of variance in the lexical-decision response times and naming times from the English Lexicon Project (Balota, Yap, Cortese, Hutchison, Kessler, Loftus, {\&} Treiman, Behavior Research Methods 39:445-459, 2007) when a large number of surface, lexical, and semantic variables are statistically controlled for. The results suggest that the relative availability of sensory/perceptual information associated with a word contributes to lexical-semantic processing.
Built on the basis of the methods developed for Princeton WordNet and EuroWordNet, Arabic WordNet (AWN) has been an interesting project which combines WordNet structure compliance with Arabic particularities. In this paper, some AWN shortcomings related to coverage and usability are addressed. The use of AWN in question/answering (Q/A) helped us to deeply evaluate the resource from an experience-based perspective. Accordingly, an enrichment of AWN was built by semi-automatically extending its content. Indeed, existing approaches and/or resources developed for other languages were adapted and used for AWN. The experiments conducted in Arabic Q/A have shown an improvement of both AWN coverage as well as usability. Concerning coverage, a great amount of named entities extracted from YAGO were connected with corresponding AWN synsets. Also, a significant number of new verbs and nouns (including Broken Plural forms) were added. In terms of usability, thanks to the use of AWN, the performance for the AWN-based Q/A application registered an overall improvement with respect to the following three measures: accuracy (+9.27 % improvement), mean reciprocal rank (+3.6 improvement) and number of answered questions (+12.79 % improvement).
Since long it has been noted that cross-linguistically recurring polysemies can serve as an indi-cator of conceptual relations, and quite a few approaches to model and analyze such data have been proposed in the recent past. Although – given the nature of the data – it seems natural to model and analyze it with the help of network techniques, there are only a few approaches which make explicit use of them. In this paper, we show how the strict application of weighted network models helps to get more out of cross-linguistic polysemies than would be possible using approaches that are only based on item-to-item comparison. For our study we use a large dataset consisting of 1252 semantic items translated into 195 different languages covering 44 different language families. By analyz-ing the community structure of the network reconstructed from the data, we find that a majority of the concepts (68{\%}) can be separated into 104 large communities consisting of five and more nodes. These large communities almost exclusively constitute meaningful groupings of concepts into con-ceptual fields. They provide a valid starting point for deeper analyses of various topics in historical semantics, such as cognate detection, etymological analysis, and semantic reconstruction.
LAPSyD, the Lyon-Albuquerque Phonological Systems Database, is an online phonological database equipped with powerful query, mapping and visualization tools. It stems from the UPSID and WALS databases, enhanced with newly validated data not only covering segmental inventories but also syllable structures, stress and tonal systems. In its current version it covers around 700 languages and it is accessible at http://www.lapsyd.ddl.ish-lyon.cnrs.fr. This paper provides a description of the data structure in LAPSyD and the features of the interface. Brief illustrations of the types of analysis that can be done with this tool are provided, exploiting the ability to cross-reference data on segments, other phonological properties and language location. Copyright {\textcopyright} 2013 ISCA.
This paper describes the SubCat-Extractor as a novel tool to obtain verb subcategori-sation data from parsed German web corpora. The SubCat-Extractor is based on a set of detailed rules that go beyond what is directly accessible in the parses. The extracted subcategorisation database is represented in a compact but linguistically detailed and flexible format, comprising various aspects of verb information, complement information and sentence information , within a one-line-per-clause style. We describe the tool, the extraction rules and the obtained resource database, as well as actual and potential uses in computational linguistics.
Semantic norms for properties produced by native speakers are valuable tools for researchers interested in the structure of semantic memory and in category-specific semantic deficits in individuals following brain damage. The aims of this study were threefold. First, we sought to extend existing semantic norms by adopting an empirical approach to category (Exp. 1) and concept (Exp. 2) selection, in order to obtain a more representative set of semantic memory features. Second, we extensively outlined a new set of semantic production norms collected from Italian native speakers for 120 artifactual and natural basic-level concepts, using numerous measures and statistics following a feature-listing task (Exp. 3b). Finally, we aimed to create a new publicly accessible database, since only a few existing databases are publicly available online.
We make available word-by-word self-paced reading times and eye-tracking data over a sample of English sentences from narrative sources. These data are intended to form a gold standard for the evaluation of computational psycholinguistic models of sentence comprehension in English. We describe stimuli selection and data collection and present descriptive statistics, as well as comparisons between the two sets of reading times.
As researchers explore the complexity of memory and language hierarchies, the need to expand normed stimulus databases is growing. Therefore, we present 1,808 words, paired with their features and concept-concept information, that were collected using previously established norming methods (McRae, Cree, Seidenberg, {\&} McNorgan Behavior Research Methods 37:547-559, 2005). This database supplements existing stimuli and complements the Semantic Priming Project (Hutchison, Balota, Cortese, Neely, Niemeyer, Bengson, {\&} Cohen-Shikora 2010). The data set includes many types of words (including nouns, verbs, adjectives, etc.), expanding on previous collections of nouns and verbs (Vinson {\&} Vigliocco Journal of Neurolinguistics 15:317-351, 2008). We describe the relation between our and other semantic norms, as well as giving a short review of word-pair norms. The stimuli are provided in conjunction with a searchable Web portal that allows researchers to create a set of experimental stimuli without prior programming knowledge. When researchers use this new database in tandem with previous norming efforts, precise stimuli sets can be created for future research endeavors.
Extraction and normalization of temporal expressions from documents are important steps towards deep text understanding and a prerequisite for many NLP tasks such as information extraction, question answering, and document summarization. There are different ways to express (the same) temporal information in documents. However, after identifying temporal expressions, they can be normalized according to some standard format. This allows the usage of temporal information in a term- and language-independent way. In this paper, we describe the challenges of temporal tagging in different domains, give an overview of existing annotated corpora, and survey existing approaches for temporal tagging. Finally, we present our publicly available temporal tagger HeidelTime, which is easily extensible to further languages due to its strict separation of source code and language resources like patterns and rules. We present a broad evaluation on multiple languages and domains on existing corpora as well as on a newly created corpus for a language/domain combination for which no annotated corpus has been available so far.
Literature review on prosody reveals the lack of corpora for prosodic studies in Catalan and Spanish. In this paper, we present a corpus intended to fill this gap. The corpus comprises two distinct data-sets, a news subcorpus and a dialogue subcorpus, the latter containing either conversational or task-oriented speech. More than 25 h were recorded by twenty eight speakers per language. Among these speakers, eight were professional (four radio news broadcasters and four advertising actors). The entire material presented here has been transcribed, aligned with the acoustic signal and prosodically annotated. Two major objectives have guided the design of this project: (i) to offer a wide coverage of representative real-life communicative situations which allow for the characterization of prosody in these two languages; and (ii) to conduct research studies which enable us to contrast the speakers different speaking styles and discursive practices. All material contained in the corpus is provided under a Creative Commons Attribution 3.0 Unported License.
In cognitive science, results are obtained following the manipulation of one stimulus' variable and the control of potential confounding variables. To avoid the tedious task of measuring confounding effects, scientists often refer to normative sets of stimuli. The Bank of Standardized Stimuli (BOSS) is one of these sets. It initially included 480 normative stimuli of common objects and norms for seven variables (name, category, familiarity, visual complexity, object's typicality, manipulability and orientation). The BOSS has expanded and provides a wider variety of stimuli in order to fulfill the needs of experiments. To date, the latest version of the BOSS is comprised of 1,420 normative stimuli, including photos of animals, and new norms (color diagnositicity, symmetry, and action related norms). In order to demonstrate the influence of normative variables on cognitions, experiments on episodic memory were completed using the BOSS. Analyses were conducted as a function of the norms and indicated that name agreement, visual complexity, object/viewpoint agreement, symmetry, and color diagnosticity all influenced memory in distinct ways mostly by affecting the performance to new stimuli and by inducing response biases.
We report object-naming and object recognition times collected from Russian native speakers for the colorized version of the Snodgrass and Vanderwart (Journal of Experimental Psychology: Human Learning and Memory 6:174-215, 1980) pictures (Rossion {\&} Pourtois, Perception 33:217-236, 2004). New norms for image variability, body-object interaction [BOI], and subjective frequency collected in Russian, as well as new name agreement scores for the colorized pictures in French, are also reported. In both object-naming and object comprehension times, the name agreement, image agreement, and age-of-acquisition variables made significant independent contributions. Objective word frequency was reliable in object-naming latencies only. The variables of image variability, BOI, and subjective frequency were not significant in either object naming or object comprehension. Finally, imageability was reliable in both tasks. The new norms and object-naming and object recognition times are provided as supplemental materials.
Although many visual stimulus databases exist, none has data on item similarity levels for multiple items of each kind of stimulus. We present such data for 50 sets of grayscale object photographs. Similarity measures between pictures in each set (e.g., 25 different buttons) were collected using a similarity-sorting method (Goldstone, Behavior Research Methods Instruments {\&} Computers, 26(4):381-386, 1994). A validation experiment used data from 1 picture set and compared responses from standard pairwise measures. This showed close agreement. The similarity-sorting measures were then standardized across picture sets, using pairwise ratings. Finally, the standardized similarity distances were validated in a recognition memory experiment; false alarms increased when targets and foils were more similar. These data will facilitate memory and perception research that needs to make comparisons between stimuli with a range of known target-foil similarities.
We present a corpus of transcribed spoken Hebrew that reflects spoken interactions between children and adults. The corpus is an integral part of the CHILDES database, which distributes similar corpora for over 25 languages. We introduce a dedicated transcription scheme for the spoken Hebrew data that is sensitive to both the phonology and the standard orthography of the language. We also introduce a morphological analyzer that was specifically developed for this corpus. The analyzer adequately covers the entire corpus, producing detailed correct analyses for all tokens. Evaluation on a new corpus reveals high coverage as well. Finally, we describe a morphological disambiguation module that selects the correct analysis of each token in context. The result is a high-quality morphologically-annotated CHILDES corpus of Hebrew, along with a set of tools that can be applied to new corpora.
Social media is a natural laboratory for linguistic and sociological purposes. In micro-blogging platforms such as Twitter, people share hundreds of millions of short messages about their lives and experiences on a daily basis. These messages, coupled with metadata about their authors, provide an opportunity to understand a wide variety of phenomena ranging from political polarization to geographic and demographic lexical variation. Lack of publicly available micro-blogging datasets has been a hindrance to replicable research. In this paper, I introduce Rovereto Twitter n-gram corpus, a publicly available n-gram dataset of Twitter messages, which contains gender-of-the-author and time-of-posting tags associated with the n-grams. I compare this dataset to a more traditional web-based corpus and present a case study which shows the potential of combining an n-gram corpus with demographic metadata.
We introduce the ACL Anthology Network (AAN), a comprehensive manually curated networked database of citations, collaborations, and summaries in the field of Computational Linguistics. We also present a number of statistics about the network including the most cited authors, the most central collaborators, as well as network statistics about the paper citation, author citation, and author collaboration networks.
The project on the Romanian wordnet has been under continuous development for more than 10 years now. It has been in constant use in many projects and applications which determined, to a large extent, the content and coverage of various lexical domains. The article presents the most recent developments of the Romanian wordnet and offers quantitative data for its current version.
In this study, we present the normative values of the adaptation of the International Affective Digitized Sounds (IADS-2; Bradley {\&} Lang, 2007a) for European Portuguese (EP). The IADS-2 is a standardized database of 167 naturally occurring sounds that is widely used in the study of emotions. The sounds were rated by 300 college students who were native speakers of EP, in the three affective dimensions of valence, arousal, and dominance, by using the Self-Assessment Manikin (SAM). The aims of this adaptation were threefold: (1) to provide researchers with standardized and normatively rated affective sounds to be used with an EP population; (2) to investigate sex and cultural differences in the ratings of affective dimensions of auditory stimuli between EP and the American (Bradley {\&} Lang, 2007a) and Spanish (Fern{\'{a}}ndez-Abascal et al., Psicothema 20:104-113 2008; Redondo, Fraga, Padr{\'{o}}n, {\&} Pi{\~{n}}eiro, Behavior Research Methods 40:784-790 2008) standardizations; and (3) to promote research on auditory affective processing in Portugal. Our results indicated that the IADS-2 is a valid and useful database of digitized sounds for the study of emotions in a Portuguese context, allowing for comparisons of its results with those of other international studies that have used the same database for stimulus selection. The normative values of the EP adaptation of the IADS-2 database can be downloaded along with the online version of this article.
We report psycholinguistic norms for 305 French idiomatic expressions (Study 1). For each of the idiomatic expressions, the following variables are reported: knowledge, predictability, literality, compositionality, subjective and objective frequency, familiarity, age of acquisition (AoA), and length. In addition, we have collected comprehension times for each idiom (Study 2). The psycholinguistic relevance of the collected norms is explained, and different analyses (descriptive statistics, correlation and multiple regression analyses) performed on the norms are reported and discussed. The entire set of norms and reading times are provided as supplemental material.
Feature-based descriptions of concepts produced by subjects in a property generation task are widely used in cognitive science to develop empirically grounded concept representations and to study systematic trends in such representations. This article introduces BLIND, a collection of parallel semantic norms collected from a group of congenitally blind Italian subjects and comparable sighted subjects. The BLIND norms comprise descriptions of 50 nouns and 20 verbs. All the materials have been semantically annotated and translated into English, to make them easily accessible to the scientific community. The article also presents a preliminary analysis of the BLIND data that highlights both the large degree of overlap between the groups and interesting differences. The complete BLIND norms are freely available and can be downloaded from http://sesia.humnet.unipi.it/blind{\_}data .
This article introduces EsPal: a Web-accessible repository containing a comprehensive set of properties of Spanish words. EsPal is based on an extensible set of data sources, beginning with a 300 million token written database and a 460 million token subtitle database. Properties available include word frequency, orthographic structure and neighborhoods, phonological structure and neighborhoods, and subjective ratings such as imageability. Subword structure properties are also available in terms of bigrams and trigrams, biphones, and bisyllables. Lemma and part-of-speech information and their corresponding frequencies are also indexed. The website enables users either to upload a set of words to receive their properties or to receive a set of words matching constraints on the properties. The properties themselves are easily extensible and will be added over time as they become available. It is freely available from the following website: http://www.bcbl.eu/databases/espal/ .
Speeded naming and lexical decision data for 1,661 target words following related and unrelated primes were collected from 768 subjects across four different universities. These behavioral measures have been integrated with demographic information for each subject and descriptive characteristics for every item. Subjects also completed portions of the Woodcock-Johnson reading battery, three attentional control tasks, and a circadian rhythm measure. These data are available at a user-friendly Internet-based repository ( http://spp.montana.edu ). This Web site includes a search engine designed to generate lists of prime-target pairs with specific characteristics (e.g., length, frequency, associative strength, latent semantic similarity, priming effect in standardized and raw reaction times). We illustrate the types of questions that can be addressed via the Semantic Priming Project. These data represent the largest behavioral database on semantic priming and are available to researchers to aid in selecting stimuli, testing theories, and reducing potential confounds in their studies.
This paper provides a deduction-based approach for automatically classifying compound-internal relations in GermaNet, the German version of the Princeton WordNet for English. More specifically, meronymic relations between simplex and compound nouns provide the necessary input to the deduction patterns that involve different types of compound-internal relations. The scope of these deductions extends to all four meronymic relations modeled in version 6.0 of GermaNet: component, member, substance, and portion. This deduction-based approach provides an effective method for automatically enriching the set of semantic relations included in GermaNet.
The English-language Princeton WordNet (PWN) and some wordnets for other languages have been extensively used as lexical–semantic knowledge sources in language technology applications, due to their free availability and their size. The ubiquitousness of PWN-type wordnets tends to overshadow the fact that they represent one out of many possible choices for structuring a lexical–semantic resource, and it could be enlightening to look at a differently structured resource both from the point of view of theoretical–methodological considerations and from the point of view of practical text processing requirements. The resource described here—SALDO—is such a lexical–semantic resource, intended primarily for use in language technology applications, and offering an alternative organization to PWN-style wordnets. We present our work on SALDO, compare it with PWN, and discuss some implications of the differences. We also describe an integrated infrastructure for computational lexical resources where SALDO forms the central component.
The aim of this article is to describe a database of diphone positional frequencies in French. More specifically, we provide frequencies for word-initial, word-internal, and word-final diphones of all words extracted from a subtitle corpus of 50 million words that come from movie and TV series dialogue. We also provide intra- and intersyllable diphone frequencies, as well as interword diphone frequencies. To our knowledge, no other such tool is available to psycholinguists for the study of French sequential probabilities. This database and its new indicators should help researchers conducting new studies on speech segmentation.
Information about the affective meanings of words is used by researchers working on emotions and moods, word recognition and memory, and text-based sentiment analysis. Three components of emotions are traditionally distinguished: valence (the pleasantness of a stimulus), arousal (the intensity of emotion provoked by a stimulus), and dominance (the degree of control exerted by a stimulus). Thus far, nearly all research has been based on the ANEW norms collected by Bradley and Lang (1999) for 1,034 words. We extended that database to nearly 14,000 English lemmas, providing researchers with a much richer source of information, including gender, age, and educational differences in emotion norms. As an example of the new possibilities, we included stimuli from nearly all of the category norms (e.g., types of diseases, occupations, and taboo words) collected by Van Overschelde, Rawson, and Dunlosky (Journal of Memory and Language 50:289-335, 2004), making it possible to include affect in studies of semantic memory.
The MoveOn speech and noise database was purposely designed and implemented in support of research on spoken dialogue interaction in a motorcycle environment. The distinctiveness of the MoveOn database results from the requirements of the application domain—an information support and operational command and control system for the two-wheel police force—and also from the specifics of the adverse open-air acoustic environment. In this article, we first outline the target application, motivating the database design and purpose, and then report on the implementation details. The main challenges related to the choice of equipment, the organization of recording sessions, and some difficulties that were experienced during this effort, are discussed. We offer a detailed account of the database statistics, the suggested data splits in subsets, and discuss results from automatic speech recognition experiments which illustrate the degree of complexity of the operational environment.
Availability of databases is a necessity in the speech processing field. The publically available databases in Arabic language are few. In this paper we describe a rich database for Arabic language. The database is rich in many dimensions: in text, environments, microphone type, number of recording sessions, recording system, the transmission channel, the country of origin, and the mother language. This richness makes the database an important resource for research in Arabic Language processing and very useful in many speech processing tasks, such as speaker recognition, speech recognition, and accent identification. The speakers were speaking in Modern Standard Arabic (MSA).
The Quranic Arabic Corpus (http://corpus.quran.com) is a collaboratively constructed linguistic resource initiated at the University of Leeds, with multiple layers of annotation including part-of-speech tagging, morphological segmentation (Dukes and Habash 2010) and syntactic analysis using dependency grammar (Dukes and Buckwalter 2010). The motivation behind this work is to produce a resource that enables further analysis of the Quran, the 1,400 year-old central religious text of Islam. This project contrasts with other Arabic treebanks by providing a deep linguistic model based on the historical traditional grammar known as i′rāb (إعراب). By adapting this well-known canon of Quranic grammar into a familiar tagset, it is possible to encourage online annotation by Arabic linguists and Quranic experts. This article presents a new approach to linguistic annotation of an Arabic corpus: online supervised collaboration using a multi-stage approach. The different stages include automatic rule-based tagging, initial manual verification, and online supervised collaborative proofreading. A popular website attracting thousands of visitors per day, the Quranic Arabic Corpus has approximately 100 unpaid volunteer annotators each suggesting corrections to existing linguistic tagging. To ensure a high-quality resource, a small number of expert annotators are promoted to a supervisory role, allowing them to review or veto suggestions made by other collaborators. The Quran also benefits from a large body of existing historical grammatical analysis, which may be leveraged during this review. In this paper we evaluate and report on the effectiveness of the chosen annotation methodology. We also discuss the unique challenges of annotating Quranic Arabic online and describe the custom linguistic software used to aid collaborative annotation.
In recent years, building reference speech corpora was an important part of the activities which provided the necessary linguistic infrastructure in many European countries, for languages with many speakers (e.g., French, German, Spanish, Italian) as well as for those with smaller numbers of speakers (e.g., Swedish, Dutch, Czech, Slovak). This paper describes the process of the creation of a reference speech corpus and its distribution to potential users, as it was done in the case of the Slovene corpus GOS. The corpus structure and fieldwork experiences with recording, labelling system, and two levels of transcription (pronunciation-based and standardized) are described, as well as the main characteristics of the corpus interface (web concordancer) and the availability of the original corpus files.
To cite this version: Pollet Samvelian, Pegah Faghiri. Introducing PersPred, a syntactic and semantic database for Persian Complex Predicates. Abstract This paper introduces PersPred, the first manually elaborated syntactic and semantic database for Persian Complex Predicates (CPs). Beside their theoretical interest, Per-sian CPs constitute an important challenge in Persian lexicography and for NLP. The first delivery, PersPred 1 1 , contains 700 CPs, for which 22 fields of lexical, syntactic and semantic information are encoded. The semantic classification PersPred provides allows to account for the productivity of these combinations in a way which does justice to their compositionality without overlooking their id-iomaticity.
This paper describes the conversion of ItalwordNet and of a domain WordNet into RDF and their linking to the (L)LOD cloud and to other existing resources. A brief presentation of the resources is given, and the conversion and resulting datasets are described.
Normative data on the objective age of acquisition (AoA) for 286 Russian words are presented in this article. In addition, correlations between the objective AoA and subjective ratings, name agreement, picture name agreement, imageability, familiarity, word frequency, and word length are provided, as are correlations between the objective AoA and two measures of exemplar dominance (exemplar generation frequency and the number of times an exemplar was named first). The correlations between the aforementioned variables are generally consistent with the correlations reported in other normative studies. The objective AoA data are highly correlated with the subjective AoA ratings, whereas the correlations between the objective AoA and other psycholinguistic variables are moderate. The correlations between the objective AoA of Russian words and similar data for other languages are moderately high. The complete word norms may be downloaded from supplementary material.
Semantic ambiguity is typically measured by sum-ming the number of senses or dictionary definitions that a word has. Such measures are somewhat subjective and may not adequately capture the full extent of variation in word meaning, particularly for polysemous words that can be used in many different ways, with subtle shifts in meaning. Here, we describe an alternative, computationally derived measure of ambiguity based on the proposal that the meanings of words vary continuously as a function of their contexts. On this view, words that appear in a wide range of contexts on diverse topics are more variable in meaning than those that appear in a restricted set of similar contexts. To quantify this variation, we performed latent semantic analysis on a large text corpus to estimate the semantic similarities of different linguistic contexts. From these estimates, we calculated the degree to which the different contexts associated with a given word vary in their meanings. We term this quantity a word's semantic diversity (SemD). We suggest that this approach provides an objective way of quantifying the subtle, context-dependent variations in word meaning that are often present in language. We demonstrate that SemD is correlated with other measures of ambiguity and contextual variability, as well as with frequency and imageability. We also show that SemD is a strong predictor of performance in semantic judgments in healthy individuals and in patients with semantic deficits, accounting for unique variance beyond that of other predictors. SemD values for over 30,000 English words are provided as supplementary materials.
Newly measured rating norms provide a database of emotion-related dimensions for 524 French trait words. Measures include valence, approach/avoidance tendencies associated with the trait, possessor- and other-relevance of the trait, and discrete emotions conveyed by the trait (i.e., anger, disgust, fear, happiness, and sadness). The normative data were obtained from 328 participants and were revealed to be stable across samples and gender. These data go beyond a dimensional structure and consider more fine-grained descriptions such as the categorical emotions, as well as the perspective of the evaluator conveyed by the traits. They should thus be particularly useful for researchers interested in emotion or in the emotional dimension of cognition, action, or personality. The database is available as supplementary material.
This article presents norms of valence/pleasantness, activity/arousal, power/dominance, and age of acquisition for 4,300 Dutch words, mainly nouns, adjectives, adverbs, and verbs. The norms are based on ratings with a 7-point Likert scale by independent groups of students from two Belgian (Ghent and Leuven) and two Dutch (Rotterdam and Leiden-Amsterdam) samples. For each variable, we obtained high split-half reliabilities within each sample and high correlations between samples. In addition, the valence ratings of a previous, more limited study (Hermans {\&} De Houwer, Psychologica Belgica, 34:115-139, 1994) correlated highly with those of the present study. Therefore, the new norms are a valuable source of information for affective research in the Dutch language.
Stimulus material for studying object-directed actions is needed in different research contexts, such as action observation, action memory, and imitation. Action items have been generated many times in individual laboratories across the world, but they are used in very few experiments. For future studies in the field, it would be worthwhile to have a larger set of action stimulus material available to a broader research community. Some smaller action databases have already been published, but those often focus on psycholinguistic parameters and static action stimuli. With this article, we introduce an action database with dynamic action stimuli. The database contains action descriptions of 1,754 object-directed actions that have been rated for familiarity in Germany and in China. For 784 of these actions, action video clips are available. With the use of our database, it is possible to identify actions that differ in familiarity between Western and Eastern cultures. This variable may be of interest to some researchers in the field, since it has been shown that familiarity influences action information processing. Action descriptions are listed and categorized in tables that can be downloaded, along with the corresponding video clips, as supplemental material.
In the context of Systemic Functional Linguistics, Appraisal is a theory describing the types of language utilised in communicating emotion and opinion. Robust automatic analyses of Appraisal could contribute in a number of ways to computational sentiment analysis by: distinguishing various types of evaluation, for example affect, ethics or aesthetics; discriminating between an author’s opinions and the opinions of authors referenced by the author and determining the strength of evaluations. This paper reviews the typology described by Appraisal, presents a methodology for annotating Appraisal, and the use of this to annotate a corpus of book reviews. It discusses an inter-annotator agreement study, and considers instances of systematic disagreement that indicate areas in which Appraisal may be refined or clarified. Although the annotation task is difficult, there are many instances where the annotators agree; these are used to create a gold-standard corpus for future experimentation with Appraisal.
We present a new database of lexical decision times for English words and nonwords, for which two groups of British participants each responded to 14,365 monosyllabic and disyllabic words and the same number of nonwords for a total duration of 16 h (divided over multiple sessions). This database, called the British Lexicon Project (BLP), fills an important gap between the Dutch Lexicon Project (DLP; Keuleers, Diependaele, {\&} Brysbaert, Frontiers in Language Sciences. Psychology, 1, 174, 2010) and the English Lexicon Project (ELP; Balota et al., 2007), because it applies the repeated measures design of the DLP to the English language. The high correlation between the BLP and ELP data indicates that a high percentage of variance in lexical decision data sets is systematic variance, rather than noise, and that the results of megastudies are rather robust with respect to the selection and presentation of the stimuli. Because of its design, the BLP makes the same analyses possible as the DLP, offering researchers with a new interesting data set of word-processing times for mixed effects analyses and mathematical modeling. The BLP data are available at http://crr.ugent.be/blp and as Electronic Supplementary Materials.
The focus of this article is on the creation of a collection of sentences manually annotated with respect to their sentence structure. We show that the concept of linear segments—linguistically motivated units, which may be easily detected automatically—serves as a good basis for the identification of clauses in Czech. The segment annotation captures such relationships as subordination, coordination, apposition and parenthesis; based on segmentation charts, individual clauses forming a complex sentence are identified. The annotation of a sentence structure enriches a dependency-based framework with explicit syntactic information on relations among complex units like clauses. We have gathered a collection of 3,444 sentences from the Prague Dependency Treebank, which were annotated with respect to their sentence structure (these sentences comprise 10,746 segments forming 6,341 clauses). The main purpose of the project is to gain a development data—promising results for Czech NLP tools (as a dependency parser or a machine translation system for related languages) that adopt an idea of clause segmentation have been already reported. The collection of sentences with annotated sentence structure provides the possibility of further improvement of such tools.
This study presents the adaptation of the Affective Norms for English Words (ANEW; Bradley {\&} Lang, 1999a) for European Portuguese (EP). The EP adaptation of the ANEW was based on the affective ratings made by 958 college students who were EP native speakers. Subjects assessed about 60 words by considering the affective dimensions of valence, arousal, and dominance, using the Self-Assessment Manikin (SAM) in either a paper-and-pencil or a Web survey procedure. Results of the adaptation of the ANEW for EP are presented. Furthermore, the differences between EP, American (Bradley {\&} Lang, 1999a), and Spanish (Redondo, Fraga, Padr{\'{o}}n, {\&} Comesa{\~{n}}a, Behavior Research Methods, 39, 600-605, 2007) standardizations were explored. Results showed that the ANEW words were understood in a similar way by EP, American, and Spanish subjects, although some sex and cross-cultural differences were observed. The EP adaptation of the ANEW is shown to be a valid and useful tool that will allow researchers to control and/or manipulate the affective properties of stimuli, as well as to develop cross-linguistic studies. The normative values of EP adaptation of the ANEW can be downloaded at http://brm.psychonomic-journals.org/content/supplemental .
There are two different levels of interoperability for language resources: operational interoperability and conceptual interoperability. The former refers to the standardization of the formal aspects of language resources so that different resources can work together. The latter refers to the standardization of the notional representation of the semantic content of the analysis. This article addresses both issues but focuses on the latter through a description of the annotation and analysis of the International Corpus of English, which is a corpus for the study of English as a global language. The project is parameterised by component, regional sub-corpora and a set of pre-defined textual categories. The one-million-word British component has been constructed, grammatically tagged, and syntactically parsed. This article is first of all a description of steps taken to ensure conformity within the project. These include corpus design, part-of-speech tagging, and syntactic parsing. The article will then present a study that examines the use of adverbial clauses across speech and writing, illustrating the imminent necessity for interoperable analysis of linguistic data.
The paper presents the MULTEXT-East language resources, a multilingual dataset for language engineering research, focused on the morphosyntactic level of linguistic description. The MULTEXT-East dataset includes the morphosyntactic specifications, morphosyntactic lexica, and a parallel corpus, the novel “1984” by George Orwell, which is sentence aligned and contains hand-validated morphosyntactic descriptions and lemmas. The resources are uniformly encoded in XML, using the Text Encoding Initiative Guidelines, TEI P5, and cover 16 languages, mainly from Central and Eastern Europe: Bulgarian, Croatian, Czech, English, Estonian, Hungarian, Macedonian, Persian, Polish, Resian, Romanian, Russian, Serbian, Slovak, Slovene, and Ukrainian. This dataset, unique in terms of languages covered and the wealth of encoding, is extensively documented, and freely available for research purposes. The paper overviews the MULTEXT-East resources by type and language and gives some conclusions and directions for further work.
We present a new online psycholinguistic resource for Greek based on analyses of written corpora combined with text processing technologies developed at the Institute for Language & Speech Processing (ILSP), Greece. The “ILSP PsychoLinguistic Resource” (IPLR) is a freely accessible service via a dedicated web page, at http://speech.ilsp.gr/iplr. IPLR provides analyses of user-submitted letter strings (words and nonwords) as well as frequency tables for important units and conditions such as syllables, bigrams, and neighbors, calculated over two word lists based on printed text corpora and their phonetic transcription. Online tools allow retrieval of words matching user-specified orthographic or phonetic patterns. All results and processing code (in the Python programming language) are freely available for noncommercial educational or research use.
Over recent years, there has been a growing interest in the computational treatment of nominalized Noun Phrases due to the rich semantic information they contain. These Noun Phrases can be understood as verbal paraphrases and, just like them, they can also denote argument and thematic-role relations. This paper presents the methodology followed to annotate the argument structure of deverbal nominalizations in the Spanish AnCora-Es corpus. We focus on the automated annotation process that is mostly based on the semantic information specified in a verbal lexicon but also on the syntactic and semantic information annotated in the corpus. The heuristic rules that make use of this information rely on linguistic assumptions that are also evaluated as we evaluate the reliability of the automated process. The automated annotation was manually checked in order to ensure the accuracy of the final resource. We demonstrate its feasibility (77% F-measure) and show that it facilitates corpus annotation, which is always a time-consuming and costly process. The result is the enrichment of the AnCora-Es corpus with the argument structure and thematic roles of deverbal nominalizations. It is the first Spanish corpus with this kind of information that is freely available.
The Rovereto Emotion and Cooperation Corpus (RECC) is a new resource collected to investigate the relationship between cooperation and emotions in an interactive setting. Previous attempts at collecting corpora to study emotions have shown that this data are often quite difficult to classify and analyse, and coding schemes to analyse emotions are often found not to be reliable. We collected a corpus of task-oriented (MapTask-style) dialogues in Italian, in which the segments of emotional interest are identified using psycho-physiological indexes (Heart Rate and Galvanic Skin Conductance) which are highly reliable. We then annotated these segments in accordance with novel multimodal annotation schemes for cooperation (in terms of effort) and facial expressions (an indicator of emotional state). High agreement was obtained among coders on all the features. The RECC corpus is to our knowledge the first resource with psycho-physiological data aligned with verbal and nonverbal behaviour data.
We provide imageability estimates for 3,000 disyllabic words (as supplementary materials that may be downloaded with the article from www.springerlink.com ). Imageability is a widely studied lexical variable believed to influence semantic and memory processes (see, e.g., Paivio, 1971). In addition, imageability influences basic word recognition processes (Plaut, McClelland, Seidenberg, {\&} Patterson, 1996). In fact, neuroimaging studies have suggested that reading high- and low-imageable words elicits distinct neural activation patterns for the two types e.g., Bedny {\&} Thompson-Schill (Brain and Language 98:127-139, 2006; Graves, Binder, Desai, Conant, {\&} Seidenberg NeuroImage 53:638-646, 2010). Despite the usefulness of this variable, imageability estimates have not been available for large sets of words. Furthermore, recent megastudies of word processing e.g., Balota et al. (Behavior Research Methods 39:445-459, 2007) have expanded the number of words that interested researchers can select according to other lexical characteristics (e.g., average naming latencies, lexical decision times, etc.). However, the dearth of imageability estimates (as well as those of other lexical characteristics) limits the items that researchers can include in their experiments. Thus, these imageability estimates for disyllabic words expand the number of words available for investigations of word processing, which should be useful for researchers interested in the influences of imageability both as an input and as an outcome variable.
This paper proposes to advance in the current state-of-the-art of automatic Language Resource (LR) building by taking into consideration three elements: (1) the knowledge available in existing LRs, (2) the vast amount of information available from the collaborative paradigm that has emerged from the Web 2.0 and (3) the use of standards to improve interoperability. We present a case study in which a set of LRs for different languages (WordNet for English and Spanish and Parole-Simple-Clips for Italian) are extended with Named Entities (NE) by exploiting Wikipedia and the aforementioned LRs. The practical result is a multilingual NE lexicon connected to these LRs and to two ontologies: SUMO and SIMPLE. Furthermore, the paper addresses an important problem which affects the Computational Linguistics area in the present, interoperability, by making use of the ISO LMF standard to encode this lexicon. The different steps of the procedure (mapping, disambiguation, extraction, NE identification and postprocessing) are comprehensively explained and evaluated. The resulting resource contains 974,567, 137,583 and 125,806 NEs for English, Spanish and Italian respectively. Finally, in order to check the usefulness of the constructed resource, we apply it into a state-of-the-art Question Answering system and evaluate its impact; the NE lexicon improves the system’s accuracy by 28.1%. Compared to previous approaches to build NE repositories, the current proposal represents a step forward in terms of automation, language independence, amount of NEs acquired and richness of the information represented.
The SUBTLEX-US corpus has been parsed with the CLAWS tagger, so that researchers have information about the possible word classes (parts-of-speech, or PoSs) of the entries. Five new columns have been added to the SUBTLEX-US word frequency list: the dominant (most frequent) PoS for the entry, the frequency of the dominant PoS, the frequency of the dominant PoS relative to the entry's total frequency, all PoSs observed for the entry, and the respective frequencies of these PoSs. Because the current definition of lemma frequency does not seem to provide word recognition researchers with useful information (as illustrated by a comparison of the lemma frequencies and the word form frequencies from the Corpus of Contemporary American English), we have not provided a column with this variable. Instead, we hope that the full list of PoS frequencies will help researchers to collectively determine which combination of frequencies is the most informative.
In this article, we automatically create two large and richly annotated data sets for studying the English dative alternation. With an intrinsic and an extrinsic evaluation, we address the question of whether such data sets that are obtained and enriched automatically are suitable for linguistic research, even if they contain errors. The extrinsic evaluation consists of building logistic regression models with these data sets. We conclude that the automatic approach for detecting instances of the dative alternation still needs human intervention, but that it is indeed possible to annotate the instances with features that are syntactic, semantic and discourse-related in nature. Only the automatic classification of the concreteness of nouns is problematic.
Emotional words are increasingly used in the study of word processing. To elucidate whether the experimental effects obtained with these words are due either to their affective content or to other semantic characteristics, it is necessary to conduct experiments with affectively valenced words obtained from different semantic categories. In the present article, we present affective ratings for 380 Spanish words belonging to three semantic categories: animals, people, and objects. The norms are based on the assessments made by 504 participants, who rated about 47 words either in valence and arousal, by using the Self-Assessment Manikin (Bradley {\&} Lang, Journal of Behavioral Therapy and Experimental Psychiatry, 25, 49-59. 1994), or in concreteness and familiarity. These ratings will help researchers select stimuli for experiments in which both the affective properties of words and their membership to a given semantic category have to be taken into account. The database is available as an online supplement for this article.
Words that are homonyms-that is, for which a single written and spoken form is associated with multiple, unrelated interpretations, such as COMPOUND, which can denote an {\textless} enclosure {\textgreater} or a {\textless} composite {\textgreater} meaning-are an invaluable class of items for studying word and discourse comprehension. When using homonyms as stimuli, it is critical to control for the relative frequencies of each interpretation, because this variable can drastically alter the empirical effects of homonymy. Currently, the standard method for estimating these frequencies is based on the classification of free associates generated for a homonym, but this approach is both assumption-laden and resource-demanding. Here, we outline an alternative norming methodology based on explicit ratings of the relative meaning frequencies of dictionary definitions. To evaluate this method, we collected and analyzed data in a norming study involving 544 English homonyms, using the eDom norming software that we developed for this purpose. Dictionary definitions were generally sufficient to exhaustively cover word meanings, and the methods converged on stable norms with fewer data and less effort on the part of the experimenter. The predictive validity of the norms was demonstrated in analyses of lexical decision data from the English Lexicon Project (Balota et al., Behavior Research Methods, 39, 445-459, 2007), and from Armstrong and Plaut (Proceedings of the 33rd Annual Meeting of the Cognitive Science Society, 2223-2228, 2011). On the basis of these results, our norming method obviates relying on the unsubstantiated assumptions involved in estimating relative meaning frequencies on the basis of classification of free associates. Additional details of the norming procedure, the meaning frequency norms, and the source code, standalone binaries, and user manual for the software are available at http://edom.cnbc.cmu.edu .
The iconicity of a Chinese character, or the degree to which it looks like the concept that it represents, has been suggested as affecting the learning and processing of the character. However, previous studies have not provided good empirical information on the iconicity of specific characters. To fill this gap, 40 U.S. adults with no knowledge of Chinese were given an English word or short phrase together with two Chinese characters and were asked which character matched the meaning of the English word. The right and wrong answers had the same number of strokes, and different wrong answers were used for different participants. We examined all 213 simple-structure Chinese characters that occur in textbooks for elementary school children. The overall percentage of correct responses was 53.6{\%}, slightly but significantly higher than would be expected by chance. Using a false discovery rate procedure, we found that 15 of the 213 characters were guessed at a level higher than chance. The proportion of correct responses to each character, which can be taken as an indicator of its degree of iconicity, should be useful to researchers studying Chinese character reading and writing. The full database, showing the proportion of correct guesses and other psycholinguistic variables for each character, can be downloaded from http://brm.psychonomic-journals.org/content/supplemental .
We present age-of-acquisition (AoA) ratings for 30,121 English content words (nouns, verbs, and adjectives). For data collection, this megastudy used the Web-based crowdsourcing technology offered by the Amazon Mechanical Turk. Our data indicate that the ratings collected in this way are as valid and reliable as those collected in laboratory conditions (the correlation between our ratings and those collected in the lab from U.S. students reached .93 for a subsample of 2,500 monosyllabic words). We also show that our AoA ratings explain a substantial percentage of the variance in the lexical-decision data of the English Lexicon Project, over and above the effects of log frequency, word length, and similarity to other words. This is true not only for the lemmas used in our rating study, but also for their inflected forms. We further discuss the relationships of AoA with other predictors of word recognition and illustrate the utility of AoA ratings for research on vocabulary growth.
To understand how and when object knowledge influences the neural underpinnings of language compre- hension and linguistic behavior, it is critical to determine the specific kinds of knowledge that people have. To extend the normative data currently available, we report a relatively more comprehensive set of object attribute rating norms for 559 concrete object nouns, each rated on seven attributes corresponding to sensory and motor modalities—color, mo- tion, sound, smell, taste, graspability, and pain—in addition to familiarity (376 raters, M 0 23 raters per item). The mean ratings were subjected to principal-components analysis, revealing two primary dimensions plausibly interpreted as relating to survival. We demonstrate the utility of these ratings in accounting for lexical and semantic decision latencies. These ratings should prove useful for the design and interpretation of experimental tests of conceptual and perceptual object processing.
Past research has demonstrated cross-linguistic, cross-modal, and task-dependent differences in neighborhood density effects, indicating a need to control for neighborhood variables when developing and interpreting research on language processing. The goals of the present paper are two-fold: (1) to introduce CLEARPOND (Cross-Linguistic Easy-Access Resource for Phonological and Orthographic Neighborhood Densities), a centralized database of phonological and orthographic neighborhood information, both within and between languages, for five commonly-studied languages: Dutch, English, French, German, and Spanish; and (2) to show how CLEARPOND can be used to compare general properties of phonological and orthographic neighborhoods across languages. CLEARPOND allows researchers to input a word or list of words and obtain phonological and orthographic neighbors, neighborhood densities, mean neighborhood frequencies, word lengths by number of phonemes and graphemes, and spoken-word frequencies. Neighbors can be defined by substitution, deletion, and/or addition, and the database can be queried separately along each metric or summed across all three. Neighborhood values can be obtained both within and across languages, and outputs can optionally be restricted to neighbors of higher frequency. To enable researchers to more quickly and easily develop stimuli, CLEARPOND can also be searched by features, generating lists of words that meet precise criteria, such as a specific range of neighborhood sizes, lexical frequencies, and/or word lengths. CLEARPOND is freely-available to researchers and the public as a searchable, online database and for download at http://clearpond.northwestern.edu.
In psychology, lexical norms related to the semantic properties of words, such as concreteness and valence, are important research resources. Collecting such norms by asking judges to rate the words is very time consuming, which strongly limits the number of words that compose them. In the present article, we present a technique for estimating lexical norms based on the latent semantic analysis of a corpus. The analyses conducted emphasize the technique's effectiveness for several semantic dimensions. In addition to the extension of norms, this technique can be used to check human ratings to identify words for which the rating is very different from the corpus-based estimate.
In this article, we present a set of 12 norms that characterize emotional terms in French, English, German, Spanish, Italian, and Finnish. The high correlation between the norm values in the two emotional dimensions of valence and arousal suggests an interlingual homogeneity of emotional representations and allows a significant metanorm-EMONORM-to be established with 6,383 terms characterized in valence and 4,345 terms characterized in arousal. This metanorm is a resource for creating experimental materials in studies on language and emotions. Furthermore, we perform three tests using EMONORM, with the objectives of (1) identifying basic emotions from their valence and arousal values, (2) determining the orientation of texts referring to positive and negative emotions, and (3) evaluating the intensity of emotions expressed in texts. The results are highly similar to those for human judgments. Finally, we present EMOVAL/SEMOTEX, a Web application for static and dynamic valence and arousal emotional analysis of texts using EMONORM ( http://www.semotex.fr ).
The QWERTY keyboard mediates communication for millions of language users. Here, we investigated whether differences in the way words are typed correspond to differences in their meanings. Some words are spelled with more letters on the right side of the keyboard and others with more letters on the left. In three experiments, we tested whether asymmetries in the way people interact with keys on the right and left of the keyboard influence their evaluations of the emotional valence of the words. We found the predicted relationship between emotional valence and QWERTY key position across three languages (English, Spanish, and Dutch). Words with more right-side letters were rated as more positive in valence, on average, than words with more left-side letters: the QWERTY effect. This effect was strongest in new words coined after QWERTY was invented and was also found in pseudowords. Although these data are correlational, the discovery of a similar pattern across languages, which was strongest in neologisms, suggests that the QWERTY keyboard is shaping the meanings of words as people filter language through their fingers. Widespread typing introduces a new mechanism by which semantic changes in language can arise.
In a previous article, we presented a systematic computational study of the extraction of semantic representations from the word-word co-occurrence statistics of large text corpora. The conclusion was that semantic vectors of pointwise mutual information values from very small co-occurrence windows, together with a cosine distance measure, consistently resulted in the best representations across a range of psychologically relevant semantic tasks. This article extends that study by investigating the use of three further factors--namely, the application of stop-lists, word stemming, and dimensionality reduction using singular value decomposition (SVD)--that have been used to provide improved performance elsewhere. It also introduces an additional semantic task and explores the advantages of using a much larger corpus. This leads to the discovery and analysis of improved SVD-based methods for generating semantic representations (that provide new state-of-the-art performance on a standard TOEFL task) and the identification and discussion of problems and misleading results that can arise without a full systematic study.
Arabic is the most widely spoken language in the Arab World. Most people of the Islamic World understand the Classic Arabic language because it is the language of the Qur’an. Despite the fact that in the last decade the number of Arabic Internet users (Middle East and North and East of Africa) has increased considerably, systems to analyze Arabic digital resources automatically are not as easily available as they are for English. Therefore, in this work, an attempt is made to build a real time Named Entity Recognition system that can be used in web applications to detect the appearance of specific named entities and events in news written in Arabic. Arabic is a highly inflectional language, thus we will try to minimize the impact of Arabic affixes on the quality of the pattern recognition model applied to identify named entities. These patterns are built up by processing and integrating different gazetteers, from DBPedia (http://dbpedia.org/About, 2009) to GATE (A general architecture for text engineering, 2009) and ANERGazet (http://users.dsic.upv.es/grupos/nle/?file=kop4.php).
Throughout the last decades, numerous picture data sets have been developed, such as the Snodgrass and Vanderwart (1980) set, and have been normalized for variables such as name and familiarity; however, due to cultural and linguistic differences, norms can vary from one country to another. The effect due specifically to culture has already been demonstrated by comparing samples from different countries where the same language is spoken. On the other hand, it is still not clear how differences between languages may affect norms. The present study explores this issue by collecting and comparing norms on names and many other features from French Canadian speakers and English Canadian speakers living in Montreal, who thus live in similar cultural environments. Norms were collected for the photos of objects from the Bank of Standardized Stimuli (BOSS) by asking participants to name the objects, to categorize them, and to rate their familiarity, visual complexity, object agreement, viewpoint agreement, and manipulability. Names and ratings from the French speakers are available in Appendix A, available in the supplemental materials. The results show that most of the norms are comparable across linguistic groups and also that the ratings given are correlated across linguistic groups. The only significant group differences were found in viewpoint agreement and visual complexity. Overall, there was good concordance between the norms collected from French and English native speakers living in the same cultural setting.
Emotions are inherent to any human activity, including human–computer interactions, and that is the reason why recognizing emotions expressed in natural language is becoming a key feature for the design of more natural user interfaces. In order to obtain useful corpora for this purpose, the manual classification of texts according to their emotional content has been the technique most commonly used by the research community. The use of corpora is widespread in Natural Language Processing, and the existing corpora annotated with emotions support the development, training and evaluation of systems using this type of data. In this paper we present the development of an annotated corpus oriented to the narrative domain, called EmoTales, which uses two different approaches to represent emotional states: emotional categories and emotional dimensions. The corpus consists of a collection of 1,389 English sentences from 18 different folk tales, annotated by 36 different people. Our model of the corpus development process includes a post-processing stage performed after the annotation of the corpus, in which a reference value for each sentence was chosen by taking into account the tags assigned by annotators and some general knowledge about emotions, which is codified in an ontology. The whole process is presented in detail, and revels significant results regarding the corpus such as inter-annotator agreement, while discussing topics such as how human annotators deal with emotional content when performing their work, and presenting some ideas for the application of this corpus that may inspire the research community to develop new ways to annotate corpora using a large set of emotional tags.
Acronyms are an idiosyncratic part of our everyday vocabulary. Research in word processing has used acronyms as a tool to answer fundamental questions such as the nature of the word superiority effect (WSE) or which is the best way to account for word-reading processes. In this study, acronym naming was assessed by looking at the influence that a number of variables known to affect mainstream word processing has had in acronym naming. The nature of the effect of these factors on acronym naming was examined using a multilevel regression analysis. First, 146 acronyms were described in terms of their age of acquisition, bigram and trigram frequencies, imageability, number of orthographic neighbors, frequency, orthographic and phonological length, print-to-pronunciation patterns, and voicing characteristics. Naming times were influenced by lexical and sublexical factors, indicating that acronym naming is a complex process affected by more variables than those previously considered.
Although many recent advances have taken place in corpus-based tools, the techniques used to guide exploration and evaluation of these systems have advanced little. Typically, the plausibility of a semantic space is explored by sampling the nearest neighbors to a target word and evaluating the neighborhood on the basis of the modeler's intuition. Tools for visualization of these large-scale similarity spaces are nearly nonexistent. We present a new open-source tool to plot and visualize semantic spaces, thereby allowing researchers to rapidly explore patterns in visual data that describe the statistical relations between words. Words are visualized as nodes, and word similarities are shown as directed edges of varying strengths. The "Word-2-Word" visualization environment allows for easy manipulation of graph data to test word similarity measures on their own or in comparisons between multiple similarity metrics. The system contains a large library of statistical relationship models, along with an interface to teach them from various language sources. The modularity of the visualization environment allows for quick insertion of new similarity measures so as to compare new corpus-based metrics against the current state of the art. The software is available at www.indiana.edu/{\~{}}semantic/word2word/.
Supervised machine learning methods to model word sense often rely on human labelers to provide a single, ground truth label for each word in its context. We examine issues in establishing ground truth word sense labels using a fine-grained sense inventory from WordNet. Our data consist of a sentence corpus of 1,000 sentences: 100 for each of ten moderately polysemous words. Each word was given multiple sense labels—or a multilabel—from trained and untrained annotators. The multilabels give a nuanced representation of the degree of agreement on instances. A suite of assessment metrics is used to analyze the sets of multilabels, such as comparisons of sense distributions across annotators. Our assessment indicates that the general annotation procedure is reliable, but that words differ regarding how reliably annotators can assign WordNet sense labels, independent of the number of senses. We also investigate the performance of an unsupervised machine learning method to infer ground truth labels from various combinations of labels from the trained and untrained annotators. We find tentative support for the hypothesis that performance depends on the quality of the set of multilabels, independent of the number of labelers or their training.
This work presents a new set of 360 high quality colour images belonging to 23 semantic subcategories. Two hundred and thirty-six Spanish speakers named the items and also provided data from seven relevant psycholinguistic variables: age of acquisition, familiarity, manipulability, name agreement, typicality and visual complexity. Furthermore, we also present lexical frequency data derived from Internet search hits. Apart from the high number of variables evaluated, knowing that it affects the processing of stimuli, this new set presents important advantages over other similar image corpi: (a) this corpus presents a broad number of subcategories and images; for example, this will permit researchers to select stimuli of appropriate difficulty as required, (e.g., to deal with problems derived from ceiling effects); (b) the fact of using coloured stimuli provides a more realistic, ecologically-valid, representation of real life objects. In sum, this set of stimuli provides a useful tool for research on visual object- and word-processing, both in neurological patients and in healthy controls.
To establish a valid database of vocal emotional stimuli in Mandarin Chinese, a set of Chinese pseudosentences (i.e., semantically meaningless sentences that resembled real Chinese) were produced by four native Mandarin speakers to express seven emotional meanings: anger, disgust, fear, sadness, happiness, pleasant surprise, and neutrality. These expressions were identified by a group of native Mandarin listeners in a seven-alternative forced choice task, and items reaching a recognition rate of at least three times chance performance in the seven-choice task were selected as a valid database and then subjected to acoustic analysis. The results demonstrated expected variations in both perceptual and acoustic patterns of the seven vocal emotions in Mandarin. For instance, fear, anger, sadness, and neutrality were associated with relatively high recognition, whereas happiness, disgust, and pleasant surprise were recognized less accurately. Acoustically, anger and pleasant surprise exhibited relatively high mean f0 values and large variation in f0 and amplitude; in contrast, sadness, disgust, fear, and neutrality exhibited relatively low mean f0 values and small amplitude variations, and happiness exhibited a moderate mean f0 value and f0 variation. Emotional expressions varied systematically in speech rate and harmonics-to-noise ratio values as well. This validated database is available to the research community and will contribute to future studies of emotional prosody for a number of purposes. To access the database, please contact pan.liu@mail.mcgill.ca.
Identifying objects in conversation is a fundamental human capability necessary to achieve efficient collaboration on any real world task. Hence the deepening of our understanding of human referential behaviour is indispensable for the creation of systems that collaborate with humans in a meaningful way. We present the construction of REX-J, a multi-modal Japanese corpus of referring expressions in situated dialogs, based on the collaborative task of solving the Tangram puzzle. This corpus contains 24 dialogs with over 4 h of recordings and over 1,400 referring expressions. We outline the characteristics of the collected data and point out the important differences from previous corpora. The corpus records extra-linguistic information during the interaction (e.g. the position of pieces, the actions on the pieces) in synchronization with the participants’ utterances. This in turn allows us to discuss the importance of creating a unified model of linguistic and extra-linguistic information from a new perspective. Demonstrating the potential uses of this corpus, we present the analysis of a specific type of referring expression (“action-mentioning expression”) as well as the results of research into the generation of demonstrative pronouns. Furthermore, we discuss some perspectives on potential uses of this corpus as well as our planned future work, underlining how it is a valuable addition to the existing databases in the community for the study and modeling of referring expressions in situated dialog.
The present study introduces the first substantial German database with norms for semantic typicality, age of acquisition, and concept familiarity for 824 exemplars of 11 semantic categories, including four natural (ANIMALS, BIRDS, FRUITS,: and VEGETABLES: ) and five man-made (CLOTHING, FURNITURE, VEHICLES, TOOLS: , and MUSICAL INSTRUMENTS: ) categories, as well as PROFESSIONS: and SPORTS: . Each category exemplar in the database was collected empirically in an exemplar generation study. For each category exemplar, norms for semantic typicality, estimated age of acquisition, and concept familiarity were gathered in three different rating studies. Reliability data and additional analyses on effects of semantic category and intercorrelations between age of acquisition, semantic typicality, concept familiarity, word length, and word frequency are provided. Overall, the data show high inter- and intrastudy reliabilities, providing a new resource tool for designing experiments with German word materials. The full database is available in the supplementary material of this file and also at www.psychonomic.org/archive .
In this paper, we present first results from annotating abstract (discourse-deictic) anaphora in German. Our annotation guidelines provide linguistic tests for identifying the antecedent, and for determining the semantic types of both the antecedent and the anaphor. The corpus consists of selected speaker turns from the Europarl corpus. To date, 100 texts have been annotated according to these guidelines. The annotations show that anaphoric personal and demonstrative pronouns differ with respect to the distance to their antecedents. A semantic analysis reveals that, contrary to suggestions put forward in the literature, referents of anaphors do not tend to be more abstract than the referents of their antecedents.
Picture naming was investigated primarily to determine its dependence on certain imagery-related variables, with a secondary aim of developing a new set of Japanese norms for 360 pictures. Pictures refined from the original Nishimoto, Miyawaki, Ueda, Une, and Takahashi (Behavior Research Methods 37:398-416, 2005) set were used. Naming behaviors were measured using four imagery-related measures (imageability, vividness, image agreement, and image variability) and four conventional measures (naming time, name agreement, familiarity, and age of acquisition), as well as a number of other measures (17 total). A simultaneous multiple regression analysis performed on naming times showed that the most reliable predictor was H, a measure of name diversity; two image-related measures (image agreement and vividness) and age of acquisition also contributed substantially to the prediction of naming times. The accuracy of picture naming (measured as name agreement) was predicted by vividness, age of acquisition, familiarity, and image agreement. This suggests that certain processes involving mental imagery play a role in picture naming. The full set of norms and pictures may be downloaded from http://www.psychonomic.org/archive/ or along with the article from http://www.springerlink.com .
Although superlatives are commonly used in natural language, so far there has been no large-scale computational investigation of the types of comparisons they express. This article describes a comprehensive annotation scheme for superlatives, which classifies superlatives according to their surface forms and motivates an initial focus on so-called “ISA superlatives”. This type of superlative comparison is especially suitable for a computational approach because both their targets and comparison sets are explicitly realised in the text, and the proposed annotation scheme offers guidelines for annotating the spans of such comparative elements. The annotations are tested and evaluated on 500 tokens of superlatives with good inter-annotator agreement. In addition to providing a platform for investigating superlatives on a larger scale, this research also introduces a new text-based Wikipedia corpus in which all superlative instances have been annotated according to the proposed annotation scheme, and which has been used to develop a tool that can reliably distinguish between different superlative types, and identify the comparative components of ISA superlatives.
This paper presents an overview of a project that aims at creating a representative Catalogue of cross-linguistically recurrent semantic shifts1 in the languages of the world and at implementing this Catalogue in the form of a searchable computer database2. Such a catalogue is useful in several theoretical and methodological respects. First of all, both universal and language-specific semantic shifts can be considered a window onto human cognitive mechanisms operative in the domain of linguistic conceptualization. Second, the catalogue provides a rich empirical basis for the study of genetic and areal tendencies in semantic change, as well as in polysemy patterns. Another potential application is historical reconstruction, since the catalogue gives evidence for attested paths of diachronic semantic evolution. In section 1 we outline the general concept of the Catalogue of Semantic Shifts. Section 2 describes the design of the computer database. In section 3 we discuss some problematic points that we have faced while working on the Catalogue. The next three sections demonstrate how the Catalogue can be used in linguistic research and present an analysis of three selected issues, namely semantic shifts in the domain of dimension (section 4), motivation strategies in the domain of folk biology (section 5), and euphemization as a mechanism of semantic change (section 6).
This paper describes the preparation, recording, analyzing, and evaluation of a new speech corpus for Modern Standard Arabic (MSA). The speech corpus contains a total of 415 sentences recorded by 40 (20 male and 20 female) Arabic native speakers from 11 different Arab countries representing three major regions (Levant, Gulf, and Africa). Three hundred and sixty seven sentences are considered as phonetically rich and balanced, which are used for training Arabic Automatic Speech Recognition (ASR) systems. The rich characteristic is in the sense that it must contain all phonemes of Arabic language, whereas the balanced characteristic is in the sense that it must preserve the phonetic distribution of Arabic language. The remaining 48 sentences are created for testing purposes, which are mostly foreign to the training sentences and there are hardly any similarities in words. In order to evaluate the speech corpus, Arabic ASR systems were developed using the Carnegie Mellon University (CMU) Sphinx 3 tools at both training and testing/decoding levels. The speech engine uses 3-emitting state Hidden Markov Models (HMM) for tri-phone based acoustic models. Based on experimental analysis of about 8 h of training speech data, the acoustic model is best using continuous observation’s probability model of 16 Gaussian mixture distributions and the state distributions were tied to 500 senones. The language model contains uni-grams, bi-grams, and tri-grams. For same speakers with different sentences, Arabic ASR systems obtained average Word Error Rate (WER) of 9.70%. For different speakers with same sentences, Arabic ASR systems obtained average WER of 4.58%, whereas for different speakers with different sentences, Arabic ASR systems obtained average WER of 12.39%.
Research into similarities between music and language processing is currently experiencing a strong renewed interest. Recent methodological advances have led to neuroimaging studies presenting striking similarities between neural patterns associated with the processing of music and language--notably, in the study of participants' responses to elements that are incongruous with their musical or linguistic context. Responding to a call for greater systematicity by leading researchers in the field of music and language psychology, this article describes the creation, selection, and validation of a set of auditory stimuli in which both congruence and resolution were manipulated in equivalent ways across harmony, rhythm, semantics, and syntax. Three conditions were created by changing the contexts preceding and following musical and linguistic incongruities originally used for effect by authors and composers: Stimuli in the incongruous-resolved condition reproduced the original incongruity and resolution into the same context; stimuli in the incongruous-unresolved condition reproduced the incongruity but continued postincongruity with a new context dictated by the incongruity; and stimuli in the congruous condition presented the same element of interest, but the entire context was adapted to match it so that it was no longer incongruous. The manipulations described in this article rendered unrecognizable the original incongruities from which the stimuli were adapted, while maintaining ecological validity. The norming procedure and validation study resulted in a significant increase in perceived oddity from congruous to incongruous-resolved and from incongruous-resolved to incongruous-unresolved in all four components of music and language, making this set of stimuli a theoretically grounded and empirically validated resource for this growing area of research.
The Alcohol Language Corpus (ALC) is the first publicly available speech corpus comprising intoxicated and sober speech of 162 female and male German speakers. Recordings are done in the automotive environment to allow for the development of automatic alcohol detection and to ensure a consistent acoustic environment for the alcoholized and the sober recording. The recorded speech covers a variety of contents and speech styles. Breath and blood alcohol concentration measurements are provided for all speakers. A transcription according to SpeechDat/Verbmobil standards and disfluency tagging as well as an automatic phonetic segmentation are part of the corpus. An Emu version of ALC allows easy access to basic speech parameters as well as the us of R for statistical analysis of selected parts of ALC. ALC is available without restriction for scientific or commercial use at the Bavarian Archive for Speech Signals.
Multilingual text processing is useful because the information content found in different languages is complementary, both regarding facts and opinions. While Information Extraction and other text mining software can, in principle, be developed for many languages, most text analysis tools have only been applied to small sets of languages because the development effort per language is large. Self-training tools obviously alleviate the problem, but even the effort of providing training data and of manually tuning the results is usually considerable. In this paper, we gather insights by various multilingual system developers on how to minimise the effort of developing natural language processing applications for many languages. We also explain the main guidelines underlying our own effort to develop complex text mining software for tens of languages. While these guidelines—most of all: extreme simplicity—can be very restrictive and limiting, we believe to have shown the feasibility of the approach through the development of the Europe Media Monitor (EMM) family of applications (http://emm.newsbrief.eu/overview.html). EMM is a set of complex media monitoring tools that process and analyse up to 100,000 online news articles per day in between twenty and fifty languages. We will also touch upon the kind of language resources that would make it easier for all to develop highly multilingual text mining applications. We will argue that—to achieve this—the most needed resources would be freely available, simple, parallel and uniform multilingual dictionaries, corpora and software tools.
The role of the Web for text corpus construction is becoming increasingly significant. However, the contribution of the Web is largely confined to building a general virtual corpus or low quality specialised corpora. In this paper, we introduce a new technique called SPARTAN for constructing specialised corpora from the Web by systematically analysing website contents. Our evaluations show that the corpora constructed using our technique are independent of the search engines employed. In particular, SPARTAN-derived corpora outperform all corpora based on existing techniques for the task of term recognition.
The present study reports descriptive normative measures for 245 Italian verbal idiomatic expressions. For each of the idiomatic expressions the following variables are reported: Length, Knowledge, Familiarity, Age of Acquisition, Predictability, Syntactic flexibility, Literality and Compositionality. Syntactic flexibility was assessed using five syntactic operations: adverb insertion, adjective insertion, left dislocation, passive and movement. The psycholinguistic relevance of each dimension, their measures and the correlations among them are provided and discussed. The databases are freely available for down-loading from the Psychonomic Society Web archive at www.psychonomic.org/archive/.
We constructed a corpus of digitized texts containing about 4{\%} of all books ever printed. Analysis of this corpus enables us to investigate cultural trends quantitatively. We survey the vast terrain of 'culturomics,' focusing on linguistic and cultural phenomena that were reflected in the English language between 1800 and 2000. We show how this approach can provide insights about fields as diverse as lexicography, the evolution of grammar, collective memory, the adoption of technology, the pursuit of fame, censorship, and historical epidemiology. Culturomics extends the boundaries of rigorous quantitative inquiry to a wide array of new phenomena spanning the social sciences and the humanities.
Individual happiness is a fundamental societal metric. Normally measured through self-report, happiness has often been indirectly characterized and overshadowed by more readily quantifiable economic indicators such as gross domestic product. Here, we examine expressions made on the online, global microblog and social networking service Twitter, uncovering and explaining temporal variations in happiness and information levels over timescales ranging from hours to years. Our data set comprises over 46 billion words contained in nearly 4.6 billion expressions posted over a 33 month span by over 63 million unique users. In measuring happiness, we use a real-time, remote-sensing, non-invasive, text-based approach---a kind of hedonometer. In building our metric, made available with this paper, we conducted a survey to obtain happiness evaluations of over 10,000 individual words, representing a tenfold size improvement over similar existing word sets. Rather than being ad hoc, our word list is chosen solely by frequency of usage and we show how a highly robust metric can be constructed and defended.
We review recent evidence indicating that researchers in experimental psychology may have used suboptimal estimates of word frequency. Word frequency measures should be based on a corpus of at least 20 million words that contains language participants in psychology experiments are likely to have been exposed to. In addition, the quality of word frequency measures should be ascertained by correlating them with behavioral word processing data. When we apply these criteria to the word frequency measures available for the German language, we find that the commonly used Celex frequencies are the least powerful to predict lexical decision times. Better results are obtained with the Leipzig frequencies, the dlexDB frequencies, and the Google Books 2000–2009 frequencies. However, as in other languages the best performance is observed with subtitle-based word frequencies. The SUBTLEX-DE word frequencies collected for the present ms are made available in easy-to-use files and are free for educational purposes.
Ratings of realism, masculinity, race, and racial stereotypy were collected on a set of computer-generated faces representing European, South East Asian, and African American ethnicities. To determine if these faces are processed in the same way as photographs of real faces, we demonstrated with these faces superior memory performance for upright faces over inverted faces (the face inversion effect). Further, in observers of European decent, we found both superior memory for European faces and a larger inversion effect for European than African American faces. Based on these results, we believe that this set of faces may be of use in perceptual investigations in which race is a critical manipulation.
We describe the Lwazi corpus for automatic speech recognition (ASR), a new telephone speech corpus which contains data from the eleven official languages of South Africa. Because of practical constraints, the amount of speech per language is relatively small compared to major corpora in world languages, and we report on our investigation of the stability of the ASR models derived from the corpus. We also report on phoneme distance measures across languages, and describe initial phone recognisers that were developed using this data. We find that a surprisingly small number of speakers (fewer than 50) and around 10 to 20 h of speech per language are sufficient for the purposes of acceptable phone-based recognition.
A normative study was conducted using the Deese/Roediger-McDermott paradigm (DRM) to obtain false recognition for 60 six-word lists in Spanish, designed with a completely new methodology. For the first time, lists included words (e.g., bridal, newlyweds, bond, commitment, couple, to marry) simultaneously associated with three critical words (e.g., love, wedding, marriage). Backward associative strength between lists and critical words was taken into account when creating the lists. The results showed that all lists produced false recognition. Moreover, some lists had a high false recognition rate (e.g., 65{\%}; jail, inmate, prison: bars, prisoner, cell, offender, penitentiary, imprisonment). This is an aspect of special interest for those DRM experiments that, for example, record brain electrical activity. This type of list will enable researchers to raise the signal-to-noise ratio in false recognition event-related potential studies as they increase the number of critical trials per list, and it will be especially useful for the design of future research.
Age of acquisition (AoA) estimates are provided for 3,460 senses of 1,208 words (i.e., words with multiple meanings e.g., duck). The AoA rating estimates appear to be relatively consistent across participants. The Spearman-Brown split-half reliability coefficient is .95, while the correlations between each participant's ratings and the overall mean ratings yielded correlation coefficients between .325 to .794 with a mean of .69 (SD = .10). These estimates will be of use to those interested in: (a) the influence of AoA on word processing, (b) the influence of AoA on meaning access, (c) the structure of semantic memory, and (d) developmental trends in lexical ambiguity resolution. These AoA estimates can be downloaded from the Psychonomic Society's Web archive of norms, stimuli, and data at www.psychonomic.org/archive.
The lexical database dlexDB supplies in form of an online database frequency-based norms of numerous process-related word properties for psychological and linguistic research. These values include well known variables such as printed frequency of word form and lemma as documented also in CELEX (Baayen, Piepenbrock und Gulikers, 1995). In addition, we compute new values like frequencies based on syllables, and morphemes as well as frequencies of character chains, and multiple word combinations. The statistics are based on the Kernkorpus des Digitalen Wrterbuchs der deutschen Sprache (DWDS) with over 100 million running words. We illustrate the validity of these norms with new results about fixation durations in sentence reading.
This study examines the relationship between the linguistic characteristics of body paragraphs of student essays and the total number of paragraphs in the essays. Results indicate a significant relationship between the total number of paragraphs and a variety of linguistic characteristics known to affect student essay scores. These linguistic characteristics (e.g., semantic overlap, syntactic complexity) contribute to two underlying factors (i.e., textual cohesion and difficulty) that are used as dependent variables in mixed-effect models. Results suggest that student essays with 5-8 paragraphs tend to be more linguistically consistent than student essays with 3, 4, and 9 paragraphs. Essays with totals of 5-8 paragraphs, considered by many educators to contain an optimal number of paragraphs, may include functionally and structurally similar paragraphs. These findings could aid writing researchers and educators in obtaining a clearer view of the relationship between the total number of paragraphs comprising an essay and the linguistic characteristics that affect essay evaluation. Consequently, writing interventions may become better equipped to pinpoint student difficulties and facilitate student writing skills by providing more detailed and informed feedback.
Our knowledge about affective processes, especially concerning effects on cognitive demands like word processing, is increasing steadily. Several studies consistently document valence and arousal effects, and although there is some debate on possible interactions and different notions of valence, broad agreement on a two dimensional model of affective space has been achieved. Alternative models like the discrete emotion theory have received little interest in word recognition research so far. Using backward elimination and multiple regression analyses, we show that five discrete emotions (i.e., happiness, disgust, fear, anger and sadness) explain as much variance as two published dimensional models assuming continuous or categorical valence, with the variables happiness, disgust and fear significantly contributing to this account. Moreover, these effects even persist in an experiment with discrete emotion conditions when the stimuli are controlled for emotional valence and arousal levels. We interpret this result as evidence for discrete emotion effects in visual word recognition that cannot be explained by the two dimensional affective space account.
It is generally assumed that orthographic-phonological (O-P) consistencies are higher for Japanese kana words than for kanji words and that orthographic-semantic (O-S) consistencies are higher for kanji words than for kana words. In order to examine the validity of these assumptions, we attempted to measure the O-P and O-S consistencies for 339 kana words and 775 kanji words. Orthographic neighbors were first generated for each of these words. In order to measure the O-P consistencies of the words, their neighbors were then classified as phonological friends or enemies, based on whether the characters shared with the original word were pronounced the same in the two words. In order to measure the O-S consistencies, the similarity in meaning of each of the neighbors to the original word was rated on a 7-point scale. Based on the ratings, the neighbors were classified as semantic friends or enemies. The results indicated that both the O-P consistencies for kanji words and the O-S consistencies for kana words were greater than previously assumed and that the two scripts were actually quite similar on both types of consistency measures. The implications for the nature of the reading processes for kana and kanji words are discussed.
Frequency of occurrence is an important attribute of lexical units, and one that is widely used in psychological research and theorization. Although printed frequency norms have long been available for Spanish, and subtitle-based norms have more recently been published, oral frequency norms have not been systematically compiled for a representative set of words. In this study, a corpus of over three million units, representing present-day use of the language in Spain, was used to derive a frequency count of spoken words. The corpus consisted of 913 separate documents that contained transcriptions of oral recordings obtained in a wide variety of situations, mostly radio and television programs. The resulting database, containing absolute and relative frequency values for 67,979 orally produced words, is presented. Validity analyses showed significant correlations of oral frequency with other frequency measures and suggest that oral frequency can predict some types of lexical processing with the same or higher levels of precision, when contrasted with text- or subtitle-based frequencies. In conclusion, we discuss ways in which these oral frequency norms can be put to use. The norms can be downloaded from www.springerlink.com.
The Berlin Affective Word List (BAWL, V{\~{o}}, Jacobs, {\&} Conrad, Behavior Research Methods, 35, 606-609, 2006) and the BAWL-R (V{\~{o}} et al. in Behavior Research Methods 38, 606-609, 2009) are two commonly used lists to investigate affective properties of German words. The two-dimensional valence and arousal model of affect underlying the BAWL is traditionally contrasted with models describing affect in discrete emotional categories, which, however, are not currently incorporated in the BAWL. In order to allow future studies to investigate affective processing from both perspectives--or to directly compare them--in the present study, we collected data by assigning nouns taken from the BAWL-R to discrete emotion intensities, which in turn allowed the assignment to discrete emotion categories. In the study, we present Discrete Emotion Norms for Nouns-Berlin Affective Word List (DENN-BAWL). Using these ratings and the psycholinguistic indexes from the BAWL-R, the DENN-BAWL allows researchers to design experiments using highly controlled and reliable word material. Data have been archived at www.fu-berlin.de/allgpsy/DENN-BAWL.
According to recent embodied cognition theories, mental concepts are represented by modality-specific sensory-motor systems. Much of the evidence for modality-specificity in conceptual processing comes from the property-verification task. When applying this and other tasks, it is important to select items based on their modality-exclusivity. We collected modality ratings for a set of 387 properties, each of which was paired with two different concepts, yielding a total of 774 concept-property items. For each item, participants rated the degree to which the property could be experienced through five perceptual modalities (vision, audition, touch, smell, and taste). Based on these ratings, we computed a measure of modality exclusivity, the degree to which a property is perceived exclusively through one sensory modality. In this paper, we briefly sketch the theoretical background of conceptual knowledge, discuss the use of the property-verification task in cognitive research, provide our norms and statistics, and validate the norms in a memory experiment. We conclude that our norms are important for researchers studying modality-specific effects in conceptual processing.
Affective stimuli are increasingly used in emotion research. Typically, stimuli are selected from databases providing affective norms. The validity of these norms is a critical factor with regard to the applicability of the stimuli for emotion research. We therefore probed the validity of the Leipzig Affective Norms for German (LANG) by correlating valence and arousal ratings across different sensory modalities. A sample of 120 words was selected from the LANG database, and auditory recordings of these words were obtained from two professional actors. The auditory stimuli were then rated again for valence and arousal. This cross-modal validation approach yielded very high correlations between auditory and visual ratings ({\textgreater}.95). These data confirm the strong validity of the Leipzig Affective Norms for German and encourage their use in emotion research.
Structural metadata extraction (MDE) research aims to develop techniques for automatic conversion of raw speech recognition output to forms that are more useful to humans and downstream automatic processes. The MDE annotation includes inserting boundaries of sentence-like units to the flow of speech, labeling non-content words like filled pauses and discourse markers for optional removal, and identifying sections of disfluent speech. This paper describes design, creation, and analysis of data resources for structural MDE from spoken Czech. The annotation is based on the LDC’s MDE annotation standard for English, with changes applied to accommodate specific phenomena of Czech. In addition to the necessary language-dependent modifications, we further proposed and applied several language-independent modifications slightly refining the original annotation scheme. We created two Czech MDE speech corpora—one in the domain of broadcast news and the other in the domain of broadcast conversations. Both corpora have already been published at LDC. The analysis section of this paper presents a variety of statistics about fillers, edit disfluencies, and sentence-like units. The two Czech corpora are not only compared with each other, but also with statistics relating to the available English MDE corpora. We also report the statistics indicating that edit disfluencies have a different part of speech (POS) distribution in comparison with the overall POS distribution. The findings from the corpus analysis should help guide strategies for developing automatic MDE systems.
Recent studies have shown that word frequency estimates obtained from films and television subtitles are better to predict performance in word recognition experiments than the traditional word frequency estimates based on books and newspapers. In this study, we present a subtitle-based word frequency list for Spanish, one of the most widely spoken languages. The subtitle frequencies are based on a corpus of 41M words taken from contemporary movies and TV series (screened between 1990 and 2009). In addition, the frequencies have been validated by correlating them with the RTs from two megastudies involving 2,764 words each (lexical decision and word naming tasks). The subtitle frequencies explained 6{\%} more of the variance than the existing written frequencies in lexical decision, and 2{\%} extra in word naming.
The lexical database dlexDB supplies in form of an online database frequency-based norms of numerous processrelated word properties for psychological and linguistic research. These values include well known variables such as printed frequency of word form and lemma as documented also in CELEX (Baayen, Piepenbrock und Gulikers, 1995). In addition, we compute new values like frequencies based on syllables, and morphemes as well as frequencies of character chains, and multiple word combinations. The statistics are based on the Kernkorpus des Digitalen Wörterbuchs der deutschen Sprache (DWDS) with over 100 million running words. We illustrate the validity of these norms with new results about fixation durations in sentence reading. (PsycINFO Database Record (c) 2016 APA, all rights reserved)
Plagiarism is widely acknowledged to be a significant and increasing problem for higher education institutions (McCabe 2005; Judge 2008). A wide range of solutions, including several commercial systems, have been proposed to assist the educator in the task of identifying plagiarised work, or even to detect them automatically. Direct comparison of these systems is made difficult by the problems in obtaining genuine examples of plagiarised student work. We describe our initial experiences with constructing a corpus consisting of answers to short questions in which plagiarism has been simulated. This corpus is designed to represent types of plagiarism that are not included in existing corpora and will be a useful addition to the set of resources available for the evaluation of plagiarism detection systems.
We present a set of 150 pictures with morphologically complex English compound names. The pictures were collected from various sources and were standardized to appear as grayscale line drawings of a fixed size. All the compounds had two constituents and were primarily of the noun-noun type. Following previous studies, we collected name agreement (percentage and H), familiarity, image agreement, and visual complexity norms, as well as frequency estimates for the whole compound word and its first and second constituents. These pictures and their corresponding norms (available from the Psychonomic Society's supplemental archive) are a valuable tool in the study of the morphological representation of complex words in language processing.
Six previous studies of the variables affecting anagram solution are re-examined for the evidence that number of syllables contributes to solution difficulty. It was shown that the number of syllables in a solution word was confounded with imagery for one study and with diagram frequency for another. More importantly it was shown that the number of syllables has a large effect on anagram solution difficulty in the re-analysis of the results from the other four studies. In these studies, the number of syllables was either more important than the principal variable examined in the experiment or the second most important variable. Overall the effect size for the number of syllables was large, d = 1.14. The results are discussed in the light of other research and it is suggested that anagram solution may have more in common with other word identification and reading processes than has been previously thought.
Sentiment analysis of microblogs such as Twitter has recently gained a fair amount of attention. One of the simplest sentiment analysis approaches compares the words of a posting against a labeled word list, where each word has been scored for valence, -- a 'sentiment lexicon' or 'affective word lists'. There exist several affective word lists, e.g., ANEW (Affective Norms for English Words) developed before the advent of microblogging and sentiment analysis. I wanted to examine how well ANEW and other word lists performs for the detection of sentiment strength in microblog posts in comparison with a new word list specifically constructed for microblogs. I used manually labeled postings from Twitter scored for sentiment. Using a simple word matching I show that the new word list may perform better than ANEW, though not as good as the more elaborate approach found in SentiStrength.
The renewed focus of attention on investigating spontaneous speech samples in speech and language research has increased the need for recordings of speech in interactive settings. The DiapixUK task is a new and extended set of picture materials based on the Diapix task by Van Engen et al. (Language and Speech, 53, 510-540, 2010), where two people are recorded while conversing to solve a 'spot the difference' task. The new task materials allow for multiple recordings of the same speaker pairs due to a larger set of picture pairs that have a number of tested features: equal difficulty across all 12 picture pairs, no learning effect of completing more than one picture task and balanced contributions from both speakers. The new materials also provide extra flexibility, making them useful in a wide range of research projects; they are multi-layered electronic images that can be adapted to suit different research needs. This article presents details of the development of the DiapixUK materials, along with data taken from a large corpus of spontaneous speech that are used to demonstrate its new features. Current and potential applications of the task are also discussed.
In emotional research, efficient designs often rely on successful emotion induction. For visual stimulation, the only reliable database available so far is the International Affective Picture System (IAPS). However, extensive use of these stimuli lowers the impact of the images by increasing the knowledge that participants have of them. Moreover, the limited number of pictures for specific themes in the IAPS database is a concern for studies centered on a specific emotion thematic and for designs requiring a lot of trials from the same kind (e.g., EEG recordings). Thus, in the present article, we present a new database of 730 pictures, the Geneva Affective PicturE Database, which was created to increase the availability of visual emotion stimuli. Four specific negative contents were chosen: spiders, snakes, and scenes that induce emotions related to the violation of moral and legal norms (human rights violation or animal mistreatment). Positive and neutral pictures were also included: Positive pictures represent mainly human and animal babies as well as nature sceneries, whereas neutral pictures mainly depict inanimate objects. The pictures were rated according to valence, arousal, and the congruence of the represented scene with internal (moral) and external (legal) norms. The constitution of the database and the results of the picture ratings are presented.
This study provides implicit verb causality norms for a corpus of 305 English verbs. A web-based sentence completion study was conducted, with 96 respondents completing fragments such as "John liked Mary because..." The resulting bias scores are provided as supplementary material in the Psychonomic Society Archive, where we also present lexical and semantic verb features, such as the frequency, semantic class and emotional valence. Our results replicate those of previous studies with much smaller numbers of verbs and respondents. Novel effects of gender and its interaction with verb valence illustrate the type of issues that can be investigated using stable norms for a large number of verbs. The corpus will facilitate future studies in a range of areas, including psycholinguistics and social psychology.
We provide imageability estimates for 3,000 disyllabic words (as supplementary materials that may be downloaded with the article from www.springerlink.com ). Imageability is a widely studied lexical variable believed to influence semantic and memory processes (see, e.g., Paivio, 1971). In addition, imageability influences basic word recognition processes (Plaut, McClelland, Seidenberg, {\&} Patterson, 1996). In fact, neuroimaging studies have suggested that reading high- and low-imageable words elicits distinct neural activation patterns for the two types e.g., Bedny {\&} Thompson-Schill (Brain and Language 98:127-139, 2006; Graves, Binder, Desai, Conant, {\&} Seidenberg NeuroImage 53:638-646, 2010). Despite the usefulness of this variable, imageability estimates have not been available for large sets of words. Furthermore, recent megastudies of word processing e.g., Balota et al. (Behavior Research Methods 39:445-459, 2007) have expanded the number of words that interested researchers can select according to other lexical characteristics (e.g., average naming latencies, lexical decision times, etc.). However, the dearth of imageability estimates (as well as those of other lexical characteristics) limits the items that researchers can include in their experiments. Thus, these imageability estimates for disyllabic words expand the number of words available for investigations of word processing, which should be useful for researchers interested in the influences of imageability both as an input and as an outcome variable.
In this paper, we describe tools and resources for the study of African languages developed at the Collaborative Research Centre 632 “Information Structure”. These include deeply annotated data collections of 25 sub-Saharan languages that are described together with their annotation scheme, as well as the corpus tool ANNIS, which provides unified access to a broad variety of annotations created with a range of different tools. With the application of ANNIS to several African data collections, we illustrate its suitability for the purpose of language documentation, distributed access, and the creation of data archives.
We collected imageability and body-object interaction (BOI) ratings for 599 multisyllabic nouns. We then examined the effects of these variables on a subset of these items in picture-naming, word-naming, lexical decision, and semantic categorization. Picture-naming latencies were taken from the International Picture-Naming Project database (Szekely, Jacobsen, D'Amico, Devescovi, Andonova, Herron, et al. Journal of Memory and Language, 51, 247-250, 2004), word-naming and lexical decision latencies were taken from the English Lexicon Project database (Balota, Yap, Cortese, Hutchison, Kessler, Loftis, et al. Behavior Research Methods, 39, 445-459, 2007), and we collected semantic categorization latencies. Results from hierarchical multiple regression analyses showed that imageability and BOI separately accounted for unique latency variability in each task, even with several other predictor variables (e.g., print frequency, number of syllables and morphemes, age of acquisition) entered first in the analyses. These ratings should be useful to researchers interested in manipulating or controlling for the effects of imageability and BOI for multisyllabic stimuli in lexical and semantic tasks.
The aim of the present study was to provide Russian normative data for the Snodgrass and Vanderwart (Behavior Research Methods, Instruments, {\&} Computers, 28, 516-536, 1980) colorized pictures (Rossion {\&} Pourtois, Perception, 33, 217-236, 2004). The pictures were standardized on name agreement, image agreement, conceptual familiarity, imageability, and age of acquisition. Objective word frequency and objective visual complexity measures are also provided for the most common names associated with the pictures. Comparative analyses between our results and the norms obtained in other, similar studies are reported. The Russian norms may be downloaded from the Psychonomic Society supplemental archive.
This paper serves as an initial announcement of the avail- ability of a corpus of articulatory data called mngu0. This cor- pus will ultimately consist of a collection of multiple sources of articulatory data acquired from a single speaker: electro- magnetic articulography (EMA), audio, video, volumetric MRI scans, and 3D scans of dental impressions. This data will be provided free for research use. In this first stage of the release, we are making available one subset of EMA data, consisting of more than 1,300 phonetically diverse utterances recorded with a Carstens AG500 electromagnetic articulograph. Distribution of mngu0 will be managed by a dedicated “forum-style” web site. This paper both outlines the general goals motivating the distribution of the data and the creation of the mngu0 web fo- rum, and also provides a description of the EMA data contained in this initial release.
Semantic interpretation of language requires extensive and rich lexical knowledge bases (LKB). The Basque WordNet is a LKB based on WordNet and its multilingual counterparts EuroWordNet and the Multilingual Central Repository. This paper reviews the theoretical and practical aspects of the Basque WordNet lexical knowledge base, as well as the steps and methodology followed in its construction. Our methodology is based on the joint development of wordnets and annotated corpora. The Basque WordNet contains 32,456 synsets and 26,565 lemmas, and is complemented by a hand-tagged corpus comprising 59,968 annotations.
Verb Phrase Ellipsis (VPE) has been studied in great depth in theoretical linguistics, but empirical studies of VPE are rare. We extend the few previous corpus studies with an annotated corpus of VPE in all 25 sections of the Wall Street Journal corpus (WSJ) distributed with the Penn Treebank. We annotated the raw files using a stand-off annotation scheme that codes the auxiliary verb triggering the elided verb phrase, the start and end of the antecedent, the syntactic type of antecedent (VP, TV, NP, PP or AP), and the type of syntactic pattern between the source and target clauses of the VPE and its antecedent. We found 487 instances of VPE (including predicative ellipsis, antecedent-contained deletion, comparative constructions, and pseudo-gapping) plus 67 cases of related phenomena such as do so anaphora. Inter-annotator agreement was high, with a 0.97 average F-score for three annotators for one section of the WSJ. Our annotation is theory neutral, and has better coverage than earlier efforts that relied on automatic methods, e.g. simply searching the parsed version of the Penn Treebank for empty VP’s achieves a high precision (0.95) but low recall (0.58) when compared with our manual annotation. The distribution of VPE source–target patterns deviates highly from the standard examples found in the theoretical linguistics literature on VPE, once more underlining the value of corpus studies. The resulting corpus will be useful for studying VPE phenomena as well as for evaluating natural language processing systems equipped with ellipsis resolution algorithms, and we propose evaluation measures for VPE detection and VPE antecedent selection. The stand-off annotation is freely available for research purposes.
This paper addresses some of the issues learned during the course of building a written language resource, called ‘Peykare’, for the contemporary Persian. After defining five linguistic varieties and 24 different registers based on these linguistic varieties, we collected texts for Peykare to do a linguistic analysis, including cross-register differences. For tokenization of Persian, we propose a descriptive generalization to normalize orthographic variations existing in texts. To annotate Peykare, we use EAGLES guidelines which result to have a hierarchy in the part-of-speech tags. To this aim, we apply a semi-automatic approach for the annotation methodology. In the paper, we also give a special attention to the Ezafe construction and homographs which are important in Persian text analyses.
We present a set of language resources and tools—a morphological parser, a morphological disambiguator, and a text corpus—for exploiting Turkish morphology in natural language processing applications. The morphological parser is a state-of-the-art finite-state transducer-based implementation of Turkish morphology. The disambiguator is based on the averaged perceptron algorithm and has the best accuracy reported for Turkish in the literature. The text corpus has been compiled from the web and contains about 500 million tokens. This is the largest Turkish web corpus published.
The blog phenomenon is universal. Blogs are characterized by their evaluative use, in that they enable Internet users to express their opinion on a given subject. From this point of view, they are an ideal resource for the constitution of an annotated sentiment analysis corpus, crossing the subject and the opinion expressed on this subject. This paper presents the Blogoscopy corpus for the French language which was built up with personal thematic blogs. The annotation was governed by three principles: theoretical, as opinion is grounded in a linguistic theory of evaluation, practical, as every opinion is linked to an object, and methodological as annotation rules and successive phases are defined to ensure quality and thoroughness.
The aim of the present study was to expand the scope of category norm and typicality data to include verbs for use when investigating semantic memory in fields such as linguistics, psychology, and aphasiology. Two experiments were conducted. In the first, participants were asked to list verbs within 10 semantic categories (e.g. breaking, cleaning, cooking, etc.) and 10 noun categories (e.g. animals, fruit, tools, etc.). In the second experiment, participants were asked to rate the typicality of verbs within 8 of the previously investigated verb categories. Although participants listed fewer verbs in verb categories than nouns in noun categories, the overall patterns with regard to correlation analyses between production frequency, mean rank of responses, lexical frequency, and typicality were consistent with those observed in noun categories. These patterns are also consistent with those observed in previous research. Potential similarities and differences between nouns and verbs, as well as future applications of such data, are discussed.
Although word co-occurrences within a document have been demonstrated to be semantically useful, word interactions over a local range have been largely neglected by psychologists due to practical challenges. Shannon's (Bell Systems Technical Journal, 27, 379–423, 623–665, 1948) conceptualization of information theory suggests that these interactions should be useful for understanding communica- tion. Computational advances make an examination of local word–word interactions possible for a large text corpus. We used Brants and Franz's (2006) dataset to generate conditional probabilities for 62,474 word pairs and entropy calculations for 9,917 words in Nelson, McEvoy, and Schreiber's (Behavior Research Methods, Instruments, {\&} Computers, 36, 402–407, 2004) free association norms. Semantic asso- ciativity correlated moderately with the probabilities and was stronger when the two words were not adjacent. The number of semantic associates for a word and the entropy of a word were also correlated. Finally, language entropy decreases from 11 bits for single words to 6 bits per word for four-word sequences. The probabilities and entropies discussed here are included in the supplemental materials for the article.
Recent research on anagram solution has produced two original findings. First, it has shown that a new bigram frequency measure called top rank, which is based on a comparison of summed bigram frequencies, is an important predictor of anagram difficulty. Second, it has suggested that the measures from a type count are better than token measures at predicting anagram difficulty. Testing these hypotheses has been difficult because the computation of the bigram statistics is difficult. We present a program that calculates bigram measures for two-to nine-letter words. We then show how the program can be used to compare the contribution of top rank and other bigram frequency measures derived from both a token and a type count. Contrary to previous research, we report that type measures are not better at predicting anagram solution times and that top rank is not the best predictor of anagram difficulty. Lastly we use this program to show that type bigram frequencies are not as good as token bigram frequencies at predicting word identification reaction time.
The Europe Media Monitor (EMM) family of applications is a set of multilingual tools that gather, cluster and classify news in currently fifty languages and that extract named entities and quotations (reported speech) from twenty languages. In this paper, we describe the recent effort of adding the African Bantu language Swahili to EMM. EMM is designed in an entirely modular way, allowing plugging in a new language by providing the language-specific resources for that language. We thus describe the type of language-specific resources needed, the effort involved, and ways of boot-strapping the generation of these resources in order to keep the effort of adding a new language to a minimum. The text analysis applications pursued in our efforts include clustering, classification, recognition and disambiguation of named entities (persons, organisations and locations), recognition and normalisation of date expressions, as well as the identification of reported speech quotations by and about people.
Sensory experience rating (SER), a new variable motivated by the grounded cognition framework of conceptual processing (e.g., Barsalou, 2008 ), indexes the degree to which a word evokes sensory/perceptual experiences. In the present study, SERs were collected for over 2,850 words. While SER is correlated with imageability, age of acquisition, and word frequency, the latter variables (along with seven others) account for less than 30{\%} of the variance in SER. Reanalyses of two large-scale studies demonstrate that SER significantly predicts lexical decision times when other established predictor variables are statistically controlled. These results suggest that conceptual processing is grounded in sensory systems. Additionally, a major benefit of this variable is that it allows psycholinguistic researchers to examine semantic-perceptual links for all word classes with a single rating.
Research in machine translation and corpus annotation has greatly benefited from the increasing availability of word-aligned parallel corpora. This paper presents ongoing research on the development and application of the sawa corpus, a two-million-word parallel corpus English—Swahili. We describe the data collection phase and zero in on the difficulties of finding appropriate and easily accessible data for this language pair. In the data annotation phase, the corpus was semi-automatically sentence and word-aligned and morphosyntactic information was added to both the English and Swahili portion of the corpus. The annotated parallel corpus allows us to investigate two possible uses. We describe experiments with the projection of part-of-speech tagging annotation from English onto Swahili, as well as the development of a basic statistical machine translation system for this language pair, using the parallel corpus and a consolidated database of existing English—Swahili translation dictionaries. We particularly focus on the difficulties of translating English into the morphologically more complex Bantu language of Swahili.
The psychological community frequently investigates semantic norms of properties produced by native speakers after being presented concept words, and these norms are of great value for a wide variety of psychological experiments. This paper presents a new set of norms that includes a collection of properties from a production experiment for the German and the Italian languages. Stimuli consisted of 50 concrete objects taken from 10 different concept classes. The data comprise annotations of semantic relation types and several statistical measures, which facilitate the comparison of the two target languages.
In Experiment 1, separate samples rated nouns on danger, using either an online survey or the same survey in person. In Experiment 2, a single sample rated words on familiarity, using both methods. Women's in-person and online ratings correlated significantly better than men's. In-person ratings correlated significantly better with existing norms in 4 of 8 instances. There were significant effects of condition on mean ratings and completion times. Ratings from participants who withdrew from the experiment correlated significantly less well with existing norms than did ratings from those who completed the whole experiment, in 12 of 16 instances. Analysis of existing data showed that a different statistical conclusion is reached depending on whether in-person or online ratings are used. Furthermore, the categorization of 17.9{\%} (Experiment 1) and 5.3{\%} (Experiment 2) of the items as high or low depends on which ratings are used. Ratings gathered in person and online cannot be freely substituted.
Various areas of research (e.g., memory, metamemory, visual word recognition, associative priming) rely on the careful construction of reliable word lists. ListChecker Pro 1.2 is a computer program that accesses the University of South Florida word association norms (Nelson, McEvoy, {\&} Schreiber, 1998, 2004) to report characteristics of words (e.g., frequency, concreteness), as well as direct and indirect associative relationships (e.g., shared associates, mediators). The present article presents the input requirements, menu options, and output obtained by ListChecker Pro 1.2. In addition, a randomly selected list of words from the associative versus semantic priming literature was submitted to ListChecker Pro 1.2 to demonstrate how seemingly unrelated words can be associated. The zipped file containing the program and database can be downloaded from www.eakinmemorylab.psychology.msstate.edu.
This study addresses the need in discourse psychology for computational techniques that analyze text on multiple levels of cohesion and text difficulty. Discourse psychologists often investigate phenomena related to discourse processing using lengthy texts containing multiple paragraphs, as opposed to single word and sentence stimuli. Characterizing such texts in terms of cohesion and coherence is challenging. Some computational tools are available, but they are either fragmented over different databases or they assess single, specific features of text. Coh-Metrix is a computational linguistic tool that measures text cohesion and text difficulty on a range of word, sentence, paragraph, and discourse dimensions. This study investigated the validity of Coh-Metrix as a measure of cohesion in text using stimuli from published discourse psychology studies as a benchmark. Results showed that Coh-Metrix indexes of cohesion (individually and combined) significantly distinguished the high- versus low-cohesion versions of these texts. The results also showed that commonly used readability indexes (e.g., Flesch-Kincaid) inappropriately distinguished between low- and high-cohesion texts. These results provide a validation of Coh-Metrix, thereby paving the way for its use by researchers in cognitive science, discourse processes, and education, as well as for textbook writers, professionals in instructional design, and instructors. 2010 Copyright Taylor and Francis Group, LLC.
In the present study, we presented picture-naming latencies along with ratings for a set of important characteristics of pictures and picture names: age of acquisition, frequency, picture-name agreement, name agreement, visual complexity, familiarity, and word length. The validity of these data was established by calculating correlations with previous studies. Regression analyses show that our ratings account for a larger amount of variance in RTs than do previous data. RTs were predicted by all variables except complexity and length. A complete database presenting details about all of these variables is available in the supplemental materials, downloadable from http://brm.psychonomic-journals.org/content/supplemental.
Hyperspace analog to language (HAL) is a high-dimensional model of semantic space that uses the global co-occurrence frequency of words in a large corpus of text as the basis for a representation of semantic memory. In the original HAL model, many parameters were set without any a priori rationale. We have created and publicly released a computer application, the High Dimensional Explorer (HiDEx), that makes it possible to systematically alter the values of these parameters to examine their effect on the co-occurrence matrix that instantiates the model. We took an empirical approach to understanding the influence of the parameters on the measures produced by the models, looking at how well matrices derived with different parameters could predict human reaction times in lexical decision and semantic decision tasks. New parameter sets give us measures of semantic density that improve the model's ability to predict behavioral measures. Implications for such models are discussed.
Despite the ubiquity and importance of metaphor in thought and communication, its neural mediation remains elusive. We suggest that this uncertainty reflects, in part, stimuli that have not been designed with recent conceptual frameworks in mind or that have been hampered by inadvertent differences between metaphoric and literal conditions. In this article, we begin addressing these shortcomings by developing a large, flexible, extensively normed, and theoretically motivated set of metaphoric and literal sentences. On the basis of the results of three norming studies, we provide 280 pairs of closely matched metaphoric and literal sentences that are characterized along 10 dimensions: length, frequency, concreteness, familiarity, naturalness, imageability, figurativeness, interpretability, valence, and valence judgment reaction time. In addition to allowing for control of these potentially confounding lexical and sentential factors, these stimuli are designed to address questions about the role of novelty, metaphor type, and sensory-motor grounding in determining the neural basis of metaphor comprehension. Supplemental data for this article may be downloaded from http://brm.psychonomic-journals.org/content/supplemental.
This study presents a set of sentence contexts and their cloze probabilities for European Portuguese children and adolescents. Seventy-three sentence contexts (35 low- and 38 high-constraint sentence stems) were presented to 90 children and 102 adolescents. Participants were asked to complete the sentence contexts with the first word that came to mind. For each sentence context, responses were listed and cloze probabilities of the words that were chosen to complete the sentence context were computed. Additionally, idiosyncratic and invalid responses (structural and semantic errors) were analyzed. A high degree of consistency in responses among the two age samples (children and adolescents) was found, along with a decrease of idiosyncratic and invalid responses in older participants. These results shed light on age-related changes in the effects of linguistic context on word production, and also in knowledge's representation. The full set of norms may be downloaded from http://brm.psychonomic-journals.org/content/supplemental.
This paper describes the development of the Wildcat Corpus of native- and foreign-accented English,a corpus containing scripted and spontaneous speech recordings from 24 native speakers of American English and 52 non-native speakers of English.The core element of this corpus is a set of spontaneous speech recordings, for which a new method of eliciting dialogue-based, laboratory-quality speech recordings was developed (the Diapix task). Dialogues between two native speakers of English, between two non-native speakers of English (with either shared or different LIs), and between one native and one non-native speaker of English are included and analyzed in terms of general measures of communicative efficiency.The overall finding was that pairs of native talkers were most efficient, followed by mixed native/non-native pairs and non-native pairs with shared LI. Non-native pairs with different LIs were least efficient.These results support the hypothesis that successful speech communication depends both on the alignment of talkers to the target language and on the alignment of talkers to one another in terms of native language background.
This article describes the design and implementation of a Dutch Electronic Lexicon of Multiword Expressions (DuELME). DuELME describes the core properties of over 5,000 Dutch multiword expressions. This article gives an overview of the decisions made in order to come to a standard lexical representation and discusses the description fields this representation comprises. We discuss the approach taken, which is innovative since it is based on the Equivalence Class Method (ECM). It is shown that introducing parameters to the ECM optimizes the method. The selection of the lexical entries and their properties is corpus-based. We describe the extraction of candidate expressions from corpora and discuss the selection criteria of the lexical entries. Moreover, we present the results of an evaluation of the standard representation in Alpino, a Dutch dependency parser.
There is increasing interest in the role that manipulability plays in processing objects. To date, Magni{\'{e}}, Besson, Poncet, and Dolisi's (2003) manipulability ratings, based on the degree to which objects can be uniquely pantomimed, have been the reference point for many studies. However, these ratings do not fully capture some relevant dimensions of manipulability, including whether an object is graspable and the extent to which functional motor associations above and beyond graspability are present. To address this, we collected ratings of these dimensions, in addition to ratings of familiarity and age of acquisition (AoA), for a set of 320 black-and-white photographs of objects. Familiarity and AoA ratings were highly correlated with previously reported ratings of the same dimensions (r = .853, p {\textless} .001, and r = .771, p {\textless} .001, respectively), validating the present norms. Grasping and functional use ratings, in contrast, were more moderately correlated with Magni{\'{e}} et al.'s pantomime manipulability ratings (r = .507, p {\textless} .001). These results were taken as evidence that the new manipulability ratings collected in this research capture distinct aspects of object manipulability. The complete stimuli and norms from this study may be downloaded from http://brm.psychonomic-journals.org/content/supplemental.
This article presents affective ratings for 210 British English and Finnish nouns, including taboo words. The norms were collected with 135 native British English and 304 native Finnish speakers, who rated the words according to their emotional valence, emotional charge, offensiveness, concreteness, and familiarity. The ratings between the two languages were found to be strongly correlated. The present ratings were also strongly correlated with the American English emotional valence and arousal ratings available in the Affective Norms for English Words database (Bradley {\&} Lang, 1999) and the Janschewitz (2008) database for taboo words. These ratings will help researchers to select stimulus materials for a wide range of experiments involving both monolingual and bilingual processing of British English and Finnish emotional words. Materials associated with this article may be accessed as an online supplement from http://brm.psychonomic-journals.org/content/supplemental.
A set of semantically neutral sentences and derived pseudosentences was produced by two native European Portuguese speakers varying emotional prosody in order to portray anger, disgust, fear, happiness, sadness, surprise, and neutrality. Accuracy rates and reaction times in a forced-choice identification of these emotions as well as intensity judgments were collected from 80 participants, and a database was constructed with the utterances reaching satisfactory accuracy (190 sentences and 178 pseudosentences). High accuracy (mean correct of 75{\%} for sentences and 71{\%} for pseudosentences), rapid recognition, and high-intensity judgments were obtained for all the portrayed emotional qualities. Sentences and pseudosentences elicited similar accuracy and intensity rates, but participants responded to pseudosentences faster than they did to sentences. This database is a useful tool for research on emotional prosody, including cross-language studies and studies involving Portuguese-speaking participants, and it may be useful for clinical purposes in the assessment of brain-damaged patients. The database is available for download from http://brm.psychonomic-journals.org/content/supplemental.
The Corpus of Contemporary American English is the first large, genre-balanced corpus of any language, which has been designed and constructed from the ground up as a ‘monitor corpus', and which can be used to accurately track and study recent changes in the language. The 400 million words corpus is evenly divided between spoken, fiction, popular magazines, newspapers, and academic journals. Most importantly, the genre balance stays almost exactly the same from year to year, which allows it to accurately model changes in the ‘real world'. After discussing the corpus design, we provide a number of concrete examples of how the corpus can be used to look at recent changes in English, including morphology (new suffixes –friendly and –gate), syntax (including prescriptive rules, quotative like, so not ADJ, the get passive, resultatives, and verb complementation), semantics (such as changes in meaning with web, green, or gay), and lexis––including word and phrase frequency by year, and using the corpus architecture to produce lists of all words that have had large shifts in frequency between specific historical periods.
Previous evidence has shown that word frequencies calculated from corpora based on film and television subtitles can readily account for reading performance, since the language used in subtitles greatly approximates everyday language. The present study examines this issue in a society with increased exposure to subtitle reading. We compiled SUBTLEX-GR, a subtitled-based corpus consisting of more than 27 million Modern Greek words, and tested to what extent subtitle-based frequency estimates and those taken from a written corpus of Modern Greek account for the lexical decision performance of young Greek adults who are exposed to subtitle reading on a daily basis. Results showed that SUBTLEX-GR frequency estimates effectively accounted for participants' reading performance in two different visual word recognition experiments. More importantly, different analyses showed that frequencies estimated from a subtitle corpus explained the obtained results significantly better than traditional frequencies derived from written corpora.
Ratings were collected from 102 native speakers of Spanish on the subjective frequency of occurrence of 330 Spanish words, including 120 deverbal compounds and their constituents. These ratings were found to be highly reliable, whether items were analyzed together or separately by type (i.e., compounds, nouns, verbs), as evidenced by indexes of internal consistency and test-retest reliability that were equal to or greater than.98. The validity of the normative ratings was attested to by statistically significant correlations with objective frequency, estimated at.63 for all items together, and.41,.51, and.78 for compounds, nouns, and verbs, respectively. Among the substantive issues addressed was the potential dependency in ratings for compounds and their associated verb-noun constituents. No relationship was discerned, supporting the idea that compound and constituent ratings are statistically independent in this experimental task. The theoretical and methodological implications of the findings are discussed. The ratings can be downloaded from http://brm.psychonomic-journals.org/content/supplemental.
This technical report describes the implementation and use of ChildFreq, a tool for assessing lexical norms of children from one to seven years old. As the name implies, ChildFreq works by extracting word frequencies from a large corpus of child language. These can then be ordered by age or mean length of utterance, and it is also possible to split the data by the children's gender. A query of words to count the frequency of produces both a line chart and a table with more detailed information. The child language data is taken from the English part of the CHILDES database 1 and comprises more than 5,000 transcriptions ,a total of ≈ 3, 500, 000 word tokens. The children's ages range from six months to seven years, with most children being three years old. ChildFreq is freely available online at http://childfreq.sumsar.net .
The article reports on the importance and impact of the electronic corpora in linguistics. It mentions that since the electronic corpus of English, problem exists for corpus compilers because of the different needs of linguists. It suggests that the problem lies on the database of the text descriptors that would guide researchers through the existing repositories to enable researchers to create their own corpora.
In this paper we introduce the first version of noWaC, a large web-based corpus of Bokm{\aa}l Norwegian currently containing about 700 million tokens. The corpus has been built by crawling, downloading and processing web documents in the .no top-level internet domain. The procedure used to collect the noWaC corpus is largely based on the techniques described by Ferraresi et al. (2008). In brief, first a set of "seed" URLs containing documents in the target language is collected by sending queries to commercial search engines (Google and Yahoo). The obtained seeds (overall 6900 URLs) are then used to start a crawling job using the Heritrix web-crawler limited to the .no domain. The downloaded documents are then processed in various ways in order to build a linguistic corpus (e.g. filtering by document size, language identification, duplicate and near duplicate detection, etc.).
Memory researchers using paired associates have benefited greatly from the Swahili-English norms reported by Nelson and Dunlosky (1994). Given recent increases in the amount and kinds of research using paired associates, however, researchers would now benefit from an expanded set of normative measures for foreign language vocabulary words. We report data for 120 Lithuanian-English word pairs collected from 236 undergraduates. Participants completed three study-test trials and were asked to make metacognitive judgments for each item. We report normative recall performance, recall latencies, and error types for each item across trials, as well as the perceived difficulty of each item on the basis of metacognitive judgments.
We present a new database of Dutch word frequencies based on film and television subtitles, and we validate it with a lexical decision study involving 14,000 monosyllabic and disyllabic Dutch words. The new SUBTLEX frequencies explain up to 10{\%} more variance in accuracies and reaction times (RTs) of the lexical decision task than the existing CELEX word frequency norms, which are based largely on edited texts. As is the case for English, an accessibility measure based on contextual diversity explains more of the variance in accuracy and RT than does the raw frequency of occurrence counts. The database is freely available for research purposes and may be downloaded from the authors' university site at http://crr.ugent.be/subtlex-nl or from http://brm.psychonomic-journals.org/content/supplemental.
Age of acquisition (AoA) is an important psycholinguistic variable that affects the speed and accuracy of lexical processing in tasks such as word naming, picture naming, and lexical decision. In the present work, we collected AoA ratings for 1,749 Portuguese words (nouns, verbs, adjectives, and adverbs), using a 9-point scale that was first proposed by Carroll and White (1973). We analyzed the relation between AoA ratings and other psycholinguistic variables (length measures, neighborhood density, written-word frequency, familiarity, imageability, and concreteness), and we assessed reliability by correlating our ratings with those from other databases presented for Portuguese, English, Spanish, and Italian. The full database can be downloaded from http://brm.psychonomic-journals.org/content/supplemental.
The present article introduces SYLLABARIUM, a new Web tool addressing the needs of linguists, psycholinguists, and cognitive scientists who work with Spanish and/or Basque and are interested in retrieving information about several syllable-related parameters. This new online syllabic database allows the user to generate complete lists of Spanish and Basque syllables with information about the syllable frequency. Among other measures, for a given orthographic syllable, SYLLABARIUM provides its number of occurrences (i.e., the type frequency), the summed lexical frequency of the words that contain this syllable (i.e., the token frequency), and the positional distribution of type and token frequencies. The cross-language feature of SYLLABARIUM is of special interest to researchers aiming to explore the influence of the syllable in bilingualism. The Web tool is available at www.bcbl.eu/syllabarium.
Faces are widely used as stimuli in various research fields. Interest in emotion-related differences and age-associated changes in the processing of faces is growing. With the aim of systematically varying both expression and age of the face, we created FACES, a database comprising N = 171 naturalistic faces of young, middle-aged, and older women and men. Each face is represented with two sets of six facial expressions (neutrality, sadness, disgust, fear, anger, and happiness), resulting in 2,052 individual images. A total of N = 154 young, middle-aged, and older women and men rated the faces in terms of facial expression and perceived age. With its large age range of faces displaying different expressions, FACES is well suited for investigating developmental and other research questions on emotion, motivation, and cognition, as well as their interactions. Information on using FACES for research purposes can be found at http://faces.mpib-berlin.mpg.de.
The main objective of this study is to investigate the abstract-concrete dichotomy by introducing a new variable: the mode of acquisition (MoA) of a concept. MoA refers to the way in which concepts are acquired: through experience, through language, or through both. We asked 250 participants to rate 417 words on seven dimensions: age of acquisition, concreteness, familiarity, context availability, imageability, abstractness, and MoA. The data were analyzed by considering MoA ratings and their relationship with the other psycholinguistic variables. Distributions for concreteness, abstractness, and MoA ratings indicate that they are qualitatively different. A partial correlation analysis revealed that MoA is an independent predictor of concreteness or abstractness, and a hierarchical multiple regression analysis confirmed MoA as being a valid predictor of abstractness. Strong correlations with measures for the English translation equivalents in the MRC database confirmed the reliability of our norms. The full database of MoA ratings and other psycholinguistic variables may be downloaded from http://brm.psychonomic-journals.org/content/supplemental or www.abstract-project.eu.
We present the MATCH corpus, a unique data set of 447 dialogues in which 26 older and 24 younger adults interact with nine different spoken dialogue systems. The systems varied in the number of options presented and the confirmation strategy used. The corpus also contains information about the users’ cognitive abilities and detailed usability assessments of each dialogue system. The corpus, which was collected using a Wizard-of-Oz methodology, has been fully transcribed and annotated with dialogue acts and “Information State Update” (ISU) representations of dialogue context. Dialogue act and ISU annotations were performed semi-automatically. In addition to describing the corpus collection and annotation, we present a quantitative analysis of the interaction behaviour of older and younger users and discuss further applications of the corpus. We expect that the corpus will provide a key resource for modelling older people’s interaction with spoken dialogue systems.
There are currently stimuli with published norms available to study several psychological aspects of language and visual cognitions. Norms represent valuable information that can be used as experimental variables or systematically controlled to limit their potential influence on another experimental manipulation. The present work proposes 480 photo stimuli that have been normalized for name, category, familiarity, visual complexity, object agreement, viewpoint agreement, and manipulability. Stimuli are also available in grayscale, blurred, scrambled, and line-drawn version. This set of objects, the Bank Of Standardized Stimuli (BOSS), was created specifically to meet the needs of scientists in cognition, vision and psycholinguistics who work with photo stimuli.
We describe annotation of multiword expressions (MWEs) in the Prague dependency treebank, using several automatic pre-annotation steps. We use subtrees of the tectogrammatical tree structures of the Prague dependency treebank to store representations of the MWEs in the dictionary and pre-annotate following occurrences automatically. We also show a way to measure reliability of this type of annotation.
The study of the cognitive processes in the production of language demands careful selection of stimuli and requires normative databases. The main goal of the present research was to collect normative data for the set of 400 figures taken from Cycowicz, Friedman, Rothstein, and Snodgrass (1997; including the 260 figures of Snodgrass {\&} Vanderwart, 1980) using a sample of native Argentinean Spanish speakers. The pictures have been standardized on the following variables: name agreement, image agreement, familiarity, visual complexity, image variability, age of acquisition, and word association. The obtained norms were compared with the normative data of other studies in Spanish, English, and French. This comparison highlights the variability of some of the measures (e.g., name agreement in naming and verbal association) across the different studies and confirms the necessity of elaborating specific norms that are adapted to the studied population's linguistic and sociocultural context. The norms described may be downloaded as supplemental materials for this article from http://brm.psychonomic-journals.org/content/supplemental.
Verb bias, or the tendency of a verb to appear with a certain type of complement, has been employed in psycholinguistic literature as a tool to test competing models of sentence processing. To date, the vast majority of sentence processing research involving verb bias has been conducted almost exclusively with monolingual speakers, and predominantly with monolingual English speakers, despite the fact that most of the world's population is bilingual. To test the generality of competing theories of sentence comprehension, it is important to conduct cross-linguistic studies of sentence processing and to add bilingual data to theories of sentence comprehension. Given this, it is critical for the field to develop verb bias estimates from monolingual speakers of languages other than English and from bilingual populations. We begin to address these issues in two norming studies. Study 1 provides verb bias norming data for 135 Spanish verbs. A second aim of Study 1 was to determine whether verb bias estimates remain stable over time. In Study 2, we asked whether Spanish-English speakers are able to learn verb-specific information, such as verb bias, in their second language. The answer to this question is critical to conducting studies that examine when, during the course of sentence comprehension, bilingual speakers exploit verb information specific to the second language. To facilitate cross-linguistic work, we compared our verb bias results with those provided by monolingual English speakers in a previous norming study conducted by Garnsey, Lotocky, Pearlmutter, and Myers (1997). Our Spanish data demonstrated that individual verbs showed significant similarities in their verb bias across the 3 years of data collection. We also show that bilinguals are able to learn the biases of verbs in their second language, even when immersed in the first language environment. Appendixes A-C, containing the bilingual norms discussed in the article, may be downloaded from http://brm.psychonomic-journals.org/content/supplemental.
An online calculator was developed (www.bncdnet.ku.edu/cml/info{\_}ccc.vi) to compute phonotactic probability--the likelihood of occurrence of a sound sequence--and neighborhood density--the number of phonologically similar words--on the basis of child corpora of American English (Kolson, 1960; Moe, Hopkins, {\&} Rush, 1982) and to compare its results to those of an adult calculator. Phonotactic probability and neighborhood density were computed for a set of 380 nouns (Fenson et al., 1993) using both the child and adult corpora. The child and adult raw values were significantly correlated. However, significant differences were detected. Specifically, child phonotactic probability was higher than adult phonotactic probability, especially for high-probability words, and child neighborhood density was lower than adult neighborhood density, especially for words with high-density neighborhoods. These differences were reduced or eliminated when relative measures (i.e., z scores) were used. Suggestions are offered regarding which values to use in future research.
Word stem completion tasks involve showing participants a number of words and then later asking them to complete word stems to make a full word. If the stem is completed with one of the studied words, it indicates memory. It is a test widely used to assess both implicit and explicit forms of memory. An important aspect of stimulus selection is that target words should not frequently be generated spontaneously from the word stem, to ensure that production of the word really represents memory. In this article, we present a database of spontaneous stem completion rates for 395 stems from a group of 80 British undergraduate psychology students. It includes information on other characteristics of the words (word frequency, concreteness, imageability, age of acquisition, common part of speech, and number of letters) and, as such, can be used to select suitable words to include in a stem completion task. Supplemental materials for this article may be downloaded from http://brm.psychonomic-journals.org/content/supplemental.
Our purpose in the present study is to provide a normative set of nonsensical pictures known as droodles and to demonstrate the role of semantic comprehension in facilitating recall of pictorial stimuli. The set consists of 98 pairs of droodles. Experiment 1 standardized these pictorial stimuli with respect to several variables, such as appropriateness of verbal labels, relationship between two droodles, and correct recall. Appropriateness of verbal labels was rated higher for pictures presented in pairs than for pictures presented singly. Experiment 2 used the standardized set of droodles in a recall experiment similar to those of Bower, Karlin, and Dueck (1975) and others. As we expected, semantic interpretation can strongly facilitate recall. Multiple regression analysis showed that several measures had significant power of explanation for recall performance. The full set of norms and pictures from this article may be downloaded from http://brm.psychonomic-journals.org/content/supplemental.
To facilitate investigations of verbal emotional processing, we introduce$\backslash$nthe Leipzig Affective Norms for German (LANG), a list of 1,000 German$\backslash$nnouns that have been rated for emotional valence, arousal, and concreteness.$\backslash$nA critical factor regarding the quality of normative word data is$\backslash$ntheir reliability. We therefore acquired ratings from a sample that$\backslash$nwas tested twice, with an interval of 2 years, to calculate test-retest$\backslash$nreliability. Furthermore, we recruited a second sample to test reliability$\backslash$nacross independent samples. The results show (1) the typical quadratic$\backslash$nrelation of valence and arousal, replicating previous data, (2) very$\backslash$nhigh test-retest reliability ({\textgreater}.95), and (3) high correlations between$\backslash$nthe two samples ({\textgreater}.85). Because the range of ratings was also very$\backslash$nhigh, we provide a comprehensive set of words with reliable affective$\backslash$nnorms, which makes it possible to select highly controlled subsamples$\backslash$nvarying in emotional status. The database is available as a supplement$\backslash$nfor this article at http://brm.psychonomic-journals.org/content/supplemental.
The development of a set of everyday, nonverbal, digitized sounds for use in auditory confrontation naming applications is described. Normative data are reported for 120 sounds of varying lengths representing a wide variety of acoustic events such as sounds produced by animals, people, musical instruments, tools, signals, and liquids. In Study 1, criteria for scoring naming accuracy were developed and rating data were gathered on degree of confidence in sound identification and the perceived familiarity, complexity, and pleasantness of the sounds. In Study 2, the previously developed criteria for scoring naming accuracy were applied to the naming responses of a new sample of subjects, and oral naming times were measured. In Study 3 data were gathered on how subjects categorized the sounds: In the first categorization task - free classification - subjects generated category descriptions for the sounds; in the second task - constrained classification - a different sample of subjects selected the most appropriate category label for each sound from a list of 27 labels generated in the first task. Tables are provided in which the 120 stimuli are sorted by familiarity, complexity, pleasantness, duration, naming accuracy, speed of identification, and category placement. The. WAV sound files are freely available to researchers and clinicians via a sound archive on the World Wide Web; the URL is http://www.cofc.edu/{\~{}}marcellm/confront.htm.
In recent years, psycholinguistics has seen a remarkable growth of research based on the analysis of data from large-scale studies of word recognition, in particular lexical decision and word naming. We present the data of the Dutch Lexicon Project (DLP) in which a group of 39 participants made lexical decisions to 14,000 words and the same number of nonwords. To examine whether the extensive practice precludes comparison with the traditional short experiments, we look at the differences between the first and the last session, compare the results with the English Lexicon Project (ELP) and the French Lexicon Project (FLP), and examine to what extent established findings in Dutch psycholinguistics can be replicated in virtual experiments. Our results show that when good nonwords are used, practice effects are minimal in lexical decision experiments and do not invalidate the behavioral data. For instance, the word frequency curve is the same in DLP as in ELP and FLP. Also, the Dutch-English cognate effect is the same in DLP as in a previously published factorial experiment. This means that large-scale word recognition studies can make use of psychophysical and psychometrical approaches. In addition, our data represent an important collection of very long series of individual reaction times that may be of interest to researchers in other areas.
In this article, we present a new lexical database for Modern Standard Arabic: Aralex. Based on a contemporary text corpus of 40 million words, Aralex provides information about (1) the token frequencies of roots and word patterns, (2) the type frequency, or family size, of roots and word patterns, and (3) the frequency of bigrams, trigrams in orthographic forms, roots, and word patterns. Aralex will be a useful tool for studying the cognitive processing of Arabic through the selection of stimuli on the basis of precise frequency counts. Researchers can use it as a source of information on natural language processing, and it may serve an educational purpose by providing basic vocabulary lists. Aralex is distributed under a GNU-like license, allowing people to interrogate it freely online or to download it from www.mrc-cbu.cam.ac.uk:8081/aralex.online/login.jsp.
In the present study, we provide a new technique for the collection of homograph norms that reduces subjectivity in the determination of meaning dominance by allowing participants rather than experimenters to indicate to which meaning or meanings the associates were related. To evaluate the effectiveness of this new technique, a subset of homograph norms were used in three separate experiments, demonstrating that (1) when presented with additional meaning categories, participants classified the associates consistently into the primary and secondary meaning categories; (2) overall, the participants were most familiar with primary meanings, followed by secondary, tertiary, and quaternary meanings; and (3) the meaning categories provided to the participants during norms collection were appropriate, since the two meanings provided for each homograph by the participants were consistent with the original data. Finally, in a fourth experiment, we compared the results of this new technique with a parallel set collected in Australia. The high degree of similarity in the results provides validity for this procedure. The homograph norms discussed in this article may be downloaded from http://brm.psychonomic-journals.org/content/supplemental.
This article describes the preparation, recording and orthographic transcription of a new speech corpus, the Nijmegen Corpus of Casual French (NCCFr). The corpus contains a total of over 36 h of recordings of 46 French speakers engaged in conversations with friends. Casual speech was elicited during three different parts, which together provided around 90 min of speech from every pair of speakers. While Parts 1 and 2 did not require participants to perform any specific task, in Part 3 participants negotiated a common answer to general questions about society. Comparisons with the ESTER corpus of journalistic speech show that the two corpora contain speech of considerably different registers. A number of indicators of casualness, including swear words, casual words, verlan, disfluencies and word repetitions, are more frequent in the NCCFr than in the ESTER corpus, while the use of double negation, an indicator of formal speech, is less frequent. In general, these estimates of casualness are constant through the three parts of the recording sessions and across speakers. Based on these facts, we conclude that our corpus is a rich resource of highly casual speech, and that it can be effectively exploited by researchers in language science and technology. {\textcopyright} 2009 Elsevier B.V. All rights reserved.
In this paper the transcription and evaluation of the corpus DIMEx100 for Mexican Spanish is presented. First we describe the corpus and explain the linguistic and computational motivation for its design and collection process; then, the phonetic antecedents and the alphabet adopted for the transcription task are presented; the corpus has been transcribed at three different granularity levels, which are also specified in detail. The corpus statistics for each transcription level are also presented. A set of phonetic rules describing phonetic context observed empirically in spontaneous conversation is also validated with the transcription. The corpus has been used for the construction of acoustic models and a phonetic dictionary for the construction of a speech recognition system. Initial performance results suggest that the data can be used to train good quality acoustic models.
Three decades after their publication, Bloom and Fischler's (1980) sentence completion norms continue to demonstrate widespread utility. The aim of the present study was to extend this contribution by expanding the existing database of high-constraint, high cloze probability sentences. Using the criteria established by Bloom and Fischler, we constructed 398 new sentences and presented these along with 100 sentences from their original list to be normed using a sample of 400 participants. Of the 498 sentences presented, 400 met criteria for high cloze probability-that is, .67 or higher probability of being completed by a specific single word. Of these, 321 sentences were from the new set and an additional 79 were from Bloom and Fischler's set. A high degree of correspondence was observed between responses obtained by Bloom and Fischler for their high-constraint set. A second experiment utilized an N400 event-related potential paradigm to provide further validation of the contextual constraint for the newly generated set. As expected, N400 amplitude was greater for sentences that violated contextual expectancy by ending in a word other than the newly established completion norm. Sentence completion norms are frequently used in cognitive research, and this larger database of high cloze probability sentences is expected to be of benefit to the research community for many years to come. The full set of stimuli and sentence completion norms from this study may be downloaded from http://brm.psychonomic-journals.org/content/supplemental.
This article describes the enrichment of the AnCora corpora of Spanish and Catalan (400 k each) with coreference links between pronouns (including elliptical subjects and clitics), full noun phrases (including proper nouns), and discourse segments. The coding scheme distinguishes between identity links, predicative relations, and discourse deixis. Inter-annotator agreement on the link types is 85–89% above chance, and we provide an analysis of the sources of disagreement. The resulting corpora make it possible to train and test learning-based algorithms for automatic coreference resolution, as well as to carry out bottom-up linguistic descriptions of coreference relations as they occur in real data.
Parent report has proven a valid and cost-effective means of evaluating early child language. Norming datasets for these instruments, which provide the basis for standardized comparisons of individual children to a population, can also be used to derive norms for the acquisition of individual words in production and comprehension and also early gestures and symbolic actions. These lexical norms have a wide range of uses in basic research, assessment and intervention. In addition, cross-linguistic comparisons of lexical development are greatly facilitated by the availability of norms from diverse languages. This report describes the development of CLEX, a new web-based cross-linguistic database for lexical data from adaptations of the MacArthur-Bates Communicative Development Inventories. CLEX provides tools for a range of analyses within and across languages. It is designed to incorporate additional language datasets easily, and to permit users to define mappings between lexical items in pairs of languages for more specific cross-linguistic comparisons.
The present study provides a set of objective age of acquisition (AoA) norms for 223 Italian words that may be useful for conducting cross-linguistic studies or experiments on Italian language processing. The data were collected by presenting children from the ages of 2 to 11 with a normed picture set (Lotto, Dell'Acqua, {\&} Job, 2001). Following the study of Morrison, Chappell, and Ellis (1997), we report two measures of objective AoA. Both measures strongly correlated with each other, and they also showed a good correlation with the rated AoA provided by adult participants. Furthermore, we assessed the relationship between the AoA measures and other variables used in psycholinguistic experiments. Regression analyses showed that familiarity, typicality, and word frequency were significant predictors of AoA. AoA, but not word frequency, was found to determine naming latencies. Finally, we present a path model in which AoA is a mediator in predicting speed in picture naming. The norms and the picture set can also be downloaded from http://dpss.psy.unipd.it/files/strumenti.php and from http://brm.psychonomic-journals.org/content/supplemental.
This paper describes a multichannel acoustic data collection recorded under the European DICIT project, during Wizard of Oz (WOZ) experiments carried out at FAU and FBK-irst laboratories. The application of interest in DICIT is a distant-talking interface for control of interactive TV working in a typical living room, with many interfering devices. The objective of the experiments was to collect a database supporting efficient development and tuning of acoustic processing algorithms for signal enhancement. In DICIT, techniques for sound source localization, multichannel acoustic echo cancellation, blind source separation, speech activity detection, speaker identification and verification as well as beamforming are combined to achieve a maximum possible reduction of the user speech impairments typical of distant-talking interfaces. The collected database permitted to simulate at preliminary stage a realistic scenario and to tailor the involved algorithms to the observed user behaviors. In order to match the project requirements, the WOZ experiments were recorded in three languages: English, German and Italian. Besides the user inputs, the database also contains non-speech related acoustic events, room impulse response measurements and video data, the latter used to compute three-dimensional positions of each subject. Sessions were manually transcribed and segmented at word level, introducing also specific labels for acoustic events.
Pseudowords play an important role in psycholinguistic experiments, either because they are required for performing tasks, such as lexical decision, or because they are the main focus of interest, such as in nonword-reading and nonce-inflection studies. We present a pseudoword generator that improves on current methods. It allows for the generation of written polysyllabic pseudowords that obey a given language's phonotactic constraints. Given a word or nonword template, the algorithm can quickly generate pseudowords that match the template in subsyllabic structure and transition frequencies without having to search through a list with all possible candidates. Currently, the program is available for Dutch, English, German, French, Spanish, Serbian, and Basque, and, with little effort, it can be expanded to other languages.
This paper describes a recently completed common resource for the study of spoken discourse, the NXT-format Switchboard Corpus. Switchboard is a long-standing corpus of telephone conversations (Godfrey et al. in SWITCHBOARD: Telephone speech corpus for research and development. In Proceedings of ICASSP-92, pp. 517–520, 1992). We have brought together transcriptions with existing annotations for syntax, disfluency, speech acts, animacy, information status, coreference, and prosody; along with substantial new annotations of focus/contrast, more prosody, syllables and phones. The combined corpus uses the format of the NITE XML Toolkit, which allows these annotations to be browsed and searched as a coherent set (Carletta et al. in Lang Resour Eval J 39(4):313–334, 2005). The resulting corpus is a rich resource for the investigation of the linguistic features of dialogue and how they interact. As well as describing the corpus itself, we discuss our approach to overcoming issues involved in such a data integration project, relevant to both users of the corpus and others in the language resource community undertaking similar projects.
Malay, a language spoken by 250 million people, has a shallow alphabetic orthography, simple syllable structures, and transparent affixation--characteristics that contrast sharply with those of English. In the present article, we first compare the letter-phoneme and letter-syllable ratios for a sample of alphabetic orthographies to highlight the importance of separating language-specific from language-universal reading processes. Then, in order to develop a better understanding of word recognition in orthographies with more consistent mappings to phonology than English, we compiled a database of lexical variables (letter length, syllable length, phoneme length, morpheme length, word frequency, orthographic and phonological neighborhood sizes, and orthographic and phonological Levenshtein distances) for 9,592 Malay words. Separate hierarchical regression analyses for Malay and English revealed how the consistency of orthography-phonology mappings selectively modulates the effects of different lexical variables on lexical decision and speeded pronunciation performance. The database of lexical and behavioral measures for Malay is available at http://brm.psychonomic-journals.org/content/supplemental.
In this study, we provide normative data for objects in a set of 80 digital color pictures (e.g., nature scenes, human activities, cartoon characters, magazine covers). In Experiment 1, four objects in each picture were rated by 48 observers on a 6-point Likert scale for their relevance to the overall meaning of the scene. In Experiment 2, Salience Toolbox software (Walther {\&} Koch, 2006) provided additional information about whether the four relevance-rated objects were located in areas that were high or low in visual salience. Brief descriptions of the four objects, their locations in the picture, their categorizations as high or low in salience, the means and standard deviations of their relevance ratings, and statistical analyses specifying which pairs of objects in a picture differed significantly on their relevance to the meaning of the scene are given in the Appendix. An example is provided of how the pictures could be used to create stimuli for a change blindness task in which detection of item onset versus offset is contrasted for low-relevance and high-relevance features. The 80 pictures are accessible as jpg files from the first author's Web site at http://marcellm.people.cofc.edu/research.htm.
The French Lexicon Project involved the collection of lexical decision data for 38,840 French words and the same number of nonwords. It was directly inspired by the English Lexicon Project (Balota et al., 2007) and produced very comparable frequency and word length effects. The present article describes the methods used to collect the data, reports analyses on the word frequency and the word length effects, and describes the Excel files that make the data freely available for research purposes. The word and pseudoword data from this article may be downloaded from http://brm.psychonomic-journals.org/content/supplemental.
A new stimulus set of 60 male-face stimuli in seven in-depth orientations was developed. The set can be used in research on configural versus featural mechanisms of face processing. Configural, or holistic, changes are produced by changing the global form of the face, whereas featural, or part-based, changes are attained by altering the local form of internal facial features. For each face in the set, there is one other face that differs only by its global form and one other face that differs only by its internal features. In all faces, extrafacial cues have been eliminated or standardized. The stimulus set also contains a color-coded division of each face in areas of interest, which is useful for eye movement research on face scanning strategies. We report a matching experiment with upright and inverted face pairs that demonstrates that the face stimulus set is indeed useful for research on configural and featural face perception. The stimulus set may be downloaded from the Psychonomic Society's archive (brm.psychonomic-journals.org/content/supplemental) or from our Web site (http://ppw.kuleuven.be/labexppsy/newSite/resources).
Some phrases can be interpreted in their context either idiomatically (figuratively) or literally. The precise identification of idioms is essential in order to achieve full-fledged natural language processing. Because of this, the authors of this paper have created an idiom corpus for Japanese. This paper reports on the corpus itself and the results of an idiom identification experiment conducted using the corpus. The corpus targeted 146 ambiguous idioms, and consists of 102,856 examples, each of which is annotated with a literal/idiomatic label. All sentences were collected from the World Wide Web. For idiom identification, 90 out of the 146 idioms were targeted and a word sense disambiguation (WSD) method was adopted using both common WSD features and idiom-specific features. The corpus and the experiment are both, as far as can be determined, the largest of their kinds. It was discovered that a standard supervised WSD method works well for idiom identification and it achieved accuracy levels of 89.25 and 88.86%, with and without idiom-specific features, respectively. It was also found that the most effective idiom-specific feature is the one that involves the adjacency of idiom constituents.
We report work on adding semantic role labels to the Chinese Treebank, a corpus already annotated with phrase structures. The work involves locating all verbs and their nominalizations in the corpus, and semi-automatically adding semantic role labels to their arguments, which are constituents in a parse tree. Although the same procedure is followed, different issues arise in the annotation of verbs and nominalized predicates. For verbs, identifying their arguments is generally straightforward given their syntactic structure in the Chinese Treebank as they tend to occupy well-defined syntactic positions. Our discussion focuses on the syntactic variations in the realization of the arguments as well as our approach to annotating dislocated and discontinuous arguments. In comparison, identifying the arguments for nominalized predicates is more challenging and we discuss criteria and procedures for distinguishing arguments from non-arguments. In particular we focus on the role of support verbs as well as the relevance of event/result distinctions in the annotation of the predicate-argument structure of nominalized predicates. We also present our approach to taking advantage of the syntactic structure in the Chinese Treebank to bootstrap the predicate-argument structure annotation of verbs. Finally, we discuss the creation of a lexical database of frame files and its role in guiding predicate-argument annotation. Procedures for ensuring annotation consistency and inter-annotator agreement evaluation results are also presented.
Recent work has shown that people routinely use perceptual information during language comprehension and conceptual processing, from single-word recognition to modality-switching costs in property verification. In investigating such links between perceptual and conceptual representations, the use of modality-specific stimuli plays a central role. To aid researchers working in this area, we provide a set of norms for 423 adjectives, each describing an object property, with mean ratings of how strongly that property is experienced through each of five perceptual modalities (visual, haptic, auditory, olfactory, and gustatory). The data set also contains estimates of modality exclusivity—that is, a measure of the extent to which a particular property may be considered unimodal (i.e., perceived through one sense alone). Although there already exists a number of sets of word and object norms, we provide the first set to categorize words describing object properties along the dimensions of the five perceptual modalities. We hope that the norms will be of use to researchers working at the interface between linguistic, conceptual, and perceptual systems. The modality exclusivity norms may be downloaded as supplemental materials for this article from brm.psychonomic-journals.org/ content/supplemental.
Subjective frequency and imageability estimates for a sample of 3,600 French nouns were collected from two independent groups of 72 young adults each. Both groups received standard instructions and provided their ratings on a 7-point scale. The timing, sequencing, presentation of lexical stimuli, and recording of responses were controlled by a computer. All estimates of internal consistency and test-retest reliability ({\textgreater} or =.98) confirm the high level of precision and reliability of the ratings. Correlations with ratings drawn from similar studies were found to be positive and significant for subjective frequency (r {\textgreater} or = .85) and for imageability (r {\textgreater} or = .69). Subjective frequency was positively and significantly correlated with objective frequency estimates drawn from 10 different sources (r {\textgreater} or = .42). Subjective frequency and imageability were significantly correlated (r = .26), a relationship that was driven primarily by a sudden drop in imageability ratings for words with a subjective frequency rating below 2.5. The methodological implications of these findings are discussed. The ratings can be downloaded as supplemental materials from brm.psychonomic-journals.org/content/supplemental.
The aim of the French Media project was to define a protocol for the evaluation of speech understanding modules for dialog systems. Accordingly, a corpus of 1,257 real spoken dialogs related to hotel reservation and tourist information was recorded, transcribed and semantically annotated, and a semantic attribute-value representation was defined in which each conceptual relationship was represented by the names of the attributes. Two semantic annotation levels are distinguished in this approach. At the first level, each utterance is considered separately and the annotation represents the meaning of the statement without taking into account the dialog context. The second level of annotation then corresponds to the interpretation of the meaning of the statement by taking into account the dialog context; in this way a semantic representation of the dialog context is defined. This paper discusses the data collection, the detailed definition of both annotation levels, and the annotation scheme. Then the paper comments on both evaluation campaigns which were carried out during the project and discusses some results.
The LEXIN database offers psycholinguistic indexes of the 13,184 different words (types) computed from 178,839 occurrences of these words (tokens) contained in a corpus of 134 beginning readers widely used in Spain. This database provides four statistical indicators: F (overall word frequency), D (index of dispersion across selected readers), U (estimated frequency per million words), and SFI (standard frequency index). It also gives information about the number of letters, syntactic category, and syllabic structure of the words included. To facilitate comparisons, LEXIN provides data from LEXESP's (Sebasti{\'{a}}n-Gall{\'{e}}s, Mart{\'{i}}, Cuetos, {\&} Carreiras, 2000), Alameda and Cuetos's (1995), and Mart{\'{i}}nez and Garc{\'{i}}a's (2004) Spanish adult psycholinguistic frequency databases. Access to the LEXIN database is facilitated by a computer program. The LEXIN program allows for the creation of word lists by letting the user specify searching criteria. LEXIN can be useful for researchers in cognitive psychology, particularly in the areas of psycholinguistics and education.
This article introduces ukWaC, deWaC and itWaC, three very large corpora of English, German, and Italian built by web crawling, and describes the methodology and tools used in their construction. The corpora contain more than a billion words each, and are thus among the largest resources for the respective languages. The paper also provides an evaluation of their suitability for linguistic research, focusing on ukWaC and itWaC. A comparison in terms of lexical coverage with existing resources for the languages of interest produces encouraging results. Qualitative evaluation of ukWaC versus the British National Corpus was also conducted, so as to highlight differences in corpus composition (text types and subject matters). The article concludes with practical information about format and availability of corpora and tools.
The appropriate selection of both pictorial and linguistic experimental stimuli requires a previous language-specific standardization process of the materials across different variables. Considering that such normative data have not yet been collected for Modern Greek, in this study normative data for the color version of the Snodgrass and Vanderwart picture set (Rossion {\&} Pourtois, 2004) were collected from 330 native Greek adults. Participants named the pictures (providing name agreement ratings) and rated them for visual complexity and age of acquisition. The obtained measures represent a useful tool for further research on Greek language processing and constitute the first picture normative study for this language. The picture norms from this study and previous ones may be downloaded from brm.psychonomic-journals.org/content/supplemental.
This study aimed to extend the International Affective Picture System (IAPS; Lang, Bradley, {\&} Cuthbert, 2005) norms by obtaining reaction time (RT) normative data for 308 selected photographs. Pictures were presented one at a time for 33, 100, or 250 msec, or under free-time display, to 96 women and 48 men. The participants' task involved assessing the emotional valence of each picture and responding as quickly as possible as to whether it was unpleasant, neutral, or pleasant. RTs provided an index of processing efficiency. The manipulation of display time served to estimate the time course in the valence identification of each picture. Some categories of depicted scenes (e.g., erotica and mutilations) were classified more consistently and efficiently than were others as pleasant or unpleasant. There were minimal differences between men and women. Overall, the present data provide researchers investigating cognition/emotion relationships with an objective criterion to select pictorial stimuli on the basis of RTs. Data for all pictures may be downloaded from brm.psychonomic-journals.org/content/supplemental.
Word frequency is the most important variable in research on word processing and memory. Yet, the main criterion for selecting word frequency norms has been the availability of the measure, rather than its quality. As a result, much research is still based on the old Kucera and Francis frequency norms. By using the lexical decision times of recently published megastudies, we show how bad this measure is and what must be done to improve it. In particular, we investigated the size of the corpus, the language register on which the corpus is based, and the definition of the frequency measure. We observed that corpus size is of practical importance for small sizes (depending on the frequency of the word), but not for sizes above 16-30 million words. As for the language register, we found that frequencies based on television and film subtitles are better than frequencies based on written sources, certainly for the monosyllabic and bisyllabic words used in psycholinguistic research. Finally, we found that lemma frequencies are not superior to word form frequencies in English and that a measure of contextual diversity is better than a measure based on raw frequency of occurrence. Part of the superiority of the latter is due to the words that are frequently used as names. Assembling a new frequency norm on the basis of these considerations turned out to predict word processing times much better than did the existing norms (including Kucera {\&} Francis and Celex). The new SUBTL frequency norms from the SUBTLEX(US) corpus are freely available for research purposes from http://brm.psychonomic-journals.org/content/supplemental, as well as from the University of Ghent and Lexique Web sites.
In this study, we compared four expert graders with latent semantic analysis (LSA) to assess short summaries of an expository text. As is well known, there are technical difficulties for LSA to establish a good semantic representation when analyzing short texts. In order to improve the reliability of LSA relative to human graders, we analyzed three new algorithms by two holistic methods used in previous research (Le{\'{o}}n, Olmos, Escudero, Ca{\~{n}}as, {\&} Salmer{\'{o}}n, 2006). The three new algorithms were (1) the semantic common network algorithm, an adaptation of an algorithm proposed by W. Kintsch (2001, 2002) with respect to LSA as a dynamic model of semantic representation; (2) a best-dimension reduction measure of the latent semantic space, selecting those dimensions that best contribute to improving the LSA assessment of summaries (Hu, Cai, Wiemer-Hastings, Graesser, {\&} McNamara, 2007); and (3) the Euclidean distance measure, used by Rehder et al. (1998), which incorporates at the same time vector length and the cosine measures. A total of 192 Spanish middle-grade students and 6 experts took part in this study. They read an expository text and produced a short summary. Results showed significantly higher reliability of LSA as a computerized assessment tool for expository text when it used a best-dimension algorithm rather than a standard LSA algorithm. The semantic common network algorithm also showed promising results.
This article presents a new database of 2,654 German nouns rated by a sample of 3,907 subjects on three psycholinguistic attributes: concreteness, valence, and arousal. As a new means of data collection in the field of psycholinguistic research, all ratings were obtained via the Internet, using a tailored Web application. Analysis of the obtained word norms showed good agreement with two existing norm sets. A cluster analysis revealed a plausible set of four classes of nouns: abstract concepts, aversive events, pleasant activities, and physical objects. In an additional application example, we demonstrate the usefulness of the database for creating parallel word lists whose elements match as closely as possible. The complete database is available for free from ftp://ftp.uni-duesseldorf.de/pub/psycho/lahl/WWN. Moreover, the Web application used for data collection is inherently capable of collecting word norms in any language and is going to be released for public use as well.
It has been demonstrated previously that, for some experimental paradigms, Web-based research can reliably replicate lab-based results. Yet questions remain as to what types of research can be reproduced, and where differences arise when they cannot be. The present article examines the effect of research location (laboratory vs. online) on normative data collection tasks. Specifically, participants were randomly assigned to a laboratory or online condition and were asked to rate 593 photorealistic images on the basis of object familiarity (N=103) and object visual complexity (N=98). Dependent measures were compared across location conditions, including response latencies and image rating agreement. Our results suggest that norming data collected online are reliable, but an interesting interplay between task type and research location was observed. Specifically, we found that participating online (i.e., a more familiar environment) leads to systematically higher familiarity ratings than in the lab (i.e., an unfamiliar environment). These differences are not found when the alternate complexity rating task is used.
Communication using icons is now commonplace. It is therefore important to understand the processes involved in icon comprehension and the stimulus cues that individuals utilize to facilitate identification. In this study, we examined predictors of icon identification as participants gained experience with icons over a series of learning trials. A dynamic pattern of findings emerged in which the primary predictors of identification changed as learning progressed. In early learning trials, semantic distance (the closeness of the relationship between icon and function) was the best predictor of performance, accounting for up to 55{\%} of the variance observed, whereas familiarity with the function was more important in later trials. Other stimulus characteristics, such as our familiarity with the graphic in the icon and its concreteness, were also found to be important for icon design. The theoretical implications of these findings are discussed, with particular emphasis on the parallels with picture naming. The icon identification norms from this study may be downloaded from brm.psychonomic-journals.org/content/supplemental.
Recent work in computational linguistics points out the need for systems to be sensitive to the veracity or factuality of events as mentioned in text; that is, to recognize whether events are presented as corresponding to actual situations in the world, situations that have not happened, or situations of uncertain interpretation. Event factuality is an important aspect of the representation of events in discourse, but the annotation of such information poses a representational challenge, largely because factuality is expressed through the interaction of numerous linguistic markers and constructions. Many of these markers are already encoded in existing corpora, albeit in a somewhat fragmented way. In this article, we present FactBank, a corpus annotated with information concerning the factuality of events. Its annotation has been carried out from a descriptive framework of factuality grounded on both theoretical findings and data analysis. FactBank is built on top of TimeBank, adding to it an additional level of semantic information.
The present methodological note provides categorization norms for 70 semantic categories collected from 200 participants. The categories were mainly derived from a French translation of the Van Overschelde, Rawson, and Dunlosky (2004) inductor terms including a large set of semantic categories. Our study also extends recent French norms (Léger, Boumlak, {\&} Tijus, 2008; Marchal {\&} Nicolas, 2003) and tests their stability. These 70 cat- egories constitute the widest French categorization norms to date and will be of use to studies in the fields of both linguistics and psycholinguistics. A cross-linguistic comparison, with both quantitative and qualitative results, is also performed, which should prove useful for bilingual and/or cross-linguistic studies. The norms collected for all 70 categories are available for download as supplemental materials from http://brm.psychonomic-journals$\backslash$n.org/content/supplemental.
The study presented here provides researchers with a revised list of affective German words, the Berlin Affective Word List Reloaded (BAWL-R). This work is an extension of the previously published BAWL (V{\~{o}}, Jacobs, {\&} Conrad, 2006), which has enabled researchers to investigate affective word processing with highly controlled stimulus material. The lack of arousal ratings, however, necessitated a revised version of the BAWL. We therefore present the BAWL-R, which is the first list that not only contains a large set of psycholinguistic indexes known to influence word processing, but also features ratings regarding emotional arousal, in addition to emotional valence and imageability. The BAWL-R is intended to help researchers create stimulus material for a wide range of experiments dealing with the affective processing of German verbal material.
Color is undeniably important to object representations, but so too is the ability of context to alter the color of an object. The present study examined how implied perceptual information about typical and atypical colors is represented during language comprehension. Participants read sentences that implied a (typical or atypical) color for a target object and then performed a modified Stroop task in which they named the ink color of the target word (typical, atypical, or unrelated). Results showed that color naming was facilitated both when ink color was typical for that object (e.g., bear in brown ink) and when it matched the color implied by the previous sentence (e.g., bear in white ink following Joe was excited to see a bear at the North Pole). These findings suggest that unusual contexts cause people to represent in parallel both typical and scenario-specific perceptual information, and these types of information are discussed in relation to the specialization of perceptual simulations.
Two sentences are paraphrases if their meanings are equivalent but their words and syntax are different. Paraphrasing can be used to aid comprehension, stimulate prior knowledge, and assist in writing-skills development. As such, paraphrasing is a feature of fields as diverse as discourse psychology, composition, and computer science. Although automated paraphrase assessment is both commonplace and useful, research has centered solely on artificial, edited paraphrases and has used only binary dimensions (i.e., is or is not a paraphrase). In this study, we use an extensive database (N = 1,998) of natural paraphrases generated by high school students that have been assessed along 10 dimensions (e.g., semantic completeness, lexical similarity, syntactical similarity). This study investigates the components of paraphrase quality emerging from these dimensions and examines whether computational approaches can simulate those human evaluations. The results suggest that semantic and syntactic evaluations are the primary components of paraphrase quality, and that computationally light systems such as latent semantic analysis (semantics) and minimal edit distances (syntax) present promising approaches to simulating human evaluations of paraphrases. (PsycINFO Database Record (c) 2012 APA, all rights reserved). (journal abstract)
The present study provides a French child database containing a large corpus of words (N = 600) that were rated on emotional valence (positive, neutral, and negative) by French children differing in both age (5, 7, and 9 years old) and sex (girls and boys). Good response reliability was observed in each of the three age groups. The results showed some age differences in the children's ratings. With increasing age, the percentage of words rated positive decreased, whereas the percentage of neutral words increased and the percentage of negative words remained stable. Our study did not reveal marked differences across sex groups. The database compiled here should become a useful tool for experimental studies in which verbal material is used with children. It would be worthwhile in future research to study how children process emotional words and also to control the emotional variable in the same way as other linguistic variables in the experimental design. The norms from this study may be downloaded from brm.psychonomic-journals.org/content/supplemental.
This article accompanies the archiving by the Pychonomic Society of the Toglia and Battig (1978) semantic word norms. Herein are outlined the various phases of the project, as well as the challenges that were faced in staying the course during the labor-intensive development of the norms. An examination of the number of citations of this set of norms over the years demonstrates a stable employment of these norms by investigators in many fields. Indeed, a concluding section details the wide range of research topics that have been studied with the use of this extensive set of word ratings. The complete Toglia and Battig article and norms may be downloaded as supplemental materials for this article from brm.psychonomic-journals.org/content/supplemental.
Lexical co-occurrence models of semantic memory represent word meaning by vectors in a high-dimensional space. These vectors are derived from word usage, as found in a large corpus of written text. Typically, these models are fully automated, an advantage over models that represent semantics that are based on human judgments (e.g., feature-based models). A common criticism of co-occurrence models is that the representations are not grounded: Concepts exist only relative to each other in the space produced by the model. It has been claimed that feature-based models offer an advantage in this regard. In this article, we take a step toward grounding a cooccurrence model. A feed-forward neural network is trained using back propagation to provide a mapping from co-occurrence vectors to feature norms collected from subjects. We show that this network is able to retrieve the features of a concept from its co-occurrence vector with high accuracy and is able to generalize this ability to produce an appropriate list of features from the co-occurrence vector of a novel concept.
Orthographic transparency metrics for opaque or deep languages, such as French and English, have tended to focus on feedforward and/or feedback directions, with claims made for the influence of both on reading. In the present study, data for five transparency metrics for southern British English, three of which are neither feedforward nor feedback, are presented, demonstrating the complex relationships between the metrics and offering an explanation for feedback effects in children's reading accuracy. The structure of such metrics from a variety of corpus sizes and origins is investigated, and it is concluded that large corpus sizes do not make a substantial contribution to the value of such metrics, when compared with smaller samples, and that adult and child corpuses have very similar profiles. Probabilities of occurrence for the phonemes, graphemes, and sonographs in this study may be downloaded from brm.psychonomic-journals.org/content/supplemental.
The internal validity of several types of experiments in experimental psychology and neuroscience depends in part on the possibility of controlling or manipulating critical lexical variables such as word frequency of occurrence. Two ways of estimating this variable are (1) objective frequency counts and (2) subjective ratings of word frequency. Each method produces estimates that generally agree (i.e., they are highly correlated) but that disagree substantially concerning the relative frequency of a number of words. To investigate this issue more closely, the global and local agreement of subjective frequency estimates was examined in detail for a pool of 6,202 words drawn from the OMNILEX database of French words (Desrochers, 2006; www.omnilex.uottawa.ca). The results indicated that objective and subjective frequencies are strongly correlated, subjective frequencies share a significant amount of bias variance with other lexical characteristics (e.g., imageability), and the codeterminants of subjective frequency are in an antagonistic relationship with one another. The implications of these results for the selection of lexical stimuli are discussed, and multiple variables to aid in item selection are reported. Supplemental materials for this study may be downloaded from brm.psychonomic-journals.org/content/supplemental.
A Web-based database is developed to provide psycholinguists with a large-scale phonological representation system for all Mandarin Chinese monosyllables. The construction of the system is based on the slot-based phonological pattern generator (PatPho), with an adequate consideration of the language-specific features of the Chinese phonology. Users can retrieve the relevant phonological representations through an interactive query system on the Web. The query outcomes can be saved in a number of formats, such as Excel spreadsheets, for further analyses. This representation system can be used for a variety of purposes--in particular, connectionist language modeling and, more generally, the study of Chinese phonology.
In this article, we present 84 nonobjects we created by using the colored object pictures from Rossion and Pourtois (2004). These nonobjects were explored on a number of measures, including object resemblance, visual complexity, and an object decision task (ODT). Object resemblance for nonobjects is a construct comparable to the "word-likeness" of phonotactically legal pseudowords. The nonobjects were rated as possible objects, showing similarity to real objects. Visual complexity ratings for objects and nonobjects were comparable. In the ODT, response times (RTs) were significantly longer for nonobjects than for real-object pictures. This RT difference is analogous to the word advantage, or lexicality effect, found in lexical decision tasks, in which responses for words are generally faster than those for nonwords. This nonobject set is freely available and has the advantage of having a companion set of real-object pictures. The nonobjects are available in color and in grayscale from brm.psychonomic-journals.org/content/supplemental.
Orthographic transparency refers to the systematicity in the mapping between orthographic letter sequences and phonological phoneme sequences in both directions, for reading and spelling. Measures of transparency previously used in the analysis of orthographies of other languages include regularity, consistency, and entropy. However, previous reports have typically been hampered by severe restrictions, such as using only monosyllables or only word-initial phonemes. Greek is sufficiently transparent to allow complete sequential alignment between graphemes and phonemes, therefore permitting full analyses at both letter and grapheme levels, using every word in its entirety. Here, we report multiple alternative measures of transparency, using both type and token counts, and compare these with estimates for other languages. We discuss the problems stemming from restricted analysis sets and the implications for psycholinguistic experimentation and computational modeling of reading and spelling.
A strong body of work has explored the interaction between visual perception and language comprehension; for example, recent studies exploring predictions from embodied cognition have focused particularly on the common representation of sensory-motor and semantic information. Motivated by this background, we provide a set of norms for the axis and direction of motion implied in 299 English verbs, collected from approximately 100 native speakers of British English. Until now, there have been no freely available norms of this kind for a large set of verbs that can be used in any area of language research investigating the semantic representation of motion. We have used these norms to investigate the interaction between language comprehension and low-level visual processes involved in motion perception, validating the norming procedure's ability to capture the motion content of individual verbs. Supplemental materials for this study may be downloaded from brm.psychonomic-journals.org/content/supplemental.
The Corpus of Contemporary American English ( COCA ), which was released online in early 2008, is the first large and diverse corpus of American English. In this paper, we first discuss the design of the corpus — which contains more than 385 million words from 1990–2008 (20 million words each year), balanced between spoken, fiction, popular magazines, newspapers, and academic journals. We also discuss the unique relational databases architecture, which allows for a wide range of queries that are not available (or are quite difficult) with other architectures and interfaces. To conclude, we consider insights from the corpus on a number of cases of genre-based variation and recent linguistic variation, including an extended analysis of phrasal verbs in contemporary American English.
Frequency, familiarity, and age of acquisition are important factors for word recognition that must be considered by researchers of language acquisition. Current psycholinguistic databases, based on studies of native English speakers, include objective frequency count, subjective rating of familiarity, and age of acquisition. One can easily employ those databases to obtain a stimulus list for one's studies. For word recognition researchers interested in non-native English speakers in of Taiwan, however, there is currently no existing database. In this study, we created a psycholinguistic database which includes subjective familiarity rating and age of acquisition for 3,080 English words. Participants were 120 college students in Taiwan. They were asked to make judgments about 4,000 stimulus words. For recognized stimulus words, participants gave a rating of familiarity and self-report of age of acquisition; for non-recognized words, participants were asked to move on to the next stimulus word. Further analysis of the database showed that familiarity index, age of acquisition, and number of syllables are important factors for the recognition of a word. Variance in word recognition ratings for each factor was explained and implications were discussed.
This paper is a contribution to the discussion on compiling computational lexical resources from conventional dictionaries. It describes the theoretical as well as practical problems that are encountered when reusing a conventional dictionary for compiling a lexical-semantic resource in terms of a wordnet. More specifically, it describes the methodological issues of compiling a wordnet for Danish, DanNet, from a monolingual basis, and not—as is often seen—by applying the translational expansion method with Princeton WordNet as the English source. Thus, we apply as our basis a large, corpus-based printed dictionary of modern Danish. Using this approach, we discuss the issues of readjusting inconsistent and/or underspecified hyponymy hierarchies taken from the conventional dictionary, sense distinctions as opposed to the synonym sets of wordnets, generating semantic wordnet relations on the basis of sense definitions, and finally, supplementing missing or implicit information.
The strength-sampling model of free association (Nelson, McEvoy, {\&} Dennis, 2000) claims that the probability of word association in free-association norms results from a sampling process. For a given cue word, each response word has an underlying distribution of strength values. In the free-association task, presentation of the cue word activates a random sample of strengths, one for each response. The highest strength wins, and its response is reported. In the present work, gradient descent was used to compute the theoretical mean strengths for each cue-response pair in the Nelson, McEvoy, and Schreiber (2004) norms. The resulting database may be downloaded from www.psychonomic.org/archive/.
Ratings for age of acquisition (AoA) and subjective frequency were collected for the 1,493 monosyllabic French words that were most known to French students. AoA ratings were collected by asking participants to estimate in years the age at which they learned each word. Subjective frequency ratings were collected on a 7-point scale, ranging from never encountered to encountered several times daily. The results were analyzed to address the relationship between AoA and subjective frequency ratings with other psycholinguistic variables (objective frequency, imageability, number of letters, and number of orthographic neighbors). The results showed high reliability ratings with other databases. Supplementary materials for this study may be downloaded from the Psychonomic Society's Archive of Norms, Stimuli, and Data, www.psychonomic.org/archive.
A list of role names for future use in research on gender stereotyping was created and evaluated. In two stud- ies, 126 role names were rated with reference to their gender stereotypicality by English-, French-, and German- speaking students of universities in Switzerland (French and German) and in the U.K. (English). Role names were either presented in specific feminine and masculine forms (Study 1) or in the masculine form (generic masculine) only (Study 2). The rankings of the stereotypicality ratings were highly reliable across languages and question- naire versions, but the overall mean of the ratings was less strongly male if participants were also presented with the female versions of the role names and if the latter were presented on the left side of the questionnaires.
The general aim of this study is to validate the cognitive relevance of the geometric model used in the semantic atlases (SA). With this goal in mind, we compare the results obtained by the automatic contexonym organizing model (ACOM)--an SA-derived model for word sense representation based on contextual links--with human subjects' responses on a word association task. We begin by positioning the geometric paradigm with respect to the hierarchical paradigm (WordNet) and the vector paradigm (latent semantic analysis [LSA] and the hyperspace analogue to language model). Then we compare ACOM's responses with Hirsh and Tree's (2001) word association norms based on the responses of two groups of subjects. The results showed that words associated by 50{\%} or more of the Hirsh and Tree subjects were also proposed by ACOM (e.g., 71{\%} of the words in the norms were also given by ACOM). Finally, we compare ACOM and LSA on the basis of the same association norms. The results indicate better performance for the geometric model.
Four experiments were conducted to assess two models of topic sentencehood identification: the derived model and the free model. According to the derived model, topic sentences are identified in the context of the paragraph and in terms of how well each sentence in the paragraph captures the paragraph's theme. In contrast, according to the free model, topic sentences can be identified on the basis of sentential features without reference to other sentences in the paragraph (i.e., without context). The results of the experiments suggest that human raters can identify topic sentences both with and without the context of the other sentences in the paragraph. Another goal of this study was to develop computational measures that approximated each of these models. When computational versions were assessed, the results for the free model were promising; however, the derived model results were poor. These results collectively imply that humans' identification of topic sentences in context may rely more heavily on sentential features than on the relationships between sentences in a paragraph.
This study examined the relationship between language experience and false memory produced by the DRM paradigm. The word lists used in Stadler, et al. (Memory {\&} Cognition, 27, 494-500, 1999) were first translated into Chinese. False recall and false recognition for critical non-presented targets were then tested on a group of Chinese users. The average co-occurrence rate of the list word and the critical word was calculated based on two large Chinese corpuses. List-level analyses revealed that the correlation between the American and Taiwanese participants was significant only in false recognition. More importantly, the co-occurrence rate was significantly correlated with false recall and recognition of Taiwanese participants, and not of American participants. In addition, the backward association strength based on Nelson et al. (The University of South Florida word association, rhyme and word fragment norms, 1999) was significantly correlated with false recall of American participants and not of Taiwanese participants. Results are discussed in terms of the relationship between language experiences and lexical association in creating false memory for word lists.
This article has two primary aims. The first is to introduce a new Vietnamese text-based corpus. The Corpora of Vietnamese Texts (CVT; Tang, 2006a) consists of approximately 1 million words drawn from newspapers and children's literature, and is available online at www.vnspeechtherapy.com/vi/CVT. The second aim is to investigate potential differences in lexical frequency and distributional characteristics in the CVT on the basis of place of publication (Vietnam or Western countries) and intended audience: adult-directed texts (newspapers) or child-directed texts (children's literature). We found clear differences between adult- and child-directed texts, particularly in the distributional frequencies of pronouns or kinship terms, which were more frequent in children's literature. Within child- and adult-directed texts, lexical characteristics did not differ on the basis of place of publication. Implications of these findings for future research are discussed.
Semantic features have provided insight into numerous behavioral phenomena concerning concepts, categorization, and semantic memory in adults, children, and neuropsychological populations. Numerous theories and models in these areas are based on representations and computations involving semantic features. Consequently, empirically derived semantic feature production norms have played, and continue to play, a highly useful role in these domains. This article describes a set of feature norms collected from approximately 725 participants for 541 living (dog) and nonliving (chair) basic-level concepts, the largest such set of norms developed to date. This article describes the norms and numerous statistics associated with them. Our aim is to make these norms available to facilitate other research, while obviating the need to repeat the labor-intensive methods involved in collecting and analyzing such norms. The full set of norms may be downloaded from www.psychonomic.org/archive.
This paper concerns metaphor resource creation. It provides an account of methods used, problems discovered, and insights gained at the Hamburg Metaphor Database project, intended to inform similar resource creation initiatives, as well as future metaphor processing algorithms. After introducing the project, the theoretical underpinnings that motivate the subdivision of represented information into a conceptual and a lexical level are laid out. The acquisition of metaphor attestations from electronic corpora is explained, and annotation practices as well as database contents are evaluated. The paper concludes with an overview of related projects and an outline of possible future work.
Although taboo words are used to study emotional memory and attention, no easily accessible normative data are available that compare taboo, emotionally valenced, and emotionally neutral words on the same scales. Frequency, inappropriateness, valence, arousal, and imageability ratings for taboo, emotionally valenced, and emotionally neutral words were made by 78 native-English-speaking college students from a large metropolitan university. The valenced set comprised both positive and negative words, and the emotionally neutral set comprised category-related and category-unrelated words. To account for influences of demand characteristics and personality factors on the ratings, frequency and inappropriateness measures were decomposed into raters' personal reactions to the words versus raters' perceptions of societal reactions to the words (personal use vs. familiarity and offensiveness vs. tabooness, respectively). Although all word sets were rated higher in familiarity and tabooness than in personal use and offensiveness, these differences were most pronounced for the taboo set. In terms of valence, the taboo set was most similar to the negative set, although it yielded higher arousal ratings than did either valenced set. Imageability for the taboo set was comparable to that of both valenced sets. The ratings of each word are presented for all participants as well as for single-sex groups. The inadequacies of the application of normative data to research that uses emotional words and the conceptualization of taboo words as a coherent category are discussed. Materials associated with this article may be accessed at the Psychonomic Society's Archive of Norms, Stimuli, and Data, www.psychonomic.org/archive.
In this paper, we present the building of various language resources for a multi-engine bi-directional English-Filipino Machine Translation (MT) system. Since linguistics information on Philippine languages are available, but as of yet, the focus has been on theoretical linguistics and little is done on the computational aspects of these languages, attempts are reported here on the manual construction of these language resources such as the grammar, lexicon, morphological information, and the corpora which were literally built from almost non-existent digital forms. Due to the inherent difficulties of manual construction, we also discuss our experiments on various technologies for automatic extraction of these resources to handle the intricacies of the Filipino language, designed with the intention of using them for the MT system. To implement the different MT engines and to ensure the improvement of translation quality, other language tools (such as the morphological analyzer and generator, and the part of speech tagger) were developed.
We describe the Age-Dependent Evaluations of German Adjectives (AGE). This database contains ratings for 200 German adjectives by young and older adults (general word-rating study) and graduate students (self-other relevance study). Words were rated on emotion-relevant (valence, arousal, and control) and memory-relevant (imagery) characteristics. In addition, adjectives were evaluated for self-relevance (Does this attribute describe you?), age relevance (Is this attribute typical for young or for older adults?), and self-other relevance (Is this attribute more relevant for the possessor or for other persons?). These ratings are included in the AGE database as a resource tool for experiments on word material. Our comparisons of young and older adults' evaluations revealed similarities but also significant mean-level differences for a large number of adjectives, especially on the valence dimension. This highlights the importance of age in the perception of emotional words. Data for all the words are archived at www.psychonomic.org/archive/.
Complexity is conventionally defined as the level of detail or intricacy contained within a picture. The study of complexity has received relatively little attention-in part, because of the absence of an acceptable metric. Traditionally, normative ratings of complexity have been based on human judgments. However, this study demonstrates that published norms for visual complexity are biased. Familiarity and learning influence the subjective complexity scores for nonsense shapes, with a significant training x familiarity interaction [F(1,52) = 17.53, p {\textless} .05]. Several image-processing techniques were explored as alternative measures of picture and image complexity. A perimeter detection measure correlates strongly with human judgments of the complexity of line drawings of real-world objects and nonsense shapes and captures some of the processes important in judgments of subjective complexity, while removing the bias due to familiarity effects.
This study presents Portuguese category norms for children of three different age groups: preschoolers (3- to 4-year-olds), second graders (7- to 8-year-olds), and preadolescents (11- to 12-year-olds). Three hundred Portuguese children (100 in each group) completed an exemplar-generation task. Preschoolers generated exemplars for 13 categories, second graders generated exemplars for 17 categories, and preadolescents generated exemplars for 21 categories. For each group, responses within each category were organized according to frequency of production in order to derive exemplar-production norms for sets of tested categories. The results also included information about the number of responses and exemplars, idiosyncratic and inappropriate responses, and commonality and diversity indexes for all the categories. A comparison of these children's norms with the Portuguese adult norms was also presented. The full set of norms may be downloaded from www.psychonomic.org/archive.
The goal of this study was to test a new technique for assessing vocabulary development. This technique is based on an algorithm for scoring the accuracy of word definitions using a continuous scale (Collins-Thompson {\&} Callan, 2007). In an experiment with adult learners, target words were presented in six different sentence contexts, and the number of informative versus misleading contexts was systematically manipulated. Participants generated a target definition after each sentence, and the definition-scoring algorithm was used to assess the degree of accuracy on each trial. We observed incremental improvements in definition accuracy across trials. Moreover, learning curves were sensitive to the proportion of misleading contexts, the use of spaced versus massed practice, and individual differences, demonstrating the utility of this procedure for capturing specific experimental effects on the trajectory of word learning. We discuss the implications of these results for measurement of meaning, vocabulary assessment, and instructional design.
In picture-naming tasks, participants name a picture as quickly as possible. In several studies, when the participant did not provide the picture name in the first seconds after object presentation, the examiner provided phonemic or semantic cues. Under these conditions, word retrieval should be easier, thus lowering the age of acquisition (AoA). The goal of the present study was to collect objective norms of AoA in French without any kind of cue. The results were then compared with other European databases that relied on picture-naming tasks conducted with phonemic or semantic cues. Globally, the data of all the databases are significantly correlated. However, the AoA measures in these databases are always lower than in our study, except in Alvarez and Cuetos (2007), who did not provide any assistance to the participant. Therefore, giving phonemic and/or semantic cues lowers the AoA values, indicating that the values from different databases in this domain should be taken with caution. The objective AoA norms from this study may be downloaded from the Psychonomic Society's Archive of Norms, Stimuli, and Data, www.psychonomic.org/archive.
We describe a suite of standards, resources and tools for computational encoding and processing of Modern Hebrew texts. These include an array of XML schemas for representing linguistic resources; a variety of text corpora, raw, automatically processed and manually annotated; lexical databases, including a broad-coverage monolingual lexicon, a bilingual dictionary and a WordNet; and morphological processors which can analyze, generate and disambiguate Hebrew word forms. The resources are developed under centralized supervision, so that they are compatible with each other. They are freely available and many of them have already been used for several applications, both academic and industrial.
Although many individual speech contrasts pairs have been studied within the cross-language literature, no one has created a comprehensive and systematic set of such stimuli. This article justifies and details an extensive set of contrast pairs for Mandarin Chinese and American English. The stimuli consist of 180 pairs of CVC syllables recorded in two tokens each (720 syllables total). Between each CVC pair, two of the segments are identical, whereas the third differs in that a segment drawn from a "native" phonetic category (either Mandarin, English, or both) is partnered with a segment drawn from a "foreign" phonetic category (nonnative to Mandarin, English, or both). Each contrast pair differs by a minimal phonetic amount and constitutes a meaningful contrast among the world's languages (as cataloged in the UCLA Phonological Segment Inventory Database of 451 languages). The entire collection of phonetic differences envelops Mandarin and English phonetic spaces and generates a range of phonetic discriminability. Contrastive segments are balanced through all possible syllable positions, with noncontrastive segments being filled in with other "foreign" segments. Although intended to measure phonetic perceptual sensitivity among adult speakers of the two languages, these stimuli are offered here to all for similar or for altogether unrelated investigations.
Lexical co-occurrence models of semantic memory form representations of the meaning of a word on the basis of the number of times that pairs of words occur near one another in a large body of text. These models offer a distinct advantage over models that require the collection of a large number of judgments from human subjects, since the construction of the representations can be completely automated. Unfortunately, word frequency, a well-known predictor of reaction time in several cognitive tasks, has a strong effect on the co-occurrence counts in a corpus. Two words with high frequency are more likely to occur together purely by chance than are two words that occur very infrequently. In this article, we examine a modification of a successful method for constructing semantic representations from lexical co-occurrence. We show that our new method eliminates the influence of frequency, while still capturing the semantic characteristics of words.
The Character-Component Analysis Toolkit (C-CAT) software was designed to assist researchers in constructing experimental materials using traditional Chinese characters. The software package contains two sets of character stocks: one suitable for research using literate adults as subjects and one suitable for research using schoolchildren as subjects. The software can identify linguistic properties, such as the number of strokes contained, the character-component pronunciation regularity, and the arrangement of character components within a character. Moreover, it can compute a character's linguistic frequency, neighborhood size, and phonetic validity with respect to a user-selected character stock. It can also search the selected character stock for similar characters or for character components with user-specified linguistic properties.
This article present the Spanish assessments of the 111 sounds included in the International Affective Digitized Sounds (IADS; Bradley {\&} Lang, 1999b). The sounds were evaluated by 159 participants in the dimensions of valence, arousal, and dominance, using a computer version of the Self-Assessment Manikin (Bradley {\&} Lang, 1994). Results are compared with those obtained in the American version of the IADS, as well as in the Spanish adaptations of the International Affective Picture System (P. J. Lang, Bradley, {\&} Cuthbert, 1999; Molt{\'{o}} et al., 1999) and the Affective Norms for English Words (Bradley {\&} Lang, 1999a; Redondo, Fraga, Padr{\'{o}}n, {\&} Comesa{\~{n}}a, 2007).
The rapid development of language resources and tools using machine learning techniques for less computerized languages requires appropriately tagged corpus. A tagged Bengali news corpus has been developed from the web archive of a widely read Bengali newspaper. A web crawler retrieves the web pages in Hyper Text Markup Language (HTML) format from the news archive. At present, the corpus contains approximately 34 million wordforms. Named Entity Recognition (NER) systems based on pattern based shallow parsing with or without using linguistic knowledge have been developed using a part of this corpus. The NER system that uses linguistic knowledge has performed better yielding highest F-Score values of 75.40%, 72.30%, 71.37%, and 70.13% for person, location, organization, and miscellaneous names, respectively.
Early vocabulary development is a reliable predictor of children's later language skills. The MacArthur-Bates Communicative Development Inventory (CDI) has provided a powerful tool to assess earlyvocabulary development in English and other languages. However, there have been no published CDI norms for Mandarin Chinese. Given the importance of large-scale comparative data sets for understanding the early childhood lexicon, we have developed an early vocabulary inventory for Mandarin. In this article, we report our efforts in developing this instrument, and discuss the data collected from 884 Chinese families in Beijing over a period of 12-30 months, based on our instrument. Chinese children's receptive and expressive lexicons as assessed by our inventory match well with those reported for English on the basis of CDI. In particular, our data indicate comprehension-production differences, individual differences in early comprehension and in later production, and different lexical development profiles among infants versus toddlers. We also make the checklists and norms of our inventory available to the research community via the Internet; they may be accessed from the Psychonomic Society's Archive of Norms, Stimuli, and Data, at www.psychonomic.org/archive.
Participants judged which of seven facial expressions (neutrality, happiness, anger, sadness, surprise, fear, and disgust) were displayed by a set of 280 faces corresponding to 20 female and 20 male models of the Karolinska Directed Emotional Faces database (Lundqvist, Flykt, {\&} {\"{O}}hman, 1998). Each face was presented under free-viewing conditions (to 63 participants) and also for 25, SO, 100, 250, and 500 msec (to 160 participants), to examine identification thresholds. Measures of identification accuracy, types of errors, and reaction times were obtained for each expression. In general, happy faces were identified more accurately, earlier, and faster than other faces, whereas judgments of fearful faces were the least accurate, the latest, and the slowest. Norms for each face and expression regarding level of identification accuracy, errors, and reaction times may be downloaded from www.psychonomic.org/archive/.
Phylogenetic methods have revolutionised evolutionary biology and have recently been applied to studies of linguistic and cultural evolution. However, the basic comparative data on the languages of the world required for these analyses is often widely dispersed in hard to obtain sources. Here we outline how our Austronesian Basic Vocabulary Database (ABVD) helps remedy this situation by collating wordlists from over 500 languages into one web-accessible database. We describe the technology underlying the ABVD and discuss the benefits that an evolutionary bioinformatic approach can provide. These include facilitating computational comparative linguistic research, answering questions about human prehistory, enabling syntheses with genetic data, and safe-guarding fragile linguistic information.
Although there are many well-characterized affective visual stimuli sets available to researchers, there are few auditory sets available. Those auditory sets that are available have been characterized primarily according to one of two major theories of affect: dimensional or categorical. Current trends have attempted to utilize both theories to more fully understand emotional processing. As such, stimuli that have been thoroughly characterized according to both of these approaches are exceptionally useful. In an effort to provide researchers with such a stimuli set, we collected descriptive data on the International Affective Digitized Sounds (IADS), identifying which discrete categorical emotions are elicited by each sound. The IADS is a database of 111 sounds characterized along the affective dimensions of valence, arousal, and dominance. Our data complement these characterizations of the IADS, allowing researchers to control for or manipulate stimulus properties in accordance with both theories of affect, providing an avenue for further integration of these perspectives. Related materials may be downloaded from the Psychonomic Society Web archive at www.psychonomic.org/archive.
A set of 105 photographs of celebrities has been standardized in French on distinctiveness, proper name agreement, face agreement, age of acquisition (AoA), and subjective frequency. Statistics on the collected variables for photographs are provided. The relationships between these variables have been analyzed. Face naming latencies have also been collected for the photographs of celebrities, and several multiple regression analyses have been carried out on naming latencies and percentages of tip-of-the-tongue (TOT) phenomena. Themain determinants of naming speed included AoA, face agreement, and name agreement. In addition, AoA, together with distinctiveness and face agreement, reliably predicted the percentages of TOTs. The norms, photographs of the celebrities, and spoken naming latencies corresponding to the celebrities are available on the Internet at and should be of great use to researchers interested in the processing of famous people.
Since emotions are expressed through a combination of verbal and non-verbal channels, a joint analysis of speech and gestures is required to understand expressive human communication. To facilitate such investigations, this paper describes a new corpus named the “interactive emotional dyadic motion capture database” (IEMOCAP), collected by the Speech Analysis and Interpretation Laboratory (SAIL) at the University of Southern California (USC). This database was recorded from ten actors in dyadic sessions with markers on the face, head, and hands, which provide detailed information about their facial expressions and hand movements during scripted and spontaneous spoken communication scenarios. The actors performed selected emotional scripts and also improvised hypothetical scenarios designed to elicit specific types of emotions (happiness, anger, sadness, frustration and neutral state). The corpus contains approximately 12 h of data. The detailed motion capture information, the interactive setting to elicit authentic emotions, and the size of the database make this corpus a valuable addition to the existing databases in the community for the study and modeling of multimodal and expressive human communication.
Body-object interaction (BOI) assesses the ease with which a human body can physically interact with a word's referent. Recent research has shown that BOI influences visual word recognition processes in such a way that responses to high-BOI words (e.g., couch) are faster and less error prone than responses to low-BOI words (e.g., cliff). Importantly, the high-BOI words and the low-BOI words that were used in those studies were matched on imageability. In the present study, we collected BOI ratings for a large set of words. BOI ratings, on a 1-7 scale, were obtained for 1,618 monosyllabic nouns. These ratings allowed us to test the generalizability of BOI effects to a large set of items, and they should be useful to researchers who are interested in manipulating or controlling for the effects of BOI. The body-object interaction ratings for this study may be downloaded from the Psychonomic Society's Archive of Norms, Stimuli, and Data, www.psychonomic.org/archive.
In this paper we describe the current state of a new Japanese lexical resource: the Hinoki treebank. The treebank is built from dictionary definitions, examples and news text, and uses an HPSG based Japanese grammar to encode both syntactic and semantic information. It is combined with an ontology based on the definition sentences to give a detailed sense level description of the most familiar 28,000 words of Japanese.
Age of acquisition (AoA) ratings made on a 1–7 scale for 3,000 monosyllabic words were obtained from 32 participants across four blocks of 750 trials (two blocks of 750 trials were completed in each of 2 days). These results, as well as those of the regression analyses and reliability and validity measures that were originally reported in Cortese and Khanna (2007), are summarized here. Here, we also report high interblock correlations across items, indicating that participants were consistent in their ratings across blocks. The norms for the 3,000 words are important for researchers interested in word processing and may be downloaded from the Psycho- nomic Society's Norms, Stimuli, and Data archive at www.psychonomic.org/archive.
Features are at the core of many empirical and modeling endeavors in the study of semantic concepts. This article is concerned with the delineation of features that are important in natural language concepts and the use of these features in the study of semantic concept representation. The results of a feature generation task in which the exemplars and labels of 15 semantic categories served as cues are described. The importance of the generated features was assessed by tallying the frequency with which they were generated and by obtaining judgments of their relevance. The generated attributes also featured in extensive exemplar by feature applicability matrices covering the 15 different categories, as well as two large semantic domains (that of animals and artifacts). For all exemplars of the 15 semantic categories, typicality ratings, goodness ratings, goodness rank order, generation frequency, exemplar associative strength, category associative strength, estimated age of acquisition, word frequency, familiarity ratings, imageability ratings, and pairwise similarity ratings are described as well. By making these data easily available to other researchers in the field, we hope to provide ample opportunities for continued investigations into the nature of semantic concept representation. These data may be downloaded from the Psychonomic Society's Archive of Norms, Stimuli, and Data, www.psychonomic.org/archive.
Associative norms for homographs have been widely used in the study of language processing. A number of sets of these are available, providing the investigator with the opportunity to compare materials collected over a span of years and a range of locations. Words that are homophonic but not homographic have been used to address a variety of questions in memory as well as in language processing. However, a paucity of normative data are available for these materials, especially with respect to responses to the spoken form of the homophone. This article provides such data for a sample of 207 homophones across four different tasks, both visual and auditory, and examines how well the present measures correlate with each other and with those of other investigators. The finding that these measures can account for a considerable proportion of the variance in the lexical decision and naming data from the English Lexicon Project provides an additional demonstration of their utility. The norms from this study are available online in the Psychonomic Society Archive of Norms, Stimuli, and Data, at www .psychonomic.org/archive.
In this article, we present a database of orthographic neighbors for words that Spanish children read during elementary education. The reference dictionary for lexical entries and frequencies (which had its origin in Mart{\'{i}}nez {\&} Garc{\'{i}}a, 2004) comprises approximately 100,000 words and is the result of accumulating the words read by a sample of children from first to sixth grades. Using the criterion for orthographic neighbors described by Coltheart, Davelaar, Jonasson, and Besner (1977), we present basic statistics related to neighborhood size as a function of the positions of divergent letters, the cumulative frequency of the neighbors, and the numbers of neighbors of higher, lower, and equal frequency. We also attempt to illustrate and unravel the nature of the relationships among the variables neighborhood size, length, and frequency in the distribution of neighbors. The database described in this article is available at www.psychonomic.org/archive.
Research on signed languages offers the opportunity to address many important questions about language that it may not be possible to address via studies of spoken languages alone. Many such studies, however, are inherently limited, because there exist hardly any norms for lexical variables that have appeared to play important roles in spoken language processing. Here, we present a set of norms for age of acquisition, familiarity, and iconicity for 300 British Sign Language (BSL) signs, as rated by deaf signers, in the hope that they may prove useful to other researchers studying BSL and other signed languages. These norms may be downloaded from www.psychonomic.org/archive.
This study provides normative data on the implicit causality of interpersonal verbs in Spanish. Two experiments were carried out. In Experiment 1, ratings of the implicit causality of 100 verbs classified into four types (agent-patient, agent-evocator, stimulus-experiencer, and experiencer-stimulus) were examined. An offline task was used in which 105 adults and 163 children had to complete sentences containing one verb. Both age and gender effects in the causal biases were examined. In Experiment 2, reading times for sentences containing 60 verbs were analyzed. An online reading task was used in which 34 adults had to read sentences that were both congruent and incongruent with the implicit causality of the verb. The results support the effect of implicit causality in both adults and children, and they support the taxonomy used.
This study describes the collection of a large set of word association norms. In a continuous word association task, norms for 1,424 Dutch words were gathered. For each cue, three association responses were obtained per participant. In total, an average of 268 responses were collected for each cue. We investigated the relationship with similar procedures, such as discrete association tasks and exemplar generation tasks. The results show that the use of a continuous task allows the study of weaker associations in comparison with a discrete task. The effects of the continuous tasks were investigated for set size and the availability characteristics of the responses, measured through word frequency, age of acquisition, and imageability. Finally, we compared our findings to those of a semantically constrained version of the association task in which participants generated responses within the domain of a semantic category. Results of this comparison are discussed. The Appendix cited in this article is available at www.psychonomic.org/archive.
The International Affective Picture System (IAPS) has been widely used in aging-oriented research on emotion. However, no ratings for older adults are available. The aim of the present study was to close this gap by providing ratings of valence and arousal for 504 IAPS pictures by 53 young and 53 older adults. Both age groups rated positive pictures as less arousing, resulting in a stronger linear association between valence and arousal, than has been found in previous studies. This association was even stronger in older than in young adults. Older adults perceived negative pictures as more negative and more arousing and positive pictures as more positive and less arousing than young adults did. This might indicate a dedifferentiation of emotional processing in old age. On the basis of a picture recognition task, we also report memorability scores for individual pictures and how they relate to valence and arousal ratings. Data for all the pictures are archived at www.psychonomic.org/archive/.
To conduct experimental investigations into the orthographic processing of Modern Greek, information is needed about the lexical properties known to influence visual word recognition. In this article we introduce GreekLex, a lexical database for Modern Greek, which presents collectively for the first time a series of orthographic measures that can be used for psycholinguistic research. GreekLex consists of 35,304 Modern Greek words ranging in length from 1 to 22 letters, and for each word includes the following statistical information: word length, word-form frequency, lemma frequency, neighborhood density and frequency, transposition neighbors, and addition and deletion neighbors. Furthermore, type and token frequency measures of single letters and bigrams derived from the database are also available. The complete database can be accessed and downloaded freely from www.psychology.nottingham.ac.uk/GreekLex.
Lexical classifications have proved useful in supporting various natural language processing (NLP) tasks. The largest verb classification for English is Levin’s (1993) work which defines groupings of verbs based on syntactic and semantic properties. VerbNet (VN) (Kipper et al. 2000; Kipper-Schuler 2005)—an extensive computational verb lexicon for English—provides detailed syntactic-semantic descriptions of Levin classes. While the classes included are extensive enough for some NLP use, they are not comprehensive. Korhonen and Briscoe (2004) have proposed a significant extension of Levin’s classification which incorporates 57 novel classes for verbs not covered (comprehensively) by Levin. Korhonen and Ryant (unpublished) have recently proposed another extension including 53 additional classes. This article describes the integration of these two extensions into VN. The result is a comprehensive Levin-style classification for English verbs providing over 90% token coverage of the Proposition Bank data (Palmer et al. 2005) and thus can be highly useful for practical applications.
As part of a project to construct an interactive program which would encourage children to play with language by building jokes, we developed a lexical database, starting from WordNet. To the existing information about part of speech, synonymy, hyponymy, etc., we have added phonetic representations and phonetic similarity ratings for pairs of words/phrases. [ABSTRACT FROM AUTHOR], Copyright of Language Resources & Evaluation is the property of Springer Nature and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
In this paper we present an annotated audio–video corpus of multi-party meetings. The multimodal corpus provides for each subject involved in the experimental sessions six annotation dimensions referring to group dynamics; speech activity and body activity. The corpus is based on 11 audio and video recorded sessions which took place in a lab setting appropriately equipped with cameras and microphones. Our main concern in collecting this multimodal corpus was to explore the possibility of providing feedback services to facilitate group processes and to enhance self awareness among small groups engaged in meetings. We therefore introduce a coding scheme for annotating relevant functional roles that appear in a small group interaction. We also discuss the reliability of the coding scheme and we present the first results for automatic classification.
In this article, we present normative data for 2,423 Chinese single-character words. For each word, we report values for the following 15 variables: word frequency, cumulative frequency, homophone density, phonological frequency, age of learning, age of acquisition, number of word formations, number of meanings, number of components, number of strokes, familiarity, concreteness, imageability, regularity, and initial phoneme. To validate the norms, we collected word-naming latencies. Factor analysis and multiple regression analysis show that naming latencies of Chinese single-character words are predicted by frequency, semantics, visual features, and consistency, but not by phonology. These analyses show distinct patterns in word naming between Chinese and alphabetic languages and demonstrate the utility of normative data in the study of nonalphabetic orthographic processing.
L'{\^{a}}ge d'acquisition et la familiarit{\'{e}} d'un mot sont des facteurs d{\'{e}}cisifs pour l'acc{\`{e}}s au lexique, en production comme en perception. Pour favoriser les recherches sur les m{\'{e}}canismes du traitement lexical en Fran{\c{c}}ais, une base de donn{\'{e}}es lexicales a {\'{e}}t{\'{e}} constitu{\'{e}}e pour un corpus de 1225 mots monosyllabiques et bisyllabiques du Fran{\c{c}}ais. Cet article d{\'{e}}crit la m{\'{e}}thode utilis{\'{e}}e pour le recueil des donn{\'{e}}es, l'information brute obtenue, la proc{\'{e}}dure pour traiter cette information brute, et le contenu de la base de donn{\'{e}}es CHACQFAM obtenu apr{\`{e}}s traitement de l'information brute, ainsi que les proc{\'{e}}dures de validation de ce contenu. CHACQFAM est disponible gratuitement sur le site Internet « http://psycholinguistique.unige.ch/ ». Mots-cl{\'{e}
We examine the use of film subtitles as an approximation of word frequencies in human interactions. Because subtitle files are widely available on the Internet, they may present a fast and easy way to obtain word frequency measures in language registers other than text writing. We compiled a corpus of 52 million French words, coming from a variety of films. Frequency measures based on this corpus compared well to other spoken and written frequency measures, and explained variance in lexical decision times in addition to what is accounted for by the available French written frequency measures.
The use of multilevel modeling is presented as an alternative to separate item and subject ANOVAs (F1 x F2) in psycholinguistic research. Multilevel modeling is commonly utilized to model variability arising from the nesting of lower level observations within higher level units (e.g., students within schools, repeated measures within individuals). However, multilevel models can also be used when two random factors are crossed at the same level, rather than nested. The current work illustrates the use of the multilevel model for crossed random effects within the context of a psycholinguistic experimental study, in which both subjects and items are modeled as random effects within the same analysis, thus avoiding some of the problems plaguing current approaches.
Opinion mining (OM) is a recent subdiscipline at the crossroads of information retrieval and computational linguistics which is concerned not with the topic a document is about, but with the opinions it expresses. OM has a rich set of applications, ranging from tracking users' opinions about products or about political candidates as expressed in online forums, to customer relationship management. In order to aid the extraction of opinions from text, recent research has tried to automatically determine the “PN-polarity” of subjective terms, i.e. identify whether a term that indicates the presence of an opinion has a positive or a negative connotation. Research on determining the “SO-polarity” of terms, i.e. whether a term indeed indicates the presence of an opinion (a subjective term) or not (an objective, or neutral term) has been instead much scarcer. In this paper we describe SentiWordNet, a lexical resource produced by asking an automated classifier ˆ to associate to each synset s of WordNet (version 2.0) a triplet of scores ˆ(s, p) (for p 2 P ={\{}Positive, Negative, Objective{\}}) describing how strongly the terms contained in s enjoy each of the three properties. The method used to develop SentiWordNet is based on the quantitative analysis of the glosses associated to synsets, and on the use of the resulting vectorial term representations for semi-supervised synset classification. The score triplet is derived by combining the results produced by a committee of eight ternary classifiers, all characterized by similar accuracy levels but extremely different classification behaviour. We present the results of evaluating the accuracy of the automatically assigned triplets on a publicly available benchmark. SentiWordNet is freely available for research purposes, and is endowed with a Web-based graphical user interface.
The analysis of lectures and meetings inside smart rooms has recently attracted much interest in the literature, being the focus of international projects and technology evaluations. A key enabler for progress in this area is the availability of appropriate multimodal and multi-sensory corpora, annotated with rich human activity information during lectures and meetings. This paper is devoted to exactly such a corpus, developed in the framework of the European project CHIL, “Computers in the Human Interaction Loop”. The resulting data set has the potential to drastically advance the state-of-the-art, by providing numerous synchronized audio and video streams of real lectures and meetings, captured in multiple recording sites over the past 4 years. It particularly overcomes typical shortcomings of other existing databases that may contain limited sensory or monomodal data, exhibit constrained human behavior and interaction patterns, or lack data variability. The CHIL corpus is accompanied by rich manual annotations of both its audio and visual modalities. These provide a detailed multi-channel verbatim orthographic transcription that includes speaker turns and identities, acoustic condition information, and named entities, as well as video labels in multiple camera views that provide multi-person 3D head and 2D facial feature location information. Over the past 3 years, the corpus has been crucial to the evaluation of a multitude of audiovisual perception technologies for human activity analysis in lecture and meeting scenarios, demonstrating its utility during internal evaluations of the CHIL consortium, as well as at the recent international CLEAR and Rich Transcription evaluations. The CHIL corpus is publicly available to the research community.
The idea that at least some aspects of word meaning can be induced from patterns of word co-occurrence is becoming increasingly popular. However, there is less agreement about the precise computations involved, and the appropriate tests to distinguish between the various possibilities. It is important that the effect of the relevant design choices and parameter values are understood if psychological models using these methods are to be reliably evaluated and compared. In this article, we present a systematic exploration of the principal computational possibilities for formulating and validating representations of word meanings from word co-occurrence statistics. We find that, once we have identified the best procedures, a very simple approach is surprisingly successful and robust over a range of psychologically relevant evaluation measures.
Many recent studies have demonstrated the influence of sublexical frequency measures on language processing, or called for controlling sublexical measures when selecting stimulus material for psycholinguistic studies (Aichert {\&} Ziegler, 2005). The present study discusses which measures should be controlled for in what kind of study, and presents orthographic and phonological syllable, dual unit (bigram and biphoneme) and single unit (letter and phoneme) type and token frequency measures derived from the lemma and word form corpora of the CELEX lexical database (Baayen, Piepenbrock, {\&} Gulikers, 1995). Additionally, we present the SUBLEX software as an adaptive tool for calculating sublexical frequency measures and discuss possible future applications. The measures and the software can be downloaded at www.psychonomic.org.
Much electronic text in the languages of South Asia has been published on the Internet. However, while Unicode has emerged as the favoured encoding system of corpus and computational linguists, most South Asian language data on the web uses one of a wide range of non-standard legacy encodings. This paper describes the difficulties inherent in converting text in these encodings to Unicode. Among the various legacy encodings for South Asian scripts, the most problematic are 8-bit fonts based on graphical principles (as opposed to the logical principles of Unicode). Graphical fonts typically encode several features in ways highly incompatible with Unicode. For instance, half-form glyphs used to construct conjunct consonants are typically separate code points in 8-bit fonts; in Unicode they are represented by the full consonant followed by virama. There are many more such cases. The solution described here is an approach to text conversion based on mapping rules. A small number of generalised rules (plus the capacity for more specialised rules) captures the behaviour of each character in a font, building up a conversion algorithm for that encoding. This system is embedded in a font-mapping program, outputting CES-compliant SGML Unicode. This program, a generalised text-conversion tool, has been employed extensively in corpus-building for South Asian languages.
We describe the Leipzig Corpora collection (LCC), a freely available resource for corpora and corpus statistics covering more than 20 languages at the time being. Unified format and easy accessibility encourage incorporation of the data into many projects and render the collection a useful resource especially in multilingual settings and for small languages. The preparation of monolingual corpora of standard sizes from different sources (web, newspaper, Wikipedia) is described in detail.
Two experiments attempted to resolve previous contradictory findings concerning developmental trends in false memories within the Deese-Roediger-McDermott (DRM) paradigm by using an improved methodology--constructing age-appropriate associative lists. The research also extended the DRM paradigm to preschoolers. Experiment 1 (N=320) included children in three age groups (preschoolers of 3-4 years, second-graders of 7-8 years, and preadolescents of 11-12 years) and adults, and Experiment 2 (N=64) examined preschoolers and preadolescents. Age-appropriate lists increased false recall. Although preschoolers had fewer false memories than the other age groups, they showed considerable levels of false recall when tested with age-appropriate materials. Results were discussed in terms of fuzzy-trace, source-monitoring, and activation frameworks.
The memory block effect (MBE) occurs when orthographically similar words inhibitretrieval. Previous studies have published 55 different stimuli that produce the MBE in word fragment completion. This small number of stimuli constrains experimental designs, presents serious obstacles for using neuroimaging to elucidate neural substrates of blocking, and raises concern that the MBE is limited to a particular group of words. A pool of 315 stimulus words was tested in a traditional MBE paradigm, and the results demonstrated that the MBE generalizes to other stimuli. This study also expands the number of stimuli that produce the MBE because 185 new stimuli produced blocking effects. As a result, the current list of 240 MBE stimuli can be used for word fragment research including cognitive neuroscience investigations of retrieval inhibition. A table of MBE stimuli is available in an archived appendix that can be downloaded from www.psychonomic.org/archive.
The Affective Norms for English Words (ANEW) are a commonly used set of 1,034 words characterized on the affective dimensions of valence, arousal, and dominance. Traditionally, studies of affect have used stimuli characterized along either affective dimensions or discrete emotional categories, but much current research draws on both of these perspectives. As such, stimuli that have been thoroughly characterized according to both of these approaches are exceptionally useful. In an effort to provide researchers with such a characterization of stimuli, we have collected descriptive data on the ANEW to identify which discrete emotions are elicited by each word in the set. Our data, coupled with previous characterizations of the dimensional aspects of these words, will allow researchers to control for or manipulate stimulus properties in accordance with both dimensional and discrete emotional views, and provide an avenue for further integration of these two perspectives. Our data have been archived at www.psychonomic.org/archive/.
Age of acquisition of a word and familiarity are important factors for the lexical processing, in production as in perception. To help developing research on the mechanisms underlying the lexical processing in French, a lexical data base was built for a corpus of 1225 monosyllabic and disyllabic French words. This article describes the method used to collect the data, the rough information obtained with the survey, explains the method that was used to process the rough information, describes the content of the lexical data base CHACQFAM, obtained after the rough data has been processed, and describes the validation procedure of its content. CHACQFAM is made freely available to researchers in an electronic format, from the website 'http://psycholinguistique.unige.ch/'. (PsycINFO Database Record (c) 2016 APA, all rights reserved)
Written word frequency (e.g., Francis {\&} Kucera, 1982; Kucera {\&} Francis, 1967) constitutes a popular measure of word familiarity, which is highly predictive of word recognition. Far less often, researchers employ spoken frequency counts in their studies. This discrepancy can be attributed most readily to the conspicuous absence of a sizeable spoken frequency count for American English. The present article reports the construction of a 1.6-million-word spoken frequency database derived from the Michigan Corpus of Academic Spoken English (Simpson, Swales, {\&} Briggs, 2002). We generated spoken frequency counts for 34,922 words and extracted speaker attributes from the source material to generate relative frequencies of words spoken by each speaker category. We assess the predictive validity of these counts, and discuss some possible applications outside of word recognition studies.
The York–Toronto–Helsinki Parsed Corpus of Old English Prose (YCOE) is a 1.5 million-word syntactically annotated corpus of Old English prose texts. It was produced at the University of York, UK, between 2000 and 2003, by Ann Taylor, Anthony Warner, Susan Pintzuk and Frank Beths, with a grant from the English Arts and Humanities Research Board (B/RG/AN5907/APN9528). The YCOE is part of the English Parsed Corpora Series. It was the third historical corpus to be completed in this format, and uses the same kind of annotation scheme as its sister corpora, the Penn–Helsinki Parsed Corpus of Middle English II (PPCME2), the York–Helsinki Parsed Corpus of Old English Poetry and the Penn–Helsinki Parsed Corpus of Early Modern English. Two other corpora in the series, the parsed version of the Corpus of Early English Correspondence and the Penn Parsed Corpus of Modern British English, are currently under construction (at the University of York, UK, in cooperation with the University of Helsinki, Finland, and the University of Pennsylvania, USA, respectively).
A new Real-Time Subjective Emotionality Assessment (RTSEA) system was developed for this study. The system is composed of two parts: an emotionality input and evaluation parts. An experiment was conducted in order to investigate the effectiveness of the RTSEA system. The present study compared Galvanic Skin Response (GSR) with the RTSEA by presenting 28 subjects with pictures that aroused either positive or negative emotion. Following the experiment, a subjective assessment using a questionnaire was given to the same subjects. According to the correlation coefficients, changes of the RTSEA had strong correlations with the changes of the GSR. Also, the questionnaire results showed marked similarity to the average responses of the RTSEA. In conclusion, the most remarkable characteristic of the present system is that it not only assesses the average emotionality when stimuli are presented, but also shows the trend of change in emotionality over time.
In most experiments that involve between-subjects or between-items factorial designs, the items and/or the participants in the various experimental groups differ on one or more variables, but need to be matched on all other factors that can affect the outcome measure. Matching large groups of items or participants on multiple dimensions is a difficult and time-consuming task, yet failure to match conditions will lead to suboptimal experiments. We describe a computer program, "Match", that automates this process by selecting the best-matching items from larger sets of candidate items. In most cases, the program produces near-optimal solutions in amatter of minutes and selects matches that are typically superior to those obtained using hand matching or other semiautomated processes. We report the results of a case study in which Match was used to generate matched sets of experimental items (words varying in length and frequency) for a published study on language processing. The program was able to come up with better-matching item sets than those hand-selected by the authors of the original study, and in a fraction of the time originally taken up with stimulus matching.
Two-word familiarity sets were measured in different years (1995 and 2002) and places (Kanto and Kinki, in Japan) for a large number of Japanese words, to examine the reliability of familiarity ratings. The correlation between the word familiarities of the two sets was extremely high (r ? .958, N ? 10,515). It is suggested that familiarity rating, at least for ordinary words found in a dictionary, is very reliable and not greatly affected by differences in years and places.
This article presents the Spanish adaptation of the Affective Norms for English Words (ANEW; Bradley {\&} Lang, 1999). The norms are based on 720 participants' assessments of the translation into Spanish of the 1,034 words included in the ANEW. The evaluations were done in the dimensions of valence, arousal and dominance using the Self-Assessment Manikin (SAM). Apart from these dimensions, five objective (number of letters, number of syllables, grammatical class, frequency and number of orthographic neighbors) and three subjective (familiarity, concreteness and imageability) psycholinguistic indexes are included. The Spanish adaptation of ANEW can be downloaded at www.psychonomic.org.
Event-related potentials (ERP) were recorded in two experiments to examine the effects of concreteness and emotionality on visual word processing. Concrete and abstract words of negative, neutral or positive valence, as well as pseudowords were presented in a hemifield lexical decision task. Experiment 1 yielded early (P2) and late (N400, late positive component/LPC) emotional word effects. Concreteness affected the N400 and the LPC. In line with the extended dual coding model and with previous studies, the N400 effect represents greater semantic activation, whereas the LPC effect may result from mental imagery being activated by concrete words. Experiment 2 engaged participants in a go/no-go task pressing a button for pseudowords. Here, emotionality and concreteness modulated the N400 independently, but interacted in the LPC time window. Only concrete emotional words differed in the LPC response suggesting that concrete negative words such as "wound" or "bomb" differ from neutral and positive words as a function of mental imagery. {\textcopyright} 2007 Elsevier B.V. All rights reserved.
We present a set of translation norms for 670 English and 760 Spanish nouns, verbs and class ambiguous items that varied in their lexical properties in both languages, collected from 80 bilingual participants. Half of the words in each language received more than a single translation across participants. Cue word frequency and imageability were both negatively correlated with number of translations. Word class predicted number of translations: Nouns had fewer translations than did verbs, which had fewer translations than class-ambiguous items. The translation probability of specific responses was positively correlated with target word frequency and imageability, and with its form overlap with the cue word. Translation choice was modulated by L2 proficiency: Less proficient bilinguals tended to produce lower probability translations than more proficient bilinguals, but only in forward translation, from Ll to L2. These findings highlight the importance of translation ambiguity as a factor influencing bilingual representation and performance. The norms can also provide an important resource to assist researchers in the selection of experimental materials for studies of bilingual and monolingual language performance. These norms may be downloaded from www.psychonomic.org/archive.
The main objective of this study is to report rated age of acquisition (AoA) norms for 834 nouns in Portuguese (European). AoA ratings were collected on a 7 point scale, generally following Gilhooly and Logie (1980) procedure with an 8 extra point for "don't know the word" answers. Results were analyzed considering AoA ratings and their standard deviations and considering the relationship between AoA ratings and other psycholinguistic variables (imageability, familiarity, written word frequency, concreteness, number of syllables and number of words). AoA ratings and their standard deviations were significantly and positively correlated, with early acquired word ratings showing higher agreement. Correlation and multiple regression analyses confirmed the major contribution of imageability and familiarity to AoA ratings obtained in other languages. The full database of AoA ratings and other psycholinguistic variables may be downloaded from www.psychonomic.org/archive or www.fpce.ul.pt/pessoal/ulfpfred/aoa.htm.
We report normative data collected from Mainland Chinese speakers for 232 objects taken from Snodgrass and Vanderwart (1980). These data include adult ratings of concept familiarity, age of acquisition (AoA), printedword frequency, and word length (in syllables), as well as measures of rated visual complexity, image agreement, and name agreement. We then examined timed picture naming of these objects with native Chinese speakers in Beijing in two experiments using line drawings and colored pictures. In both experiments, the variables name agreement, rated concept familiarity, and AoA made significant independent contributions to naming latency in multiple regression analyses. We observed a correlation ofr=.85 between naming latency with line drawings and colored pictures and a reduced effect of image agreement on naming when colored pictures were presented. We discuss the implications of our findings for the study of lexical processing in Chinese. Normative data for 232 Chinese nouns may be downloaded from www.psychonomic.org/archive
The English Lexicon Project is a multiuniversity effort to provide a standardized behavioral and descriptive data set for 40,481 words and 40,481 nonwords. It is available via the Internet at elexicon.wustl.edu. Data from 816 participants across six universities were collected in a lexical decision task (approximately 3400 responses per participant), and data from 444 participants were collected in a speeded naming task (approximately 2500 responses per participant). The present paper describes the motivation for this project, the methods used to collect the data, and the search engine that affords access to the behavioral measures and descriptive lexical statistics for these stimuli.
The purpose of the present investigation was to replicate and extend the International Affective Picture System norms (Ito, Cacioppo, {\{}{\&}{\}} Lang, 1998; Lang, Bradley, {\{}{\&}{\}} Cuthbert, 1999). These norms were developed to provide researchers with photographic slides that varied in emotional evocation, especially arousal and valence. In addition to collecting rating data on the dimensions of arousal and valence, we collected data on the dimensions of consequentiality, meaningfulness, familiarity, distinctiveness, and memorability. Furthermore, we collected ratings on the primary emotions of happiness, surprise, sadness, anger, disgust, and fear. A total of 1,302 participants were tested in small groups. The participants in each group rated a subset of 18 slides on 14 dimensions. Ratings were obtained on 703 slides. The means and standard deviations for all of the ratings are provided. We found our valence ratings to be similar to the previous norms. In contrast, our participants were more likely to rate the slides as less arousing than in the previous norms. The mean ratings on the remaining 12 dimensions were all below the midpoint of the 9-point Likert scale. However, sufficient variability in ratings across the slides indicates that selecting slides on the basis of these variables is feasible. Overall, the present ratings should allow investigators to use these norms for research purposes, especially in research dealing with the interrelationships among emotion and cognition. The means and standard deviations for emotions may be downloaded as an Excel spreadsheet from www.psychonomic.org/archive.
Age of acquisition is one of the most important variables in picture naming. For this reason, a large number of findings concerning age-of-acquisition data have been published in recent years in a number of different languages. In this article, objective age-of-acquisition data in Spanish for 328 pictures were collected from a pool of 760 children, half of whom were boys and the other half girls. A total of 246 pictures were selected from the Snodgrass and Vanderwart (1980) set, and 82 were new pictures. Like the results of other studies, we found that objective age of acquisition correlates less than rated age of acquisition with familiarity and frequency, which indicates that the objective measure is less contaminated by other variables than are rated estimates. A very high correlation was obtained between the norms from this study and those published in English, French, Icelandic, and Italian. These norms will be very useful to Spanish psycholinguists and clinicians. Related materials may be downloaded from the Psychonomic Society Web archive at www.psychonomic.org/archive/.
Sound events are sequences of closely grouped and temporally related environmental sounds that tell a story or establish a sense of place. The goal of our project was to create a set of sound events depicting various scenarios (such as a car accident, cooking breakfast, and walking outdoors) and to gather normative data about how people understand them. Samples of college students listened to 22 sound events over headphones in three self-paced, computer-based studies. In the Identification Task, 43 participants used text boxes to type descriptions of what was happening in the sound events. In the Rating Task, 39 participants used Likert scales to rate the sound events on the attributes of familiarity, complexity, and pleasantness. In the Memory Task, 42 participants answered two multiple-choice questions immediately after listening to each sound event. Detailed tables are provided for the following: (1) Description of the sound events and their components; (2) accuracy and response time measurements for each of the 22 sound events across the three studies; and (3) rank-orderings of the sound events by ease of identification, recognition of details, and rated familiarity, complexity, and pleasantness. Digital files of the stimuli, which may be of interest to auditory cognition researchers and clinical neuropsychologists, may be downloaded from either www.psychonomic.org/archive or www.cofc.edu/-marcellm/sound event studies/sndevent.htm.
There is a longstanding tradition in psychological research for norming lists of words that are used in experimental studies. The present study extends this practice to graphic imagery by obtaining norming data on 24 simple abstract graphic shapes composed of three straight-line segments. The attributes obtained in the norming procedure were the shapes' familiarity, describability, associability, availability, and potential for word association. Results from rating data indicate significantly different, yet reliable, responses by participants to the various shape configurations. Multidimensional scaling analysis of shape ratings identified two underlying dimensions of perceived differences: the continuity of a shape's linear direction and the consistency or regularity of its interior angles. By contrast, performance in generating word associations for figures appeared to be linguistically driven, with initial responses related to the similarity of shapes to letters of the alphabet. The norms and the computer program used to collect them can be downloaded from www.psychonomic.org/archive. (PsycINFO Database Record (c) 2016 APA, all rights reserved)
For more than half a century, emotion researchers have attempted to establish the dimensional space that most economically accounts for similarities and differences in emotional experience. Today, many researchers focus exclusively on two-dimensional models involving valence and arousal. Adopting a theoretically based approach, we show for three languages that four dimensions are needed to satisfactorily represent similarities and differences in the meaning of emotion words. In order of importance, these dimensions are evaluation-pleasantness, potency-control, activation-arousal, and unpredictability. They were identified on the basis of the applicability of 144 features representing the six components of emotions: (a) appraisals of events, (b) psychophysiological changes, (c) motor expressions, (d) action tendencies, (e) subjective experiences, and (f) emotion regulation.
We briefly discuss the origin and development of WordNet, a large lexical database for English. We outline its design and contents as well as its usefulness for Natural Language Processing. Finally, we discuss crosslinguistic WordNets and complementary lexical resources. [ABSTRACT FROM AUTHOR], Copyright of Language Resources & Evaluation is the property of Springer Nature and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
In the typical memory conjunction experiment, participants are presented with two "parent" stimulus items (e.g., blackmail and jailbird) that are later recombined to form a "conjunction lure" (e.g., blackbird). This paradigm is an efficient way to test false memories because participants frequently show false recognition for the recombined features of the previously studied stimuli. Two experiments are reported in which normative data for 96 memory conjunction triplets are presented. The first experiment provides descriptive statistics for how often the conjunction triplets show true and false recognition. Due to the variance in the rates of false recognition for the conjunction lure, the second experiment was conducted to help build an understanding of the factors that affect the rate of false recognition of the conjunction lures. Conceptual overlap of the first parent word and the conjunction item predicted false recognition. Digital files containing norms for 96 memory conjunction triplets may be downloaded from www.psychonomic.org/archive.
TimeBank is the only reference corpus for TimeML, an expressive language for annotating complex temporal information. It is a rich resource for a broad range of research into various aspects of the expression of time and temporally related events. This paper traces the development of TimeBank from its initial—and somewhat noisy—version (1.1) to a substantially revised release (1.2), now available via the Linguistic Data Consortium. The development path is motivated by the encouraging empirical results of TimeML-compliant annotators developed on the basis of TimeBank 1.1, and is informed by a detailed study of the characteristics of that initial release, which guides a clean-up process turning TimeBank 1.2 into a consistent and robust community resource.
We investigated the linguistic features of temporal cohesion that distinguish variations in temporal coherence. In an analysis of 150 texts, experts rated temporal coherence on three continuous scale measures designed to capture unique representations of time. Coh-Metrix, a computational tool that assesses textual cohesion, correctly predicted the human ratings with five features of temporal cohesion. The correlations between predicted and actual scores were all statistically significant. In a complementary study, we explored the importance of temporal cohesion in characterizing genre. A discriminant function analysis, using Coh-Metrix temporal indices, successfully distinguished the genres of science, history, and narrative texts. The results suggested that history texts are more similar to narrative texts than to science texts in terms of temporal cohesion.
The AMI Meeting Corpus contains 100 h of meetings captured using many synchronized recording devices, and is designed to support work in speech and video processing, language engineering, corpus linguistics, and organizational psychology. It has been transcribed orthographically, with annotated subsets for everything from named entities, dialogue acts, and summaries to simple gaze and head movement. In this written version of an LREC conference keynote address, I describe the data and how it was created. If this is “killer” data, that presupposes a platform that it will “sell”; in this case, that is the NITE XML Toolkit, which allows a distributed set of users to create, store, browse, and search annotations for the same base data that are both time-aligned against signal and related to each other structurally.
It is well known that the statistical characteristics of a language, such as word frequency or the consistency of the relationships between orthography and phonology, influence literacy acquisition. Accordingly, linguistic databases play a central role by compiling quantitative and objective estimates about the principal variables that affect reading and writing acquisition. We describe a new set of Web-accessible databases of French orthography whose main characteristic is that they are based on frequency analyses of words occurring in reading books used in the elementary school grades. Quantitative estimates were made for several infralexical variables (syllable, grapheme-to-phoneme mappings, bigrams) and lexical variables (lexical neighborhood, homophony and homography). These analyses should permit quantitative descriptions of the written language in beginning readers, the manipulation and control of variables based on objective data in empirical studies, and the development of instructional methods in keeping with the distributional characteristics of the orthography.
Only very recently have Vietnamese researchers begun to be involved in the domain of Natural Language Processing (NLP). As there does not exist any published work in formal linguistics nor any recognizable standard for Vietnamese word definition and word categories, the fundamental tasks for automatic Vietnamese language processing, such as part-of-speech tagging, parsing, etc., are very difficult tasks for computer scientists. The fact that all necessary linguistic resources have to be built from scratch by each research team is a real obstacle to the development of Vietnamese language processing. The aim of our projects is thus to build a common linguistic database that is freely and easily exploitable for the automatic processing of Vietnamese. In this paper, we present our work on creating a Vietnamese lexicon for NLP applications. We emphasize the standardization aspect of the lexicon representation. We especially propose an extensible set of Vietnamese syntactic descriptions that can be used for tagset definition and morphosyntactic analysis. These descriptors are established in such a way as to be a reference set proposal for Vietnamese in the context of ISO subcommittee TC 37/SC 4 (Language Resource Management).
The complexity of Chinese orthography has hindered the progress of research in Chinese to the same level of sophistication of that in alphabetic languages such as English. Also, there has been no publicly available resource concerning the decomposition of Chinese characters, which is essential in any attempt to model the cognitive processes of Chinese character recognition. Here we report our construction and analysis of a Chinese lexical database containing the most frequent phonetic compounds decomposed into semantic and phonetic radicals according to Chinese etymology. Each radical was further decomposed into basic stroke patterns according to a Chinese transcription system, Cangjie (Chu, 1979 Laboratory of chu Bong-Foo Retrieved August 25, 2004, from http://www.cbflabs.com/). Other information such as pronunciation and character frequency were also incorporated. We examine the distribution of different types of character, the information skew in phonetic compounds, the relations between subcharacter orthographic units and the pronunciation of the entire character, and the processing implications of these phenomena in terms of universal psycholinguistic principles. (PsycINFO Database Record (c) 2016 APA, all rights reserved)
Many studies have addressed the issue of whether age of acquisition and/ or frequency affect particular lexical tasks. Methods typically employed in such studies are based on the general linear model (e.g., ANOVA or multiple regression). These methods assume manipulated independent variables whereas the usual approach of investigating age-of-acquisition and frequency effects uses estimated norms of word properties. This failure to truly manipulate variables violate the assumptions of the analyses. A simulation is provided that demonstrates how this violation can lead to erroneous conclusions of effects when none are present. Recommendations are made for a more correlational approach to analysis using structural equation modelling techniques. It is also discussed how this use of estimates of lexical data is problematic for determining effects throughout psycholinguistic research. {\textcopyright} 2006 Psychology Press Ltd.
A simple and flexible schema for storing and presenting monolingual language resources is proposed. In this format, data for 18 different languages is already available in various sizes. The data is provided free of charge for online use and download. The main target is to ease the application of algorithms for monolingual and interlingual studies.
Word lists are most commonly used in the investigation of human memory. To prevent transfer effects, repeated measures of memory for words require multiple lists of different words. Yet, the psycholinguistic properties of all word lists employed should match as closely as possible to avoid confounding with the independent variable(s) in question. Although comprehensive databases for word norms exist, to our knowledge no tool is available that automates the creation of such equivalent word lists. Instead, matching different lists is often accomplished prima facie. We have therefore developed a Windows program called EQUIWORD that completely automates the creation of word lists that are truly parallel with respect to a wide range of attributes. EQUIWORD takes psycholinguistic databases of different formats as input and computes several coefficients of distance for every possible word pairing. Program output consists of a list of all word pairs sorted according to their distance. On that basis, creating equivalent word lists is simply done by selecting the pairs with the lowest distance coefficients.
The present study provides Canadian French normative data for 388 line drawings from the European Picture Pool for Oral Naming (Protocole europ{\'{e}}en de d{\'{e}}nomination orale d'images; PEDOI; Kremin et al., 2003). One hundred eighty subjects were equally distributed for age group (18-39,40-59, 60-85), educational level (low, high), and sex. They rated pictures of objects on age of acquisition, name agreement, familiarity, and visual complexity. Syllable length and word frequency were also taken into account. The present study suggests that age of acquisition and name agreement show significant age-related differences. These results show that unequivocal interpretation of age-related differences can be made when age-appropriate norms are used.
Four hundred forty-eight children 3-12 years of age generated category exemplars for 33 distinct categories. The percentage of the participants reporting each exemplar, the percentage of the participants reporting each exemplar first, the percentage of the participants reporting each exemplar across age groups (3-5 years, 6-8 years, and 9-12 years), and the mean rank of each exemplar are presented. A full version of the 29 category norms may be downloaded from www.psychonomic.org/archive.
Corpora are an important resource for both teaching and research. Arabic lacks sufficient resources in this field, so a research project has been designed to compile a corpus, which represents the state of the Arabic language at the present time and the needs of end-users. This report presents the result of a survey of the needs of teachers of Arabic as a foreign language (TAFL) and language engineers. The survey shows that a wide range of text types should be included in the corpus. Overall, our survey confirms our view that existing corpora are too narrowly limited in source-type and genre, and that there is a need for a freely-accessible corpus of contemporary Arabic covering a broad range of text-types. We have collected and published an initial version of the Corpus of Contemporary Arabic (CCA) to meet these design issues. The CCA is freely downloadable via WWW from http://www.comp.leeds.ac.uk/arabic.
We introduce the Berlin Affective Word List (BAWL) in order to provide researchers with a German database containing both emotional valence and imageability ratings for more than 2,200 German words. The BAWL was cross-validated using a forced choice valence decision task in which two distinct valence categories (negative or positive) had to be assigned to a highly controlled selection of 360 words according to varying emotional content (negative, neutral, or positive). The reaction time (RT) results corroborated the valence categories: Words that had been rated as "neutral" in the norms yielded maximum RTs. The BAWL is intended to help researchers create stimulus materials for a wide range of experiments dealing with the emotional processing of words.
In the present study, we tested a computer-based procedure for assessing very concise summaries (50 words long) of two types of text (narrative and expository) using latent semantic analysis (LSA) in comparison with the judgments of four human experts. LSA was used to estimate semantic similarity using six different methods: four holistic (summary-text, summary-summaries, summary-expert summaries, and pregraded-ungraded summary) and two componential (summary-sentence text and summary-main sentence text). A total of 390 Spanish middle and high school students (14-16 years old) and six experts read a narrative or expository text and later summarized it. The results support the viability of developing a computerized assessment tool using human judgments and LSA, although the correlation between human judgments and LSA was higher in the narrative text than in the expository, and LSA correlated more with human content ratings thanwith hu mancoherence ratings. Finally, theholistic methods were found to be more reliable than the componential methods analyzed in this study.
This paper describes a multi-modal corpus of hand-annotated meeting dialogues that was designed for studying addressing behaviour in face-to-face conversations. The corpus contains annotated dialogue acts, addressees, adjacency pairs and gaze direction. First, we describe the corpus design where we present the meetings collection, annotation scheme and annotation tools. Then, we present the analysis of the reproducibility and stability of the annotation scheme.
In this paper, we propose a segment-based annotation tool providing appropriate interactivity between a human annotator and an automatic parser. The proposed annotation tool provides the preview of a complete sentence structure suggested by the parser, and updates the preview whenever the annotator cancels or selects each segmentation point. Thus, the annotator can select the proper sentence segments maximizing parsing accuracy and minimizing human intervention. Experimental results show that the proposed tool allows the annotator to be able to reduce human intervention by approximately 39% compared with manual annotation. Sejong Korean treebank, one of the large scale treebanks, was constructed with the proposed annotation tool.
Two studies were conducted in which human participants rated pairs of words according to the perceived degree to which the words' referents shared semantic features. The participants found the task intuitive, simple, and quick to complete. The ratings were reliable and valid. Interrater and interstudy correlations were high, and ratings were good predictors of known feature overlap values obtained from existing semantic feature norms. (PsycINFO Database Record (c) 2006 APA ) (journal abstract)
Anagram tasks are frequently used in cognitive research, and the generation of new scrambled letter combinations is a task well suited to a software solution. Most available programs, however, do not allow experimenters to generate new anagrams flexibly or to characterize existing anagrams using psycholinguistic criteria. They also do not provide detailed information on their source dictionaries. We present anagram software that interfaces with CELEX2, an internationallyrecognized psycholinguistic database. This software allows users to capitalize on lexical variables and thus enables direct control of psycholinguistic features that may influence the cognitive processes involved in anagram solution.
We describe a Windows program that enables users to obtain a broad range of statistics concerning the properties of word and nonword stimuli in an agglutinative language (Basque), including measures of word frequency (at the whole-word and lemma levels), bigram and biphone frequency, orthographic similarity, orthographic and phonological structure, and syllable-based measures. It is designed for use by researchers in psycholinguistics, particularly those concerned with recognition of isolated words and morphology. In addition to providing standard orthographic and phonological neighborhood measures, the program can be used to obtain information about other forms of orthographic similarity, such as transposed-letter similarity and embedded-word similarity. It is available free of charge from www .uv.es/mperea/E-Hitz.zip.
In a 12-month project we have developed a new, register-diverse, 55-million-word bilingual corpus—the New Corpus for Ireland (NCI)—to support the creation of a new English-to-Irish dictionary. The paper describes the strategies we employed, and the solutions to problems encountered. We believe we have a good model for corpus creation for lexicography, and others may find it useful as a blueprint. The corpus has two parts, one Irish, the other Hiberno-English (English as spoken in Ireland). We describe its design, collection and encoding.
Matching stimuli across a range of influencing variables is no less important for studies of face recognition than it is for those of word processing. Whereas a number of corpora exist to allow experimenters to select a carefully controlled set of word stimuli, similar databases for famous faces do not exist. This article, therefore, provides researchers in the area of face recognition with a useful resource on which to base their stimulus selection. In the first phase of the investigation, British adults over 40 years of age were requested to generate the names of famous people (or celebrities) that they thought they would recognize and to write these down. The most frequently named celebrities were then rated by adults from the same age population for familiarity, distinctiveness, and age of acquisition. The result is a database of 696 famous people, with an indication of their relative eminence in the public consciousness and rated for these important variables. Phoneme counts are also provided for each famous person, together with family name frequency counts in the general population, where available. Materials and links may be accessed at www.psychonomic.org/archive.
We describe a Chinese lexical semantic resource that consists of 11,765 predicates (mostly verbs and their nominalizations) analyzed with coarse-grained senses and semantic roles. We show that distinguishing senses at a coarse-grained level is a necessary part of specifying the semantic roles and describe our strategies for sense determination for purposes of predicate-argument structure specification. The semantic roles are postulated to account for syntactic variations, the different ways in which the semantic roles of a predicate are realized. The immediate purpose for this lexical semantic resource is to support the annotation of the Chinese PropBank, but we believe it can also serve as stepping stone for higher-level semantic generalizations.
Ratings of age of acquisition (AoA), imageability, and familiarity were collected for 1,526 words. The methodology made use of a modular approach, in which the full sample of words was divided into five separate blocks. Within each block, each word was rated on each of the three variables by 20 partici- pants (undergraduate students from the University of Bristol). Analyses comparing these ratings to existing norm databases demonstrated that this methodology resulted in high reliability (assessed by Cronbach's ) and validity. The ratings were also transformed to be compatible with the Gilhooly and Logie (1980) norms. This transformation resulted in a set of norms for 3,394 words, which is by far the largest database of ratings for AoA, imageability, and familiarity to date. The resulting database should be useful for researchers interested in manipulating or controlling these factors in word recognition, neuropsychological, or memory studies. These norms can be downloaded from language.psy.bris .ac.uk/bristol{\_}norms.html.
The HAL (hyperspace analog to language) model of lexical semantics uses global word co-occurrence from a large corpus of text to calculate the distance between words in co-occurrence space. We have implemented a system called HiDEx (High Dimensional Explorer) that extends HAL in two ways: It removes unwanted influence of orthographic frequency from the measures of distance, and it finds the $\backslash$nnumber of words within a certain distance of the word of interest (NCount, the number of neighbors). These two changes to the HAL model produce measures of word neighborhood density that are reliably predictive of human lexical decision reaction times.
Syllogistic reasoning, in which people identify conclusions from quantified premise pairs, remains a benchmark task whose patterns of data must be accounted for by general theories of deductive reasoning. However, psychologists have confined themselves to administering only the 64 premise pairs historically identified by Aristotle. By utilizing all combinations of negations, the present article identifies an expanded set of 576 premise pairs and gives the valid conclusions that they support. Many of these have interesting properties, and the identification of predictions and their verification will be an important next step for all proponents of such theories.
CoLFIS is a lexical database of written Italian, with the following features: it is based on a balanced corpus of over 3 millions words, reflecting the reading habits of the Italian population as inferred by ISTAT data; the lexical data are fully lemmatized and part-of-speech annotated; it provides a frequency lexicon/dictionary for both lemmas (“lemmario”) and forms (“formario”).
Este estudo apresenta dados normativos de imag{\'{e}}tica (imagery) e concreteza (concreteness) para controlo e manipula{\c{c}}{\~{a}}o de substantivos comuns em Portugal. Medidas de imag{\'{e}}tica e concreteza foram recolhidas e s{\~{a}}o apresentadas para um total de 250 substantivos comuns.
Objective: To develop the native Chinese Affective Picture System (CAPS) for future research on emotion.!Methods: 852 pictures were screened out to make up of CAPS. 46 Chinese university students were collected to rate the valence, arousal and dominance by self-report in a 9-point rating scale for CAPS.!Results: The standard deviations of scores on valence and dominance were greater than that on arousal. Scatter plot showed that the score distribution on the dimension of valence and arousal was wide in CAPS.! Conclusion: Though IAPS (International Affective Picture System) is highly internationally-accessible, there are still significant differences between the two sources. The native Chinese Affective Picture System is necessary.
The purpose of this paper is to provide guidelines for building a word alignment evaluation scheme. The notion of word alignment quality depends on the application: here we review standard scoring metrics for full text alignment and give explanations on how to use them better. We discuss strategies to build a reference corpus, and show that the ratio between ambiguous and unambiguous links in the reference has a great impact on scores measured with these metrics. In particular, automatically computed alignments with higher precision or higher recall can be favoured depending on the value of this ratio. Finally, we suggest a strategy to build a reference corpus particularly adapted to applications where recall plays a significant role, like in machine translation. The manually aligned corpus we built for the Spanish-English European Parliament corpus is also described. This corpus is freely available.
The International Affective Picture System (IAPS) is widely used in studies of emotion and has been $\backslash$ncharacterized primarily along the dimensions of valence, arousal, and dominance. Even though $\backslash$nresearch has shown that the IAPS is useful in the study of discrete emotions, the categorical structure $\backslash$nof the IAPS has not been characterized thoroughly. The purpose of the present project was to collect $\backslash$ndescriptive emotional category data on subsets of the IAPS in an effort to identify images that elicit $\backslash$none discrete emotion more than others. These data reveal multiple emotional categories for the images $\backslash$nand indicate that this image set has great potential in the investigation of discrete emotions. This $\backslash$narticle makes these data available to researchers with such interests.
This study presents a database of 500 words from five semantic categories: animals, body parts, furniture, clothing, and intelligence. Each category contains 100 words, and data on lexical availability, age of acquisition, imageability, typicality, concept familiarity, written word frequency, and word length in number of syllables are provided with each word. The full set of norms may be downloaded from www.psychonomic.org/archive.
Semantic features have provided insight into numerous behavioral phenomena concerning concepts, categorization, and semantic memory in adults, children, and neuropsychological populations. Numerous theories and models in these areas are based on representations and computations involving semantic features. Consequently, empirically derived semantic feature production norms have played, and continue to play, a highly useful role in these domains. This article describes a set of feature norms collected from approximately 725 participants for 541 living (dog) and nonliving (chair) basic-level concepts, the largest such set of norms developed to date. This article describes the norms and numerous statistics associated with them. Our aim is to make these norms available to facilitate other research, while obviating the need to repeat the labor-intensive methods involved in collecting and analyzing such norms. The full set of norms may be downloaded from www.psychonomic.org/archive.
This paper describes a corpus annotation project to study issues in the manual annotation of opinions, emotions, sentiments, speculations, evaluations and other private states in language. The resulting corpus annotation scheme is described, as well as examples of its use. In addition, the manual annotation process and the results of an inter-annotator agreement study on a 10,000-sentence corpus of articles drawn from the world press are presented.
This study provides Japanese normative measures for 359 line drawings, including 260 pictures (44 redrawn) taken from Snodgrass and Vanderwart (1980). The pictures have been standardized on voice key naming times, name agreement, age of acquisition, and familiarity. The data were compared with American, Spanish, French, and Icelandic samples reported in previous studies. In general, the correlations between variables in the present study and those in the other studies were relatively high, except for name agreement. Naming times were predicted in multiple regression analyses by name agreement. The full set of the norms and the new pictures may be downloaded from www.psychonomic.org/archive/.
Picture naming has become an important experimental paradigm in cognitive psychology. Young children are more variable than adults in their naming responses and less likely to know the object or its name. A consequence is that the interpretation of the two classical measures used by Snodgrass and Vanderwart (1980) for scoring name agreement in adults (the percentage of agreement, based on modal name, and the H statistic, based on alternative names) will differ because of the high rate of "don't know object" responses, common in young children, relative to the low rate of "don't know object" responses more characteristic of adults. The present study focused on this methodological issue in young French children (3-8 years old), using a set of 145 Snodgrass-Vanderwart pictures. Our results indicate that the percentage of agreement based on the expected name is a better measure of picture-naming performance than are the commonly used measures. The norms may be downloaded from www.psychonomic.org/archive.
Familiarity with a word can be divided into two main components: familiarity with the form of the word (due to both its lexicality and its specific form) and familiarity with its meaning. In this study, ratings of familiarity were compared for words whose meaning was unknown to participants (UM words), for words of known meaning (KM words), and for unknown words (U words). Linguistic and experiential frequencies were equivalent. Rated familiarity was lower for UM than KM words and even lower for U words. Next, we built pseudowords from these stimuli by changing one letter and submitted them to two familiarity rating tasks that differed in the nature of the additional stimuli: either only nonwords or nonwords plus words. It was assumed that familiarity ratings would be lower for pseudowords built from UM words than for pseudowords built from KM words. The data were consistent with this assumption, and ratings depended on the initial categories of stimuli. These results support the view that usual word familiarity has two components, familiarity with form and familiarity with meaning, and a double source, processing of word form and processing of word meaning. The full set of these materials and norms may be downloaded from www.psychonomic.org/archive.
We provide objective data concerning the age of acquisition (AoA) of words from 202 Italian children 34-69 months of age. We investigated picture naming with 80 concrete words belonging to eight semantic categories that are included in a widely used battery for the study of naming and semantic memory. For each word, we calculated three different indices: two directly expressing the age at which a picture was given the correct name by at least 75{\%} of the subjects, and one expressing the overall percentage of our children who were correct in the task. (For the latter index, we provide separate values for boys and girls.). The correlation between objective indices of AoA and adult estimates culled from the literature was not very high. Moreover, objective indices showed low correlations with frequency and familiarity, in contrast to adult ratings. We conclude that adult estimates of AoA present validity problems and should be used with caution. The full set of stimuli is available at www.psychonomic.org/archive.
This article describes a Windows program that enables users to obtain a broad range of statistics concerning the properties of word and nonword stimuli, including measures of word frequency, orthographic similarity, orthographic and phonological structure, age of acquisition, and imageability. It is designed for use by researchers in psycholinguistics, particularly those concerned with recognition of isolated words. The program computes measures of orthographic similarity on line, either with respect to a default vocabulary of 30,605 words or to a vocabulary specified by the user. In addition to providing standard orthographic neighborhood measures, the program can be used to obtain information about other forms of orthographic similarity, such as transposed-letter similarity and embedded-word similarity. It is available, free of charge, from the following Web site: http://www.maccs.mq. edu.au/colin/N-Watch/.
Attributes associated with concept representations, such as familiarity, typicality, and age of acquisition, have been shown to be important influences on lexical-semantic processing. In most previous studies of healthy and pathological aging, these attributes are not equated for younger and older adults separately on the stimuli used. In this study, normative data were collected to test whether there exist any age differences in these attributes. The results demonstrate that the ratings given by younger and older adults on natural and manmade category items correlated positively. However, age differences were also apparent, whereby older adults provided higher ratings overall than younger adults. Suggestions and hypotheses are presented to explain this pattern of age differences, which relate to how category concepts may be represented by healthy younger and older adults. Also, the possible implications for these differential age ratings on lexical-semantic processing are discussed. The age differences apparent in this study demonstrate the need to consider age-appropriate normative ratings in the selection of stimuli for use in lexical-semantic processing studies of aging, and the normative data presented provide a means of equating category stimuli. The complete list of all the means is available at www.psychonomic.org/archive.
A Web-based coding application was designed to improve coding efficiency and to provide a systematic means of evaluating responses to open-ended assessments. The system was developed for use by multiple raters to assign open-ended responses to predetermined categories. The application provides a software environment for efficiently supervising the work of coders and evaluating the quality of the coding by (1) systematically presenting open-ended responses to coders, (2) tracking each coder's categorized responses, and (3) assessing interrater consistency at any time in order to identify coders in need of further training. In addition, the application can be set to automatically assign repeated responses to categories previously identified as appropriate for those responses. To evaluate the efficacy of the coding application and to determine the statistical reliability of coding open-ended data within this application, we examined data from two empirical studies. The results demonstrated substantial interrater agreement on items assigned to various categories across free and controlled association tasks. Overall, this new coding application provides a feasible method of reliably coding open-ended data and makes the task of coding these data more manageable. [PUBLICATION ABSTRACT]
The present study provides norms for Spanish word lists that have been used to create false memories in native speakers of Spanish. The word lists reported are based on the Roediger and McDermott (1995) lists that have been used extensively to examine illusory memories. We employed Roediger and McDermott's critical lures, translated them into Spanish, and created semantically associated Spanish word lists by testing native Spanish speakers. The resulting lists were then normed with additional native Spanish speakers. Overall, the participants recalled 53{\%} of the list items and 32{\%} of the critical lures with the word lists developed. In addition, 74{\%} of the list items and 69{\%} of the critical lures were recognized by the participants. The present study adds to the literature by providing a set of Spanish lists that can be used by researchers interested in evaluating false memories in individuals who speak Spanish. These norms may be downloaded from www.psychonomic.org/archive.
This article describes a Windows program that enables users to obtain a broad range of statistics concerning the properties of word and nonword stimuli in Spanish, including word frequency, syllable frequency, bigram and biphone frequency, orthographic similarity, orthographic and phonological structure, concreteness, familiarity, imageability, valence, arousal, and age-of-acquisition measures. It is designed for use by researchers in psycholinguistics, particularly those concerned with recognition of isolated words. The program computes measures of orthographic similarity online, with respect to either a default vocabulary of 31,491 Spanish words or a vocabulary specified by the user. In addition to providing standard orthographic and phonological neighborhood measures, the program can be used to obtain information about other forms of orthographic similarity, such as transposed-letter similarity and embedded-word similarity. It is available, free of charge, from the following Web site: www.maccs.mq.edu.au/-colin/B-Pal.
The CFVlexvar.xls database includes imageability, frequency, and grammatical properties of the first words acquired by Italian children. For each of 519 words that are known by children 18-30 months of age (taken from Caselli {\&} Casadio's, 1995, Italian version of the MacArthur Communicative Development Inventory), new values of imageability are provided and values for age of acquisition, child written frequency, and adult written and spoken frequency are included. In this article, correlations among the variables are discussed and the words are grouped into grammatical categories. The results show that words acquired early have imageable referents, are frequently used in the texts read and written by elementary school children, and are frequent in adult written and spoken language. Nouns are acquired earlier and are more imageable than both verbs and adjectives. The composition in grammatical categories of the child's first vocabulary reflects the composition of adult vocabulary. The full set of these norms can be downloaded from www.psychonomic.org/archive/
In this article, normative data on the familiarity and difficulty of 196 single-solution Spanish word fragments are presented. The database includes the following indices: difficulty, familiarity, frequency, number of meanings, number of letters given in the fragment, first and/or last letters given, and ratio of letters to blanks. A factor analysis was performed on difficulty, and two factors were obtained. Frequency, familiarity, and number of meanings loaded highly on the first factor, which we consider to measure lexical processes, whereas number of letters in the fragment, first and/or last letters given, and ratio of letters to blanks loaded highly on the second factor, which we judge to be determined by perceptual information. Regression analyses using factor scores as predictors showed that both factors accounted for a significant part of the completion probability scores. The full set of these norms may be downloaded from the Psychonomic Society Web archive at
Fiction is not always accurate, and this has consequences for readers. In laboratory studies, the reading of short stories led participants to produce story errors as facts on a later test of general knowledge (Marsh, Meade, {\&} Roediger, 2003). The present article describes these story stimuli in detail, so that interested researchers will be able to use the stimuli and change them as needed for particular research projects. This article provides instructions for using the stories and suggestions for modifying them; it is a manual for one way of creating suggestibility. The full set of stories and reading comprehension questions may be downloaded fromwww.psychonomic.org/archive/.
We report the results of a large-scale picture naming experiment in which we evaluated the potential contribution of nine theoretically relevant factors to naming latencies. The experiment included a large number of items and a large sample of participants. In order to make this experiment as similar as possible to classic picture naming experiments, participants were familiarized with the materials during a training session. Speeded naming latencies were determined by a software key on the basis of the digital recording of the responses. The effects of various variables on these latencies were assessed with multiple regression techniques, using a repeated measures design. The interpretation of the observed effects is discussed in relation to previous studies and current views on lexical access during speech production.
- SP{\'{I}}{\v{S}} FONOLOGIE, MOORY ATD$\backslash$r$\backslash$nOn the basis of the lexical corpus created by Amano and Kondo (2000), using the Asahi newspaper, the present study provides frequencies of occurrence for units of Japanese phonemes, morae, and syllables. Among the five vowels, /a/ (23.42{\%}), /i/ (21.54{\%}), /u/ (23.47{\%}), and /o/ (20.63{\%}) showed similar frequency rates, whereas /e/ (10.94{\%}) was less frequent. Among the 12 consonants, /k/ (17.24{\%}), /t/ (15.53{\%}), and /r/ (13.11{\%}) were used often, whereas /p/ (0.60{\%}) and /b/ (2.43{\%}) appeared far less frequently. Among the contracted sounds, /sj/ (36.44{\%}) showed the highest frequency, whereas /mj/ (0.27{\%}) rarely appeared. Among the five long vowels, /aR/ (34.4{\%}) was used most frequently, whereas /uR/ (12.11{\%}) was not used so often. The special sound /N/ appeared very frequently in Japanese. The syllable combination /k/+V+/N/ (19.91{\%}) appeared most frequently among syllabic combinations with the nasal /N/. The geminate (or voiceless obstruent) /Q/, when placed before the four consonants /p/, /t/, /k/, and /s/, appeared 98.87{\%} of the time, but the remaining 1.13{\%} did not follow the definition. The special sounds /R/, /N/, and /Q/ seem to appear very frequently in Japanese, suggesting that they are not special in terms of frequency counts. The present study further calculated frequencies for the 33 newly and officially listed morae/syllables, which are used particularly for describing alphabetic loanwords. In addition, the top 20 bi-mora frequency combinations are reported. Files of frequency indexes may be downloaded from the Psychonomic Society Web archive at http://www.psychonomic.org/archive/.
Researchers often require subjects to make judgments that call upon their knowledge of the orthographic structure of English words. Such knowledge is relevant in experiments on, for example, reading, lexical decision, and anagram solution. One common measure of orthographic structure is the sum of the frequencies of consecutive bigrams in the word. Traditionally, researchers have relied on token-based norms of bigram frequencies. These norms confound bigram frequency with word frequency because each instance (i.e., token) of a particular word in a corpus of running text increments the frequencies of the bigrams that it contains. In this article, the authors report a set of type-based bigram frequencies in which each word (i.e., type) contributes only once, thereby unconfounding bigram frequency from word frequency. The authors show that type-based bigram frequency is a better predictor of the difficulty of anagram solution than is token-based frequency. These norms can be downloaded from www.psychonomic.org/archive/ .
Faces constitute a unique and widely used category of stimuli. In spite of their importance, there are few collections of faces for use in research, none of which adequately represent the different ages of faces across the lifespan. This lack of a range of ages has limited the majority of researchers to using predominantly young faces as stimuli even when their hypotheses concern both young and old participants. We describe a database of 575 individual faces ranging from ages 18 to 93. Our database was developed to be more representative of age groups across the lifespan, with a special emphasis on recruiting older adults. The resulting database has faces of 218 adults age 18-29, 76 adults age 30-49, 123 adults age 50-69, and 158 adults age 70 and older. These faces may be acquired for research purposes from http://agingmind.cns.uiuc.edu/facedb/. This will allow researchers interested in using facial stimuli access to a wider age range of adult faces than has previously been available.
We describe a set of pictorial and auditory stimuli that we have developed for use in word learning tasks in which the participant learns pairings of novel auditory sound patterns (names) with pictorial depictions of novel objects (referents). The pictorial referents are drawings of "space aliens," consisting of images that are variants of 144 different aliens. The auditory names are possible nonwords of English; the stimulus set consists of over 2,500 nonword stimuli recorded in a single voice, with controlled onsets, varying from one to seven syllables in length. The pictorial and nonword stimuli can also serve as independent stimulus sets for purposes other than word learning. The full set of these stimuli may be downloaded from www.psychonomic.org/archive/.
A data set is described that includes eight variables gathered for 13 common superordinate natural language categories and a representative set of 338 exemplars in Dutch. The category set contains 6 animal categories (reptiles, amphibians, mammals, birds, fish, and insects), 3 artifact categories (musical instruments, tools, and vehicles), 2 borderline artifact-natural-kind categories (vegetables and fruit), and 2 activity categories (sports and professions). In an exemplar and a feature generation task for the category nouns, frequency data were collected. For each of the 13 categories, a representative sample of 5-30 exemplars was selected. For all exemplars, feature generation frequencies, typicality ratings, pairwise similarity ratings, age-of-acquisition ratings, word frequencies, and word associations were gathered. Reliability estimates and some additional measures are presented. The full set of these norms is available in Excel format at the Psychonomic Society Web archive, www.psychonomic.org/archive/.
The aim of the present study was to provide French normative data for 112 action line drawings. The set of action pictures consisted of 71 drawings taken from Masterson and Druks (1998) and 41 additional drawings. It was standardized on six psycholinguistic variables--that is, name agreement, image agreement, image variability, visual complexity, conceptual familiarity, and age of acquisition (AoA). Naming latencies to the action pictures were collected, and a regression analysis was performed on the naming latencies, with the standardized variables, as well as with word frequency and length, taken as predictors. A reliable influence of AoA, name agreement, and image agreement on the naming latencies was observed. The findings are consistent with previous published studies in other languages. The full set of these norms may be downloaded from www.psychonomic.org/archive/.
In this study we present some semantic-lexical norms concerning 'fruit' collected from normal subjects. Modelling semantic fluency needs norms for all the given exemplars: at least for Italian language, only a few of them are available. The category 'fruit' is composed of a limited number of exemplars, and some subcategories can be singled out. On a preliminary fluency task, 84 different fruits were produced, and a further normal sample provided ratings for familiarity, prototypicality and their age of acquisition. Moreover the semantic proximity between each pair of the 32 most frequent exemplars was collected and a cluster analysis has been carried out in order to yield an empirical partition of 'fruit' into different subgroups of exemplars. Besides the main models of fluency tasks, some possible advantages offered by these norms in the study of brain-damaged subjects are discussed on the basis of real data obtained from two normal subjects. (PsycINFO Database Record (c) 2016 APA, all rights reserved)
WordGen is an easy-to-use program that uses the CELEX and Lexique lexical databases for word selection and nonword generation in Dutch, English, German, and French. Items can be generated in these four languages, specifying any combination of seven linguistic constraints: number of letters, neighborhood size, frequency, summated position-nonspecific bigram frequency, minimum position-nonspecific bigram f requency, position-specific frequency of the initial and final bigram, and orthographic relatedness. The program also has a module to calculate the respective values of these variables for items that have already been constructed, either with the program or taken from earlier studies. Stimulus queries can be entered through WordGen's graphical user interface or by means of batch files. WordGen is especially useful for (1) Dutch and German item generation, because no such stimulus-selection tool exists for these languages, (2) the generation of nonwords for all four languages, because our program has some important advantages over previous nonword generation approaches, and (3) psycholinguistic experiments on bilingualism, because the possibility of using the same tool for different languages increases the cross-linguistic comparability of the generated item lists. WordGen is free and available at http://expsy.ugent.be/wordgen.htm.
A list of gender-related and gender-neutral words for use in testing gender stereotyping and memory was created and evaluated. Words were rated by samples of undergraduates at universities located in the northeast, southeast, and south-central United States. A substantial list of masculine, feminine, and gender-neutral words was identified. These lists allow researchers to construct large lists of gender-associated words while being able to control for extraneous variables, such as word frequency and word length. In addition, the high reliability across the samples suggests that gender ratings are a fairly stable phenomenon. Applications for this list are discussed. The word lists presented in Tables 1-3 and the raw data analyzed in this article may be downloaded from www.psychonomic.org/archive/.
In this article, we present a new lexical database for French: Lexique. In addition to classical word information such as gender, number, and grammatical category, Lexique includes a series of interesting new characteristics. First, word frequencies are based on two cues: a contemporary corpus of texts and the number of Web pages containing the word. Second, the database is split into a graphemic table with all the relevant frequencies, a table structured around lemmas (particularly interesting for the study of the inflectional family), and a table about surface frequency cues. Third, Lexique is distributed under a GNU-like license, allowing people to contribute to it. Finally, a metasearch engine, Open Lexique, has been developed so that new databases can be added very easily to the existing ones. Lexique can either be downloaded or interrogated freely from http://www.lexique.org.
This article reports, for the first time, type and token frequencies of tones, onsets, codas, rimes, and syllables of Hong Kong Cantonese. The information is derived from a computerized spoken corpus, the Hong Kong Cantonese adult language corpus (HKCAC; Leung {\&} Law, 2001), consisting of more than 140,000 character-syllable units. Since the HKCAC is based on recordings of connected speech, comparisons are made with respect to the inventories of various phonological units between the HKCAC and standard descriptions of the Cantonese phonological system--in particular, Fok (1974) and Bauer and Benedict (1997). It is hoped that the frequency information presented here will become a valuable tool for future psycholinguistic and linguistic research in this language. The full set of these frequency counts may be downloaded from the Psychonomic Society Web archive at www.psychonomic.org/ archive/.
Phonotactic probability refers to the frequency with which phonological segments and sequences of phonological segments occur in words in a given language. We describe one method of estimating phonotactic probabilities based on words in American English. These estimates of phonotactic probability have been used in a number of previous studies and are now being made available to other researchers via a Web-based interface. Instructions for using the interface, as well as details regarding how the measures were derived, are provided in the present article. The Phonotactic Probability Calculator can be accessed at http://www.people.ku.edu/{\~{}}mvitevit/PhonoProbHome.html.
The present study presents normative measures for 260 line drawings of everyday objects, found in Snodgrass and Vanderwart (1980), viewed by individuals in China and the United States. Within each cultural group, name agreement, concept agreement, and familiarity measures were obtained separately for younger adults and older adults. For a subset of 57 pictures (22{\%}), there was equivalence in both name agreement and concept agreement, and for an additional subset of 29 pictures (11{\%}), there was nonequivalent name agreement but equivalent concept agreement, across all culture-by-age groups. The data indicate substantial differences across culture-by-age groups in name agreement percentages and number of distinct name responses provided. We discovered significant differences between older and younger American adults in both name agreement percentages (67 pictures, or 26{\%}) and concept agreement percentages (44 pictures, or 17{\%}). Written naming responses collected for the entire set of Snodgrass and Vanderwart pictures showed shifts in both naming and concept agreement percentages over the intervening decades: Although correlations in name agreement were strong (r = .71, p {\textless} .001) between our younger American samples and those of Snodgrass and Vanderwart, name agreement percentages have changed for a substantial proportion (33{\%}) of the 260 pictures; moreover, 63{\%} of the stimuli for which Snodgrass and Vanderwart reported concept agreement now appear to differ. We provide comprehensive comparison statistics and tests for both the present study and prior ones, finding differences across numerous item-level measures. The corpus of data suggests that substantial differences in all measures can be found across age as well as culture, so that unequivocal conclusions with respect to cross-cultural or age-related differences in cognition can be made only when appropriate stimuli are selected for studies. Data for all 260 pictures, for each of the four groups, and all supporting materials and tests are freely archived at http://agingmind.cns.uiuc.edu/Pict{\_}Norms. The full set of these norms may be downloaded from www.psychonomic.org/archive/.
Preexisting word knowledge is accessed in many cognitive tasks, and this article offers a means for indexing this knowledge so that it can be manipulated or controlled. We offer free association data for 72,000 word pairs, along with over a million entries of related data, such as forward and backward strength, number of competing associates, and printed frequency. A separate file contains the 5,019 normed words, their statistics, and thousands of independently normed rhyme, stem, and fragment cues. Other files provide n x n associative networks for more than 4,000 words and a list of idiosyncratic responses for each normed word. The database will be useful for investigators interested in cuing, priming, recognition, network theory, linguistics, and implicit testing applications. They also will be useful for evaluating the predictive value of free association probabilities as compared with other measures, such as similarity ratings and co-occurrence norms. Of several procedures for measuring preexisting strength between two words, the best remains to be determined. The norms may be downloaded from www.psychonomic.org/archive/.
The Battig and Montague (1969) category norms have been an invaluable tool for researchers in many fields, with a recent literature search revealing their use in over 1600 projects published in more than 200 different journals. Since 1969, numerous changes have occurred culturally that warrant the collection of new normative data. For instance, in the mid-1960s, the waltz was a popular dance, and undergraduates wore rubbers on their feet. To meet the need for updated norms, we report an expanded version of the Battig and Montague (1969) norms, based on responses from three different sites varying in geographical locations within the United States. The norms were expanded to include new categories (e.g., ad hoc categories) and new measures, most notably latencies for the generated responses. Analyses demonstrated high levels of geographical stability across the new sites, with lower and more variable levels of generational stability between the Battig and Montague norms and the current norms.
We present a set of stimuli representing human actions under point-light conditions, as seen from different viewpoints. The set contains 22 fairly short, well-delineated, and visually “loopable” actions. For each action, we provide movie files from five different viewpoints as well as a text file with the three spatial coordinates of the point lights, allowing researchers to construct customized versions. The full set of stimuli may be downloaded fromwww.psychonomic.org/archive/.
This paper reports on the TIGER Treebank, a corpus of currently 40,000 syntactically annotated German newspaper sentences. We describe what kind of information is encoded in the treebank and introduce the different representation formats that are used for the annotation and exploitation of the treebank. We explain the different methods used for the annotation: interactive annotation, using the tool ANNOTATE, and LFG parsing. Furthermore, we give an account of the annotation scheme used for the TIGER treebank. This scheme is an extended and improved version of the NEGRA annotation scheme and we illustrate in detail the linguistic extensions that were made concerning the annotation in the TIGER project. The main differences are concerned with coordination, verb-subcategorization, expletives as well as proper nouns. In addition, the paper also presents the query tool TIGERSearch that was developed in the project to exploit the treebank in an adequate way. We describe the query language which was designed to facilitate a simple formulation of complex queries; furthermore, we shortly introduce TIGER in, a graphical user interface for query input. The paper concludes with a summary and some directions for future work.
Theories of object recognition differ to the extent that they consider object representations as being mediated only by the shape of the object, or shape and surface details, if surface details are part of the representation. In particular, it has been suggested that color information may be helpful at recognizing objects only in very special cases, but not during basic-level object recognition in good viewing conditions. In this study, we collected normative data (naming agreement, familiarity, complexity, and imagery judgments) for Snodgrass and Vanderwart's object database of 260 black-and-white line drawings, and then compared the data to exactly the same shapes but with added gray-level texture and surface details (set 2), and color (set 3). Naming latencies were also recorded. Whereas the addition of texture and shading without color only slightly improved naming agreement scores for the objects, the addition of color information unambiguously improved naming accuracy and speeded correct response times. As shown in previous studies, the advantage provided by color was larger for objects with a diagnostic color, and structurally similar shapes, such as fruits and vegetables, but was also observed for man-made objects with and without a single diagnostic color. These observations show that basic-level 'everyday' object recognition in normal conditions is facilitated by the presence of color information, and support a 'shape + surface' model of object recognition, for which color is an integral part of the object representation. In addition, the new stimuli (sets 2 and 3) and the corresponding normative data provide valuable materials for a wide range of experimental and clinical studies of object recognition.
A set of 142 photographs of actions (taken from Fiez {\&} Tranel, 1997) was standardized in French on name agreement, image agreement, conceptual familiarity, visual complexity, imageability, age of acquisition, and duration of the depicted actions. Objective word frequency measures were provided for the infinitive modal forms of the verbs and for the cumulative frequency of the verbal forms associated with the photographs. Statistics on the variables collected for action items were provided and compared with the statistics on the same variables collected for object items. The relationships between these variables were analyzed, and certain comparisons between the current database and other similar published databases of pictures of actions are reported. Spoken and written naming latencies were also collected for the photographs of actions, and multiple regression analyses revealed that name agreement, image agreement, and age of acquisition are the major determinants of action naming speed. Finally, certain analyses were performed to compare object and action naming times. The norms and the spoken and written naming latencies corresponding to the pictures are available on the Internet (http://www.psy.univ-bpclermont.fr/{\~{}}pbonin/pbonin-eng.html) and should be of great use to researchers interested in the processing of actions.
WordNet, an electronic dictionary (or lexical database), is a valuable resource for computational and cognitive scientists. Recent work on the computing of semantic distances among nodes (synsets) in WordNet has made it possible to build a large database of semantic distances for use in selecting word pairs for psychological research. The database now contains nearly 50,000 pairs of words that have values for semantic distance, associative strength, and similarity based on co-occurrence. Semantic distance was found to correlate weakly with these other measures but to correlate more strongly with another measure of semantic relatedness, featural similarity. Hierarchical clustering analysis suggested that the knowledge structure underlying semantic distance is similar in gross form to that underlying featural similarity. In experiments in which semantic similarity ratings were used, human participants were able to discriminate semantic distance. Thus, semantic distance as derived from WordNet appears distinct from other measures of word pair relatedness and is psychologically functional. This database may be downloaded from www.psychonomic.org/archive/.
372 CLARK AND PAIVIO Shoben, 1983), and as theoretical critics questioned the proposed nature of imagery (e.g., Pylyshyn, 1973). Research on item attributes continues to become more sophisticated in a variety of ways, including considera-tion of larger numbers of properties (e.g., Paivio et al., 1989; Rubin, 1980), and serious efforts to model and simulate the effects of stimulus attributes (e.g., Ellis {\&} Lambon Ralph, 2000). Computers have played a central role in both of these developments. The multivariate and simulation studies that might ultimately contribute to the emergence of theoretical models with strong empirical foundations has increased demand for large item pools and increased numbers of properties. The need for a greater number of properties and items follows from the essentially nonexperimental nature of item attribute research. Although researchers may exper-imentally assign different types of materials to different subjects, they generally do not and in some cases cannot experimentally manipulate the property or properties of interest (although such experimental approaches have been used in some cases). Rather, the properties are gen-erally measured in some fashion, and these measures will invariably correlate with other properties that might produce spurious effects or mask the effects of the target attributes. The primary way to address this problem, as in any nonexperimental research, is to identify diverse potentially contaminating constructs, obtain reliable and valid measures, and either control them in the selection of materials or include them in statistical analyses that can accommodate correlated factors (e.g., multiple re-gression, factor analysis, structural equation modeling). In addition to serving this control function, the col-lection of a large number of properties can itself provide information useful in the conceptualization of various item attributes. We can illustrate with one controversial question—the relative importance of frequency and age of acquisition in picture naming and other semantic re-trieval tasks (Morrison {\&} Ellis, 2000). Paivio et al. (1989) obtained picture naming and imagery latencies for a moderate-sized pool of pictures and their most common labels. Information on a wide range of properties, in-cluding age of acquisition, was also obtained. Factor analysis of the results indicated that age of acquisition loaded on several different factors (e.g., familiarity, con-creteness, name length), all of which contributed to pic-ture naming latencies. One interpretation of this result is that age of acquisition is a multidimensional measure that taps a number of distinct properties of words and pictures, hence its superiority to single-component pre-dictors in multiple regression analyses. More specula-tively, one might hypothesize that people rating age of acquisition are actually making judgments of how con-crete, short, and familiar items are, and that children in fact first learn words that tend to be concrete, short, and familiar. Generalization across items provides yet another rea-son to continue the development of item norms for use in cognitive research and theorizing. No single set of norms will ever suffice, because results can depend on the par-ticular pool of items that have been included in the norms. Despite the large corpus of materials on which the Ku{\v{c}}era and Francis (1967) frequency norms are based, for exam-ple, abstract words are still probably overrepresented just because of the types of text that dominate the corpus (e.g., literary and academic materials). It is therefore im-portant to continue to develop additional norms to per-mit evaluation of the generality of findings across di-verse word pools. Another facet of the generalization issue is the possi-bility of generational or cohort differences across ex-tended periods of time. With respect to word familiarity, for example, exposure to and knowledge of particular words might differ today from ratings, like those in the PYM norms, collected during the 1960s. New norms and replication of existing properties allow researchers to de-termine the continuing validity of norms collected years and in some cases decades ago. This article reports two extensions of the PYM norms. Part 1 reports a marked expansion of the number of prop-erties available for the original 925 PYM items, and Part 2 reports an expansion of the number of items for which basic properties are available. We also provide re-sults of factor analyses for both extensions, with the analysis in Part 1 being particularly informative about interrelationships among a diverse collection of word properties.
We tabulated upper- and lowercase letter frequency using several large-scale English corpora (approximately 183 million words in total). The results indicate that the relative frequencies for upper- and lowercase letters are not equivalent. We report a letter-naming experiment in which uppercase frequency predicted response time to uppercase letters better than did lowercase frequency. Tables of case-sensitive letter and bigram frequency are provided, including common nonalphabetic characters. Because subjects are sensitive to frequency relationships among letters, we recommend that experimenters use case-sensitive counts when constructing stimuli from letters.
OBJETIVO: Este estudo comparou os resultados entre crian{\c{c}}as brasileiras e americanas quanto {\`{a}}omea{\c{c}}{\~{a}}o, familiaridade com o conceito representado e complexidade visual de um conjunto de 400 figuras M{\'{E}}TODO: Foram avaliadas 36 crian{\c{c}}as brasileiras (18 meninos) de 5 a 7 anos de idade com caracter{\'{i}}sticas semelhantes {\`{a}}s crian{\c{c}}as americanas. Os procedimentos e medidas empregados no estudo brasileiro foram os mesmos usados para a popula{\c{c}}{\~{a}}o americana permitindo compara{\c{c}}{\~{a}}o direta dos dados das duas amostras atrav{\'{e}}s de correla{\c{c}}{\~{o}}es rho de Spearman e testes t de Student. RESULTADOS: Foram observadas correla{\c{c}}{\~{o}}es positivas significativas para todas as medidas entre as amostras brasileira e americana. A an{\'{a}}lise qualitativa demonstrou que ambos os grupos deram nomes modais que diferem do proposto para 59 figuras. As crian{\c{c}}as brasileiras utilizaram nomes que diferem do proposto para 72 figuras nomeadas corretamente pelas americanas. As americanas nomearam diferentemente do nome modal 26 figuras nomeadas corretamente pelas brasileiras. CONCLUS{\~{A}}O: O conjunto de 400 figuras mostrou-se um instrumento adequado para uso em diferentes culturas. Contudo, {\'{e}} aconselh{\'{a}}vel evitar o uso de figuras que produziram inconsist{\^{e}}ncia de nomea{\c{c}}{\~{a}}o nas popula{\c{c}}{\~{o}}es brasileira e norte-americana em estudos em outras culturas com o mesmo grupo et{\'{a}}rio at{\'{e}} que normas espec{\'{i}}ficas estejam dispon{\'{i}}veis.
Homophones are words that share phonology but differ in meaning and spelling (e.g., beach, beech). This article presents the results of normative surveys that asked young and older adults to free associate to and rate the dominance of 197 homophones. Although norms exist for young adults on word familiarity and frequency for homophones, these results supplement the literature by (1) reporting the four most frequent responses to visually presented homophones for both young and older adults, and (2) reporting young and older adults' ratings of homophone dominance. Results indicated that young and older adults gave the same first response to 67{\%} of the homophones and rated homophone dominance similarly on 60{\%} of the homophone sets. These results identify a subset of homophones that are preferable for research with young and older adults because of age-related equivalence in free association and dominance ratings. These norms can be downloaded from the Psychonomic Society's Web archive, www.psychonomic.org/archive/.
Este estudo apresenta dados normativos de familiaridade para utiliza{\c{c}}{\~{a}}o enquanto base para controloe manipula{\c{c}}{\~{a}}o de substantivos comuns em Portugal. Medidas de familiaridade com o referente e como significado (Larochelle {\&} Saumier, 1993) foram recolhidas em dois momentos, num primeiro casoenglobando apenas substantivos concretos ( n =320) e num segundo momento englobando tantosubstantivos concretos como abstractos ( n =219). As normas s{\~{a}}o apresentadas para um total de 459 palavras diferentes.
Word sketches are one-page automatic, corpus-based summaries of a word's grammatical and collocational behaviour. They were first used in the production of the Macmillan English Dictionary and were presented at Euralex 2002. At that point, they only existed for English. Now, we have developed the Sketch Engine, a corpus tool which takes as input a corpus of any language and a corresponding grammar patterns and which generates word sketches for the words of that language. It also generates a thesaurus and 'sketch differences', which specify similarities and differences between near-synonyms. We briefly present a case study investigating applicability of the Sketch Engine to free word-order languages. The results show that word sketches could facilitate lexicographic work in Czech as they have for English.
A digital signal-processing (DSP) technique for rapid generation of complex auditory motion stimuli based on dynamic linear changes in interaural delay is described. In this technique, a pair of complementary discrete Fourier transforms (DFTs) for which the component spacing in one series is different than that of the other is used. The appeal of this technique is its wide applicability, since it can generate real-time motion stimuli of any velocity and starting interaural delay for complex broadband or filtered noise waveforms and nonstationary sounds such as speech, music, and other natural sounds.
We summarize five studies of our large-scale research program, in which we examined aspects of contour-based object identification and segmentation, and we report on the stimuli we used, the norms and data we collected, and the software tools we developed. The stimuli were outlines derived from the standard set of line drawings of everyday objects by Snodgrass and Vanderwart (1980). We used contour curvature as a major variable in all the studies. The total number of 1,500 participants produced very solid, normative identification rates of silhouettes and contours, straight-line versions, and fragmented versions, and quite reliable benchmark data about saliency of points and object segmentation into parts. We also developed several software tools to generate stimuli and to analyze the data in nonstandard ways. Our stimuli, norms and data, and software tools have great potential for further exploration of factors influencing contour-based object identification, and are also useful for researchers in many different disciplines (including computer vision) on a wide variety of research topics (e.g., priming, agnosia, perceptual organization, and picture naming). The full set of norms, data, and stimuli may be downloaded from www.psychonomic.org/archive/.
Verb subcategorization frequencies (verb biases) have been widely studied in psycholinguistics and play an important role in human sentence processing. Yet available resources on subcategorization frequencies suffer from limited coverage, limited ecological validity, and divergent coding criteria. Prior estimates of verb transitivity, for example, vary widely with corpus size, coverage, and coding criteria This article provides norming data for 281 verbs of interest to psycholinguistic research, sampled from a corpus of American English, along with a detailed coding manual. We examine the effect on transitivity bias of various coding decisions and methods of computing verb biases.
Equal numbers of male and female participants judged which of seven facial expressions (anger, disgust, fear, happiness, neutrality, sadness, and surprise) were displayed by a set of 336 faces, and we measured both accuracy and response times. In addition, the participants rated how well the expression was displayed (i.e., the intensity of the expression). These three measures are reported for each face. Sex of the rater did not interact with any of the three measures. However, analyses revealed that some expressions were recognized more accurately in female than in male faces. The full set of these norms may be downloaded fromwww.psychonomic.org/archive/.
Imageability ratings made on a 1-7 scale and reaction times for 3,000 monosyllabic words were obtained from 31 participants. Analyses comparing these ratings to 1,153 common words from Toglia and Battig (1978) indicate that these ratings are valid. Reliability was assessed (alpha = .95). The information obtained in this study adds to that of other normative studies and is useful to researchers interested in manipulating or controlling imageability in word recognition and memory studies. These norms can be downloaded from www.psychonomic.org/archive/.
This article presents MANULEX, a Web-accessible database that provides grade-level word frequency lists of nonlemmatized and lemmatized words (48,886 and 23,812 entries, respectively) computed from the 1.9 million words taken from 54 French elementary school readers. Word frequencies are provided for four levels: first grade (G1), second grade (G2), third to fifth grades (G3-5), and all grades (G1-5). The frequencies were computed following the methods described by Carroll, Davies, and Richman (1971) and Zeno, Ivenz, Millard, and Duvvuri (1995), with four statistics at each level (F, overall word frequency; D, index of dispersion across the selected readers; U, estimated frequency per million words; and SFI, standard frequency index). The database also provides the number of letters in the word and syntactic category information. MANULEX is intended to be a useful tool for studying language development through the selection of stimuli based on precise frequency norms. Researchers in artificial intelligence can also use it as a source of information on natural language processing to simulate written language acquisition in children. Finally, it may serve an educational purpose by providing basic vocabulary lists.
On the basis of calculations using the latest lexical database produced by Amano and Kondo (2000), the fourth edition of a Web-accessible database of characteristics of the 1,945 basic Japanese kanji was produced by including the mathematical concepts of entropy, redundancy, and symmetry and by replacing selected indexes found in previous editions (Tamaoka, Kirsner, Yanase, Miyaoka, {\&} Kawakami, 2002). The kanji database in the fourth edition introduces seven new figures for kanji characteristics: (1) printed frequency, (2) lexical productivity, (3) accumulative lexical productivity, (4) symmetry for lexical productivity, (5) entropy, (6) redundancy, and (7) numbers of meanings for On-readings and Kun-readings. The file of the fourth edition of the kanji database may be downloaded from the Psychonomic Society Web archive, http://www.psychonomics.org/archive/.
This paper presents an automatic construction of Korean WordNet from pre-existing lexical resources. We develop a set of automatic word sense disambiguation techniques to link a Korean word sense collected from a bilingual machine-readable dictionary to a single corresponding English WordNet synset. We show how individual links provided by each word sense disambiguation method can be non-linearly combined to produce a Korean WordNet from existing English WordNet for nouns.
The most frequent names in Spanish corresponding to a set of 247 pictures in the Snodgrass and Vanderwart (1980) norms were used as stimuli in a discrete free-association task. A sample of 525 Spanish-speaking participants provided the first word that came to mind for each of the verbal stimuli. Responses were organized according to frequency of production in order to prepare word-association norms for the set of stimuli.
Advances in computational linguistics and discourse processing have made it possible to automate many language- and text-processing mechanisms. We have developed a computer tool called Coh-Metrix, which analyzes texts on over 200 measures of cohesion, language, and readability. Its modules use lexicons, part-of-speech classifiers, syntactic parsers, templates, corpora, latent semantic analysis, and other components that are widely used in computational linguistics. After the user enters an English text, CohMetrix returns measures requested by the user. In addition, a facility allows the user to store the results of these analyses in data files (such as Text, Excel, and SPSS). Standard text readability formulas scale texts on difficulty by relying on word length and sentence length, whereas Coh-Metrix is sensitive to cohesion relations, world knowledge, and language and discourse characteristics.
Multimodal corpora that show humans interacting via language are now relatively easy to collect. Current tools allow one either to apply sets of time-stamped codes to the data and consider their timing and sequencing or to describe some specific linguistic structure that is present in the data, built over the top of some form of transcription. To further our understanding of human communication, the research community needs code sets with both timings and structure, designed flexibly to address the research questions at hand. The NITE XML Toolkit offers library support that software developers can call upon when writing tools for such code sets and, thus, enables richer analyses than have previously been possible. It includes data handling, a query language containing both structural and temporal constructs, components that can be used to build graphical interfaces, sample programs that demonstrate how to use the libraries, a tool for running queries, and an experimental engine that builds interfaces on the basis of declarative specifications.
The present article provides Spanish norms for name agreement, printed word frequency, word compound frequency, familiarity, imageability, visual complexity, age of acquisition, and word length (measured by syllables and phonemes) for 100 line drawings of actions taken from Druks and Masterson (2000). In addition, through a naming-time experiment carried out with a group of 54 Spanish students in a pool of 63 of these line drawings, we determined the best predictors of naming actions. In the multiple regression analysis, age of acquisition and name agreement emerged as the most important determinants of action-naming reaction time.
Timed picture naming was compared in seven languages that vary along dimensions known to affect lexical access. Analyses over items focused on factors that determine cross-language universals and cross-language disparities. With regard to universals, number of alternative names had large effects on reaction time within and across languages after target–name agreement was controlled, suggesting inhibitory effects from lexical competitors. For all the languages, word frequency and goodness of depiction had large effects, but objective picture complexity did not. Effects of word structure variables (length, syllable structure, compounding, and initial frication) varied markedly over languages. Strong cross-language correlations were found in naming latencies, frequency, and length. Other-language frequency effects were observed (e.g., Chinese frequencies predicting Spanish reaction times) even after within-language effects were controlled (e.g., Spanish frequencies predicting Spanish reaction times). These surprising cross-language correlations challenge widely held assumptions about the lexical locus of length and frequency effects, suggesting instead that they may (at least in part) reflect familiarity and accessibility at a conceptual level that is shared over languages.
Snodgrass {\&} Vanderwart (1980) standardized a set of 260 pictures in the USA for use in studies of cognitive processes that employ pictured objects as laboratory analogues of object themselves. Since then similar norms for this set were obtained in Britain, Spain, Japan and Iceland and a larger set of 400 pictures (including the original 260: Cycowicz et al., 1997) was studied in France and Brazil. The present article provides a comparison of the norms obtained in Brazil and internationally. The pattern of correlations among the Brazilian and other standardizations were equivalent to that previously observed: despite pictures being judged to be of similar familiarity and visual complexity (high positive correlations), name agreement was less correlated, possibly due to differences in the languages spoken in each country and/or in the sample size used in each study. Results confirm the adequacy of the Brazilian norms.
The majority of research on the acquisition of spoken language has focused on language production, due to difficulties in the assessment of comprehension. A primary limitation to comprehension assessment is maintaining the interest and attention of younger infants. We have developed an assessment procedure that addresses the need for an extensive performance-based measure of comprehension in the 2nd year of life. In the interest of developing an engaging approach that takes into account infants' limited attention capabilities, we designed an assessment based on touchscreen technology. This approach builds upon prior research by combining standardization and complexity with an engaging infant-friendly interface. Data suggest that the touchscreen procedure is effective in eliciting and maintaining infant attention and will yield more extensive and reliable estimates of early comprehension than do other procedures. The software to implement the assessment is available free of charge for academic purposes.
Pictures are often used as stimuli in studies of perception, language, and memory. Since performances on different sets of pictures are generally contrasted, stimulus selection requires the use of standardized material to match pictures across different variables. Unfortunately, the number of standardized pictures available for empirical research is rather limited. The aim of the present study is to provide French normative data for a new set of 299 black-and-white drawings. Alario and Ferrand (1999) were closely followed in that the pictures were standardized on six variables: name agreement, image agreement, conceptual familiarity, visual complexity, image variability, and age of acquisition. Objective frequency measures are also provided for the most common names associated with the pictures. Comparative analyses between our results and the norms obtained in other, similar studies are reported. Finally, naming latencies corresponding to the set of pictures were also collected from French native speakers, and correlational/multiple-regression analyses were performed on naming latencies. This new set of standardized pictures is available on the Internet (http://leadserv.u-bourgogne.fr/bases/pictures/) and should be of great use to researchers when they select pictorial stimuli.
Measures of icon designs rely heavily on surveys of the perceptions of population samples. Thus, measuring the extent to which changes in the structure of an icon will alter its perceived complexity can be costly and slow. An automated system capable of producing reliable estimates of perceived complexity could reduce development costs and time. Measures of icon complexity developed by Garcia, Badre, and Stasko (1994) and McDougall, Curry, and de Bruijn (1999) were correlated with six icon properties measured using Matlab (MathWorks, 2001) software, which uses image-processing techniques to measure icon properties. The six icon properties measured were icon foreground, the number of objects in an icon, the number of holes in those objects, and two calculations of icon edges and homogeneity in icon structure. The strongest correlates with human judgments of perceived icon complexity (McDougall et al., 1999) were structural variability (r(s) = .65) and edge information (r(s) = .64).
Presents the EPOS database which comprises a list of orthographic neighbors of the 7,076 head words in the Basic Vocabulary of the Italian Language list (T. De Mauro, 1991) and the Zingarelli dictionary (N. Zingarelli, 1999). Development and structure of the database are described. The use of the EPOS database to facilitate the construction of lists of words with a stated number of neighbors and in the development of experimental research studies on the reading ability of good readers and dyslexic subjects is discussed.
As language data and associatedtechnologies proliferate and as the languageresources community expands, it is becomingincreasingly difficult to locate and reuse existingresources. Are there any lexical resources forsuch-and-such a language? What tool workswith transcripts in this particular format?What is a good format to use for linguisticdata of this type? Questions like these dominate manymailing lists, since web search engines are anunreliable way to find language resources. Thispaper reports on a new digital infrastructurefor discovering language resources beingdeveloped by the Open Language Archives Community(OLAC). At the core of OLAC is its metadataformat, which is designed to facilitatedescription and discovery of all kinds oflanguage resources, including data, tools, oradvice. The paper describes OLAC metadata, itsrelationship to Dublin Core metadata, and itsdissemination using the metadata harvesting protocol of the Open Archives Initiative.
Factors affecting word retrieval were compared in a timed picture-naming paradigm for 520 drawings of objects. In prior timed and untimed studies by Snodgrass
A computational analysis of a large British English database was performed and frequencies occurrence of grapheme-phoneme correspondences were obtained. A computer program was implemented, which used these frequencies to predict the probabilities of all possible pronunciations of any given string of graphemes. These results led to a proposal for a quantitative method of measurement of the orthographic depth of different languages.
An adult language corpus of spoken Hong Kong Cantonese (HKCAC) has recently been developed consisting of spontaneous speech recorded from phone-in programs and forums on the radio in Hong Kong. The database represents the speech of a total of sixty-nine speakers in addition to the program hosts, and has approximately 170, 000 characters. It is believed that HKCAC will be of great value to linguists who are interested in studying Cantonese, and speech therapists and educators who work with the Cantonese speaking population. A search engine with a user-friendly interface has also been developed by using FileMaker Pro 4.0 (Chinese version). Apart from the basic frequency information and the display of search results in KWAL (Key Word And Line) format, the search engine also allows users to search for various phonetic realizations of a particular character or the set of characters associated with a particular syllable. The content and structure of the corpus, and the overall architecture as well as the technical aspects of the search engine are described. Search procedures are illustrated with examples. The paper ends with a discussion of the future development of HKCAC. {\textcopyright} 2001 John Benjamins Publishing Company.
In 1981, the Japanese government published a list of the 1,945 basic Japanese kanji (Jooyoo Kanji-hyo), including specifications of pronunciation. This list was established as the standard for kanji usage in print. The database for 1,945 basic Japanese kanji provides 30 cells that explain in detail the various characteristics of kanji. Means, standard deviations, distributions, and information related to previous research concerning these kanji are provided in this paper. The database is saved as a Microsoft Excel 2000 file for Windows. This kanji database is accessible on the Web site of the Oxford Text Archive, Oxford University (http://ota.ahds.ac.uk). Using this database, researchers and educators will be able to conduct planned experiments and organize classroom instruction on the basis of the known characteristics of selected kanji.
Word difficulty varies from language to language; therefore, normative data of verbal stimuli cannot be imported directly from another language. We present mean identification thresholds for the 260 screen-fragmented words corresponding to the total set of Snodgrass and Vanderwart (1980) pictures. Individual words were fragmented in eight levels using Turbo Pascal, and the resulting program was implemented on a PC microcomputer. The words were presented individually to a group of 40 Spanish observers, using a controlled time procedure. An unspecific learning effect was found showing that performance improved due to practice with the task. Finally, of the 11 psycholinguistic variables that previous researchers have shown to affect word identification, only imagery accounted for a significant amount of variance in the threshold values.
Several studies on auditory word recognition indicate that word processing is influenced by phonological similarity with other words. We describe a lexical database, VoColex, which provides several statistical indexes of phonological similarity between French words. Phonological similarity is computed according to two distinct principles. According to the first principle, phonologically similar words share initial phonemes with the target word. According to the second principle, phonological neighbours correspond to any words which can be derived from the target by a single phoneme change (substitution, addition, or deletion) whatever the position of the modified phoneme. The statistical data provided by VoCoLex allow the control and the empirical manipulation of various measures of phonological similarity, as well as quantitative descriptions of the auditory lexicon.
The present study describes normative measures for 626 Italian simple nouns. The database (LEXVAR.XLS) is freely available for down-loading on the Web site http://wwwistc.ip.rm.cnr.it/materia/database/. For each of the 626 nouns, values for the following variables are reported: age of acquisition, familiarity, imageability, concreteness, adult written frequency, child written frequency, adult spoken frequency, number of orthographic neighbors, mean bigram frequency, length in syllables, and length in letters. A classification of lexical stress and of the type of word-initial phoneme is also provided. The intercorrelations among the variables, a factor analysis, and the effects of variables and of the extracted factors on word naming are reported. Naming latencies were affected primarily by a factor including word length and neighborhood size and by a word frequency factor. Neither a semantic factor including imageability, concreteness, and age of acquisition nor a factor defined by mean bigram frequency had significant effects on pronunciation times. These results hold for a language with shallow orthography, like Italian, for which lexical nonsemantic properties have been shown to affect reading aloud. These norms are useful in a variety of research areas involving the manipulation and control of stimulus attributes.
We describe how to build a largecomprehensive, integrated Arabic lexicon byautomatic parsing of newspaper text. We havebuilt a parser system to read Arabic newspaperarticles, isolate the tokens from them, findthe part of speech, and the features for eachtoken. To achieve this goal we designed a setof algorithms, we generated several sets ofrules, and we developed a set of techniques,and a set of components to carry out thesetechniques. As each sentence is processed, newwords and features are added to the lexicon, sothat it grows continuously as the system runs.To test the system we have used 100 articles(80,444 words) from the Al-Raya newspaper.The system consists of several modules: thetokenizer module to isolate the tokens, the type findersystem to find the part of speech of eachtoken, the proper noun phrase parser module tomark the proper nouns and to discover someinformation about them and the feature findermodule to find the features of the words.
We collected number-of-translation norms on 562 Dutch-English translation pairs from several previous studies of cross-language processing. Participants were highly proficient Dutch-English bilinguals. Form and semantic similarity ratings were collected on the 1,003 possible translation pairs. Approximately 40{\%} of the translations were rated as being similar across languages with respect to spelling/sound (i.e., they were cognates). Approximately 45{\%} of the translations were rated as being highly semantically similar across languages. At least 25{\%} of the words in each direction of translation had more than one translation. The form similarity ratings were found to be highly reliable even when obtained with different bilinguals and modified rating procedures. Number of translations and meaning factors significantly predicted the semantic similarity of translation pairs. In future research, these norms may be used to determine the number of translations of words to control for or study this factor. These norms are available at http://www.talkbank.org/norms/tokowicz/.
Sentence completion norms are a valuable resource for researchers interested in studying the effects of context on word recognition processes. Norms for 112 Spanish sentences were compiled with the use of experimental software accessed over the World-Wide Web. Several measures summarizing the distribution of responses for each sentence are reported, including Schwanenflugel's (1986) multiple-production measure of sentence constraint strength, the type-token ratio, and the information-theoretic measure of redundancy. The complete set of completion norms is available at http://www.ling.ed.ac.uk/{\~{}}monica/spanish{\_}completion{\_}norms.html.
The present research investigated Internet search engines as a rapid, cost-effective alternative for estimating word frequencies. Frequency estimates for 382 words were obtained and compared across four methods: (1) Internet search engines, (2) the Kucera and Francis (1967) analysis of a traditional linguistic corpus, (3) the CELEX English linguistic database (Baayen, Piepenbrock, {\&} Gulikers, 1995), and (4) participant ratings of familiarity. The results showed that Internet search engines produced frequency estimates that were highly consistent with those reported by Kucera and Francis and those calculated from CELEX, highly consistent across search engines, and very reliable over a 6-month period of time. Additional results suggested that Internet search engines are an excellent option when traditional word frequency analyses do not contain the necessary data (e.g., estimates for forenames and slang). In contrast, participants' familiarity judgments did not correspond well with the more objective estimates of word frequency. Researchers are advised to use search engines with large databases (e.g., AltaVista) to ensure the greatest representativeness of the frequency estimates.
This paper presents an analysis of the distribution of phonological similarity relations among monosyllabic spoken words in English. It differs from classical analyses of phonological neighborhood density (e.g., Luce {\&} Pisoni, 1998) by assuming that not all phonological neighbors are equal. Rather, it is assumed that the phonological lexicon has psycholinguistic structure. Accordingly, in addition to considering the number of phonological neighbors for any given word, it becomes important to consider the nature of these neighbors. If one type of neighbor is more dominant, neighborhood density effects may reflect levels of segmental representation other than the phoneme, particularly prior to literacy. Statistical analyses of the nature of phonological neighborhoods in terms of rime neighbors (e.g., hat/cat), consonant neighbors (e.g., hat/hit), and lead neighbors (e.g., hat/ham) were thus performed for all monosyllabic words in the Celex corpus (4,086 words). Our results show that most phonological neighbors are rime neighbors (e.g., hat/cat) in English. Similar patterns were found when a corpus of words for which age-of-acquisition ratings were available was analyzed. The resultant database can be used as a tool for controlling and selecting stimuli when the role of lexical neighborhoods in phonological development and speech processing is examined.
Much of the power of neural network modeling for language use and acquisition derives from a reliance on statistical regularities implicit in the phonological properties of words. Researchers have devised several methods for representing the phonology of words, but these methods are often either unable to represent realistically sized lexicons or inadequate in the ways they represent individual words. In this paper, we present a new phonological pattern generator (PatPho) that allows connectionist modelers to derive accurate phonological representations of the English lexicon. PatPho not only generates phonological patterns that can scale up to realistically sized lexicons, but also accurately and parsimoniously captures the similarity structures of the phonology of monosyllabic and multisyllabic words.
We investigated the cognitive structure of emotions in Indonesia and The Netherlands in a series of three studies. Sets of 120 emotion terms were selected based on local ratings of prototypicality for ‘‘emotion''. With similarity sortings a threedimensional (evaluation, arousal, dominance) and a four-cluster (positive emotion, sadness, fear, anger) structure was found in each group. Of 50 pairs of translationequivalent terms, 42 pairs were also found to be cognitively equivalent. With these equivalent terms a good fit of a common cognitive emotion structure was demonstrated in both countries. In a fourth and final study, the location of two social emotions, ‘‘shame'' and ‘‘guilt'', in the common structure was found to be closer to ‘‘fear'' and somewhat further away from ‘‘anger'' in Indonesia than in the Netherlands.
Several studies on auditory word recognition indicate that word processing is influenced by phonological similarity with other words. We describe a lexical database, VoColex, which provides several statistical indexes of phonological similarity between French words. Phonological similarity is computed according to two distinct principles. According to the first principle, phonologically similar words share initial phonemes with the target word. According to the second principle, phonological neighbours correspond to any words which can be derived from the target by a single phoneme change (substitution, addition, or deletion) whatever the position of the modified phoneme. The statistical data provided by VoCoLex allow the control and the empirical manipulation of various measures of phonological similarity, as well as quantitative descriptions of the auditory lexicon. (PsycINFO Database Record (c) 2016 APA, all rights reserved)
This paper presents two database methods for crosslinguistic data collection and comparison: autotypologizing and exemplar-based sampling. Autotypologizing dispenses with a priori defined comparative grids and instead lets structural types emerge inductively through a type list that is constantly updated in response to languages entered in a database. Examplar-based sampling allows identification of a single representative of cross-linguistically heterogeneous structural domains such as case. These two methods are helpful tools in fieldwork. Autotypologizing generates inventories of known types. These inventories update researchers' expectance range for newly encountered types (like published typological surveys, but more dynamically). Examplar-based sampling is useful for writing typological profiles at very early stages of description.
Word association data were obtained from two cohorts of British adults. Young adults (aged 21–30 yrs) and older adults (aged 66–81 yrs) responded to 90 words in a discrete word association task. An associative frequency measure was calculated by counting how many participants produced a particular word and then converting this number into a proportion. The degree of overlap between the cohorts in terms of dominant responses, the responses with the highest association frequencies, was moderate. Dominant responses were common to the two cohorts for only 36 of the 90 items. When the top three responses were considered the degree of overlap increased to approximately 60{\%}. Four measures of response heterogeneity were calculated for each stimulus item. Comparison of the responses of the younger and older adults indicates that there was less response heterogeneity amongst the older cohort. These norms should be of use to investigators interested in developmental changes in the structure of semantic memory across the adult lifespan as well as to researchers interested in comparing results from neurologically impaired older adults to a normative sample from the same age cohort.
The present article provides normative measures for 400 pictured objects (Cycowicz et al., 1997) viewed by Portuguese speaking Brazilian University students and 5-7 year-old children. Name agreement, familiarity and visual complexity ratings were obtained. These variables have been shown to be important for the selection of adequate stimuli for cognitive studies. Children's name agreement was lower than that of adults. The children also failed to provide adequate modal names for 103 concepts, rated drawings as less familiar and less complex, and chose shorter names for pictures. The differences in ratings between adults and children were higher than those observed in the literature employing smaller picture sets. The pattern of correlations among measures observed in the present study was consistent with previous reports, supporting the usefulness of the 400 picture set as a tool for cognitive research in different cultures and ages.
Subjective frequency estimates for large sample of monosyllabic English words were collected from 574 young adults (undergraduate students) and from a separate group of 1,590 adults of varying ages and educational backgrounds. Estimates from the latter group were collected via the internet. In addition, 90 healthy older adults provided estimates for a random sample of 480 of these words. All groups rated words with respect to the estimated frequency of encounters of each word on a 7-point scale, ranging from never encountered to encountered several times a day. The young and older groups also rated each word with respect to the frequency of encounters in different perceptual domains (e.g., reading, hearing, writing, or speaking). The results of regression analyses indicated that objective log frequency and meaningfulness accounted for most of the variance in subjective frequency estimates, whereas neighborhood size accounted for the least amount of variance in the ratings. The predictive power of log frequency and meaningfulness were dependent on the level of subjective frequency estimates. Meaningfulness was a better predictor of subjective frequency for uncommon words, whereas log frequency was a better predictor of subjective frequency for common words. Our discussion focuses on the utility of subjective frequency estimates compared with other estimates of familiarity. The raw subjective frequency data for all words are available at http://www.artsci.wustl.edu/dbalota/labpub.html.
In 1980, Bloom and Fischler published a paper describing a set of sentence contexts (SCs) and their cloze probabilities (CPs). This material has subsequently been employed in numerous studies of linguistic processing. We sought to define the completion words and their CPs for Bloom and Fischler's sentences in an inner-city British population in order to establish reliable norms for subsequent studies in the U.K. One hundred and fifty incomplete SCs were presented to 73 volunteers. The CPs for each of the words used to complete the SCs were computed. We then compared the CPs from our sample with those from Bloom and Fischler. There were significant differences between CPs from each sample in 14{\%} of the SCs analyzed (p {\textless} .01). Our data suggest that studies employing SCs and CPs may require locally defined norms if the test population differs substantially from the original one. The consequences of employing SCs and CPs to study linguistic processing without normalization are discussed.
SUMMARY : A lexical database for contemporary french on internet :LEXIQUEWe pr{\'{e}}sent a new lexical database of French, named Lexique. Based on acorpus oftexts written since 1950 which contained 31 million words, Lexiqueyields 130 000 entries including the inflected forms of verbs, nouns andadjectives. Each entry provides several kinds of information includingfrequency, gender, number, phonological form, graphemic and phonemicunicity points. Several tables give additional statistics such as the frequencies ofvarious units : letters, bigrams, trigrams, phon{\`{e}}mes and syllables. Thedatabase is available for free on the Internet.Key words : word r{\'{e}}cognition, database, frequencies
Age of acquisition and imageability ratings were collected for 2,645 words, including 892 verbs and 213 function words. Words that were ambiguous as to grammatical category were disambiguated: Verbs were shown in their infinitival form, and nouns (where appropriate) were preceded by the indefinite article (such as to crack and a crack). Subjects were speakers of British English selected from a wide age range, so that differences in the responses across age groups could be compared. Within the sub- set of early acquired noun/verb homonyms, the verb forms were rated as later acquired than the nouns, and the verb homonyms of high-imageability nouns were rated as significantly less imageable than their noun counterparts. A small number of words received significantly earlier or later age of acquisition rat- ings when the 20–40 years and 50–80 years age groups were compared. These tend to comprise words that have come to be used more frequently in recent years (either through technological advances or so- cial change), or those that have fallen out of common usage. Regression analyses showed that although word length, familiarity, and concreteness make independent contributions to the age of acquisition measure, frequency and imageability are the most important predictors of rated age of acquisition.
Many cognitive psychological, computational, and neuropsychological approaches to the organisation of semantic memory have incorporated the idea that concepts are, at least partly, represented in terms of their fine-grained features. We asked 20 normal volunteers to provide properties of 64 concrete items, drawn from living and nonliving categories, by completing simple sentence stems (e.g., an owl is {\_}{\_}, has {\_}{\_}, can{\_}{\_}). At a later date, the same participants rated the same concepts for prototypicality and familiarity. The features generated were classified as to type of knowledge (sensory, functional, or encyclopaedic), and also quantified with regard to both dominance (the number of participants specifying that property for that concept) and distinctiveness (the proportion of exemplars within a conceptual category of which that feature was considered characteristic). The results demonstrate that rated prototypicality is related to both the familiarity of the concept and its distance from the average of the exemplars within the same category (the category centroid). The feature database was also used to replicate, resolve, and extend a variety of previous observations on the structure of semantic representations. Specifically, the results of our analyses (1) resolve two conflicting claims regarding the relative ratio of sensory to other kinds of attributes in living vs. nonliving concepts; (2) offer new information regarding the types of features-across different domains-that distinguish concepts from their category coordinates; and (3) corroborate some previous claims of higher intercorrelations between features of living things than those of artefacts.
The International Affective Picture System (IAPS; Center for the Study of Emotion and Attention [CSEA], 1995) is a set of pictures that is widely used in experimental research on emotion and attention. In this study, the normative ratings of a subset of the IAPS were compared with the ratings from a Flemish sample. Eighty Flemish first-year psychology students from the Ghent University (Belgium) rated valence, dominance and arousal for a stratified sample of 60 IAPS pictures. Reliability coefficients indicate that the self-report ratings are internally consistent. Four findings converge upon the idea that the ratings in the Flemish sample are similar to the normative ratings. First, the affective ratings of the pictures in our sample correlated strongly with the North American ratings: .95, .84 and .87, respectively for valence, arousal and dominance. Second, mean valence and arousal ratings of the 60 pictures did not significantly differ between the Flemish and the North American sample. Third, plotting of the valence and arousal ratings in a two-dimensional figure results in a similar boomerang shaped distribution as the North American affective ratings. And fourth, as predicted, this distribution of the valence and arousal ratings shows the same asymmetry between positive and negative pictures as in North American samples.
The present study provides Italian normative measures for 266 line drawings belonging to the new set of pictures developed by Lotto, Dell'Acqua, and Job (in press). The pictures have been standardized on the following measures: number of letters, number of syllables, name frequency, within-category typicality, familiarity, age of acquisition, name agreement, and naming time. In addition to providing the measures, the present study focuses on indirect and direct comparisons (i.e., correlations) of the present norms with databases provided by comparable studies in Italian (in which normative data were collected with Snodgrass {\&} Vanderwart's set of pictures; Nisi, Longoni, {\&} Snodgrass, 2000), in British English (Barry, Morrison, {\&} Ellis, 1997), in American English (Snodgrass {\&} Vanderwart, 1980; Snodgrass {\&} Yuditsky, 1996), in French (Alario {\&} Ferrand, 1999), and in Spanish (Sanfeliu {\&} Fernandez, 1996)
When researchers are interested in the influence of long-term knowledge on performance, printed word frequency is typically the variable of choice. Despite this preference, we know little about what frequency norms measure. They ostensibly index how often and how recently words are experienced, but words appear in context, so frequency potentially reflects an influence of connections with other words. This paper presents the results of a large free association study as well as the results of experiments designed to evaluate the hypothesis that common words have stronger connections to other words. The norms indicate that common words tend to be more concrete but they do not appear to have more associates, stronger associates, or more connections among their associates. Two extralist cued recall experiments showed that, with other attributes being equal, high- and low-frequency words were equally effective as test cues. These results suggest that frequency does not achieve its effects because of stronger or greater numbers of connections to other words, as implied in SAM. Other results indicated that common words have more connections from other words, including their associates, and that free association provides a valid index of associative strength.
This paper describes a Japanese logographic character (kanji) frequency list, which is based on an analysis of the largest recently available corpus of Japanese words and characters. This corpus comprised a full year of morning and evening editions of a major newspaper, containing more than 23 million kanji characters and more than 4,000 different kanji characters. This paper lists the 3,000 most frequent kanji characters, as well as an analysis of kanji usage and correlations between the present list and previous Japanese frequency lists. The authors believe that the present list will help researchers more accurately and efficiently control the selection of kanji characters in cognitive science research and interpret related psycholinguistic data.
This paper presents Icelandic norms for the widely used pictorial stimuli of Snodgrass and Vanderwart (1980). Norms are presented for name agreement, familiarity, imageability, rated and objective age-of-acquisition (AoA) of vocabulary, and word frequency. The ratings were collected from 103 adult participants while the objective AoA values were collected from 279 children, 2.5-11 years of age. The present norms are in many respects similar to those already collected for other language groups indicating that the stimuli will be useful for further psychological studies in Iceland. The rated AoA values show a high correlation with objective AoA (r = 0.718) thus confirming previous studies conducted with English speaking participants that rated AoA is a relatively valid measure of objective AoA. However, word frequency and familiarity are more closely correlated with rated AoA than with objective AoA indicating that these factors play some role in the ratings. Objective AoA norms are therefore to be preferred in studies of cognitive processes.
The CL Research Senseval system wasthe highest performing system among the ``All-words''systems, with an overall fine-grained score of 61.6percent for precision and 60.5 percent for recall on98 percent of the 8,448 texts on the revisedsubmission (up by almost 6 and 9 percent from thefirst). The results were achieved with an almostcomplete reliance on syntactic behavior, using (1) arobust and fast ATN-style parser producing parse treeswith annotations on nodes, (2) DIMAP dictionarycreation and maintenance software (after conversion ofthe Hector dictionary files) to hold dictionaryentries, and (3) a strategy for analyzing the parsetrees in concert with the dictionary data. Furtherconsiderable improvements are possible in the parser,exploitation of the Hector data (and representation ofdictionary entries), and the analysis strategy, stillwith syntactic and collocational data. The Sensevaldata (the dictionary entries and the corpora) providean excellent testbed for understanding the sources offailures and for evaluating changes in the CL Researchsystem.
A three-phased study was conducted in order to develop a standardized list of touch-related adjec-tives. The final list consisted of 306 words that were categorized in 440 instances according to the Le-derman and Klatzky (1987, 1990)dimensions of haptic properties (some words were classified in more than one dimension). The Kucera and Francis (1967)frequency of occurrence in written English for all words in the final list was also determined. A correlation was found between frequency of occurrence on the list and Kucera and Francis frequency. An analysis of the word dimensions and future applica-tions are discussed.
This paper presents the Hellenic National (HNC), which is the corpus of Modern Greek developed by the Institute for Language and Speech Processing (ILSP). The presentation describes all stages of the creation of the corpus: collection of the material, tagging and tokenizing, construction of the database and the online implementation which aims at rendering the corpus accessible over Internet to the research community.
The UCSD Center for Research in Language is engaged in a large international study to provide norms for picture naming (both names produced and reaction times) in seven different languages (English, German, Italian, Spanish, Chinese, Bulgarian, Hungarian) as well as a separate sample of bilinguals (Spanish-English). For all languages, norms have been obtained for 520 line drawings of common objects. For a subset of the languages, norms are being collected for another 275 line drawings of actions. Here we present an overview of the methodology, some preliminary results, and a discussion of plans for publication. A cross-language data base, organized by items, will soon be available on the CRL website, including results of the norming study itself together with available lexical information (frequency, age of acquisition, etc.) for the associated target names.
Associative word knowledge changes throughout our lives (Anderson, 1983). Thus, the organization and use of this knowledge may vary as a function of cognitive development. However, there are no associative norms that provide information about associative representation of Spanish speaking children. The aim of the present study was to obtain normative data on the associative knowledge of children ranging from 8 to 13 years of age. Thus, 58 words were presented to three groups of 100 children varying in ages (8-9, 10-11, 12-13). Participants were asked to provide the first associate to a presented word that came to mind. Results indicated that there is an increment in the percentage of associates, an increase in the number of idiosyncratic responses and a decrease in strength of the associates as the ages of the children increased from 8 to 13. Comparisons with adult normative data are also provided. Results are interpreted as supporting evidence for developmental changes in knowledge organization.
Background: The Object and Action Naming Battery (OANB) was developed by Druks and Masterson in 2000 in response to the lack of materials for investigating the difference between the availability of nouns and verbs. This battery has been extensively used in psycholinguistic and aphasia research. The battery has also proved to be a useful tool in clinical practice by speech and language therapists. Aims: Till date, there are no published aphasia assessment tools specifically developed for the use of Saudi Arabic speakers. Therefore, the present study aimed to adapt the OANB for the use of Saudi Arabic speakers. This paper describes the adaptation process. Methods {\&} Procedures: Name agreement data for the items in the OANB was collected from 30 non-brain-damaged Saudi Arabic-speaking adults. This was followed by collecting values for the psycholinguistic variables available in the original battery, which are spoken-word frequency, imageability, age of acquisition, and visual complexity. Outcomes {\&} Results: The Saudi Arabic version of the OANB consists of 50 object and 50 action pictures with high level of name agreement (100{\%} for object pictures, and at least 93{\%} for action pictures), along with the normative data for the variables of spoken-word frequency, imageability, age of acquisition, and visual complexity of the verbal labels for the object and action pictures included in the Saudi Arabic version of the battery. Conclusions: This battery makes a significant contribution to aphasia resources available in Saudi Arabia as it can be used in clinical settings at the assessment stage and for therapeutic purposes for individuals with aphasia. The battery can also be used in aphasia and psycholinguistic research with Arabic speakers.
This article presents a computerized database of words for use in experimental research in cognitive psychology and psycholinguistics. The data are based on the oral vocabulary of 200 Spanish-speaking children aged from 11.16 to 49.16 months. The database includes 15,428 Spanish words (tokens) and comprises 1,259 different words (types). It provides information about age of acquisition, orthography, grammar, semantics, and frequency.
The use of computer tools has led to major advances in the study of spoken language corpora. One area that has shown particular progress is the study of child language development. Although it is now easy to lexically tag every word in a spoken language corpus, one still has to choose between numerous ambiguous forms, especially with languages such as French or English, where more than 70{\%} of words are ambiguous. Computational linguistics can now provide a fully automatic disambiguation of lexical tags. The tool presented here (POST) can tag and disambiguate a large text in a few seconds. This tool complements systems dealing with language transcription and suggests further theoretical developments in the assessment of the status of morphosyntax in spoken language corpora. The program currently works for French and English, but it can be easily adapted for use with other languages. The analysis and computation of a corpus produced by normal French children 2-4 years of age, as well as of a sample corpus produced by French SLI children, are given as examples.
Few tools for research in proper names have been available--specifically, there is no large-scale corpus of proper names. Two corpora of proper names were constructed, one based on U.S. phone book listings, the other derived from a database of Usenet text. Name frequencies from both corpora were compared with human subjects' reaction times (RTs) to the proper names in a naming task. Regression analysis showed that the Usenet frequencies contributed to predictions of human RT, whereas phone book frequencies did not. In addition, semantic neighborhood density measures derived from the HAL corpus were compared with the subjects' RTs and found to be a better predictor of RT than was frequency in either corpus. These new corpora are freely available on line for download. Potentials for these corpora range from using the names as stimuli in experiments to using the corpus data in software applications.
Normative values on various word characteristics were obtained for abstract, concrete, and emotion words in order to facilitate research on concreteness effects and on the similarities and differences among the three word types. A sample of 78 participants rated abstract, concrete, and emotion words on concreteness, context availability, and imagery scales. Word associations were also gathered for abstract, concrete, and emotion words. The data were used to investigate similarities and differences among these three word types on word attributes, association strengths, and number of associations. These normative data can be used to further research on concreteness effects, word type effects, and word recognition for abstract, concrete, and emotion words.
This paper presents some aspects of the Silfide server, a system dedicated to the delivery of linguistic resources on the web. After presenting the main issues behind the design of such a system, we focus on the editorial choices related to the use of the Text Encoding Initiative to represent our textual documents. In particular, we focus on the accommodations we have had to carry with regards to the TEI header and address the trade-off between extensive enrichment and genericity of the primary data when one wants to precisely mark-up a given document content. As a whole, we show how essential the TEI has proven to be for a project such as ours both from a practical and conceptual point of view.
Ratings of familiarity and pronounceability were obtained for a sample of 199 names and 199 nouns. Frequency and familiarity were more closely related in the proper name pool than the word pool, although the correlation was modest in both cases. Familiarity and pronounceability were highly related for both names and nouns.
The present article provides French normative measures for 400 line drawings taken from Cycowicz, Friedman, Rothstein, and Snodgrass (1997), including the 260 line drawings that were normed by Snodgrass and Vanderwart (1980). The pictures have been standardized on the following variables: name agreement, image agreement, familiarity, visual complexity, image variability, and age of acquisition. These normative data also include word frequency values and the first verbal associate (taken from Ferrand {\&} Alario, 1998). The six variables obtained are important because of their potential effect in many fields of psychology, especially the study of cognitive processes such as visual perception, language, and memory.
We present new Spanish norms for object familiarity and rated age of acquisition for 140 pictures taken from Snodgrass and Vanderwart (1980), together with data on visual complexity, image agreement, name agreement, word length (in syllables and phonemes), and five measures of word frequency. The pictures were presented to a group of 64 Spanish subjects, and oral naming latencies were recorded. In a multiple regression analysis, age of acquisition, object familiarity, name agreement, word frequency, and word length made significant independent contributions to predicting naming latency.
Roediger and McDermott (1995) induced false recall and false recognition for words that were not presented in lists. They had subjects study 24 lists of 15 words that were associates of a common word (called the critical target or critical lure) that was not presented in the list. False recall and false recognition of the critical target occurred frequently in response to these lists. The purpose of the current work was to provide a set of normative data for the lists Roediger and McDermott used and for 12 others developed more recently. We tested false recall and false recognition for critical targets from 36 lists. Despite the fact that all lists were constructed to produce false remembering, the diversity in their effectiveness was large--60{\%} or more of subjects falsely recalled window and sleep following the appropriate lists, and false recognition for these items was greater than 80{\%}. However, the list generated from king led to 10{\%} false recall and 27{\%} false recognition. Possible reasons for these wide differences in effectiveness of the lists are discussed. These norms serve as a useful benchmark for designing experiments about false recall and false recognition in this paradigm.
Cognitive models of language processing in English are founded on norms for word properties, but their universality is now being explored across different writing scripts and subject groups. Although Chinese characters are popular for this comparative work, their salient properties remain ill defined or poorly controlled. We describe how norms for semantic and phonetic regularity in Mandarin can be calibrated on a regional basis. The rating data that we present from China, Singapore, and Taiwan also illustrate why the diversity of both oral and written forms of Chinese should be considered in future empirical work.
During the last 20 years, psycholinguistic research has identified many variables that influence reading and spelling processes. We describe a new computerized lexical database, LEXOP, which provides quantitative descriptors about the relations between orthography and phonology for French monosyllabic words. Three main classes of variables are considered: consistency of print-to-sound and sound-to-print associations, frequency of orthography-phonology correspondences, and word neighborhood characteristics.
When using verbal stimuli, researchers usually equate words on frequency of use. However, for some ambiguous words (e.g., ball as a round object or a formal dance), frequency counts fail to distinguish how often a particular meaning is used. This study evaluates the use of ratings to estimate meaning frequency. Analyses show that ratings correlate highly with word frequency counts when orthographic and meaning frequencies should converge, are not unduly influenced by semantic factors, and may provide a better measure of relative meaning dominance than the word association task does. Furthermore, the ratings allow researchers to equate or manipulate frequency of meaning use for ambiguous and unambiguous words. Ratings for 211 words are reported.
Introduction: The Affective Norms for English Words (ANEW) is being developed to provide a set of normative emotional ratings for a large number of words in the English language. The goal is to develop a set of verbal materials that have been rated in terms of pleasure, arousal, and dominance to complement the existing International Affective Picture System (IAPS, Lang, Bradley, {\&} Cuthbert, 1999) and International Affective Digitized Sounds (IADS; Bradley {\&} Lang, 1999), which are collections of picture and sound stimuli, respectively, that also include these affective ratings. The ANEW, IAPS, and IADS are being developed and distributed by NIMH Center for Emotion and Attention (CSEA) investigators Margaret Bradley and Peter Lang, in order to provide standardized materials that are available to researchers in the study of emotion and attention. The existence of these affective collections should help in comparing results across different investigations of emotion, as well as in allowing replication within and across research labs assessing basic or applied problems in the study of emotion.
INTEX is a linguistic development environment that includes large-coverage dictionaries and grammars, and parses texts of several million words in real time. INTEX has tools to create and maintain large-coverage lexical resources as well as morphological and syntactic grammars. Dictionaries and grammars are applied to texts in order to locate morphological, lexical and syntactic patterns, remove ambiguities, and tag simple and compound words. INTEX can build lemmatized concordances and indices of large texts with respect to all types of Finite State patterns. INTEX is used as a corpus processor, to analyze literary, journalistic and technical texts. I describe here the subset of tools used to perform advanced search requests on large texts.
The ESPbase provides a tool for storing symbols and icons along with information about their characteristics. Information about a wide range of symbol characteristics is included on the database to facilitate the selection of symbol sets for research and design. The database includes information about the graphical characteristics and functions of symbols. It also includes ratings of symbol concreteness, complexity, familiarity, and meaningfulness. Symbols and icons can be accessed on the basis of each of these characteristics or any combination of characteristics. This makes it easier to select symbols on the basis of usability and design requirements. It also means that symbols can be easily selected for research while controlling their characteristics on a number of dimensions.
Dissociations between noun and verb processing are not uncommon after brain injury; yet, precise psycholinguistic comparisons of nouns and verbs are hampered by the underrepresentation of verbs in published semantic word norms and by the absence of contemporary estimates for part-of-speech usage. We report herein imageability ratings and rating response times (RTs) for 1,197 words previously categorized as pure nouns, pure verbs, or words of balanced noun-verb usage on the basis of the Francis and Kucera (1982) norms. Nouns and verbs differed in rated imageability, and there was a stronger correspondence between imageability rating and RT for nouns than for verbs. For all word types, the image-rating-RT function implied that subjects employed an image generation process to assign ratings. We also report a new measure of noun-verb typicality that used the Hyperspace Analog to Language (HAL; Lund {\&} Burgess, 1996) context vectors (derived from a large sample of Usenet text) to compute the mean context distance between each word and all of the pure nouns and pure verbs. For a subset of the items, the resulting HAL noun-verb difference score was compared with part-of-speech usage in a representative sample of the Usenet corpus. It is concluded that this score can be used to estimate the extent to which a given word occurs in typical noun or verb sentence contexts in informal contemporary English discourse. The item statistics given in Appendix B will enable experimenters to select representative examples of nouns and verbs or to compare typical with atypical nouns (or verbs), while holding constant or covarying rated imageability.
We report on a project to annotate biblical texts in order to create an aligned multilingual Bible corpus for linguistic research, particularly computational linguistics, including automatically creating and evaluating translation lexicons and semantically tagged texts. The output of this project will enable researchers to take advantage of parallel translations across a wider number of languages than previously available, providing, with relatively little effort, a corpus that contains careful translations and reliable alignment at the near-sentence level. We discuss the nature of the text, our annotation process, preliminary and planned uses for the corpus, and relevant aspects of the Corpus Encoding Standard (CES) with respect to this corpus. We also present a quantitative comparison with dictionary and corpus resources for modern-day English, confirming the relevance of this corpus for research on present day language.
This book describes the main objective of EuroWordNet, which is the building of a multilingual database with lexical semantic networks or wordnets for several European languages. Each wordnet in the database represents a language-specific structure due to the unique lexicalization of concepts in languages. The concepts are inter-linked via a separate Inter-Lingual-Index, where equivalent concepts across languages should share the same index item. The flexible multilingual design of the database makes it possible to compare the lexicalizations and semantic structures, revealing answers to fundamental linguistic and philosophical questions which could never be answered before. How consistent are lexical semantic networks across languages, what are the language-specific differences of these networks, is there a language-universal ontology, how much information can be shared across languages? First attempts to answer these questions are given in the form of a set of shared or common Base Concepts that has been derived from the separate wordnets and their classification by a language-neutral top-ontology. These Base Concepts play a fundamental role in several wordnets. Nevertheless, the database may also serve many practical needs with respect to (cross-language) information retrieval, machine translation tools, language generation tools and language learning tools, which are discussed in the final chapter. The book offers an excellent introduction to the EuroWordNet project for scholars in the field and raises many issues that set the directions for further research in semantics and knowledge engineering.
This paper provides rating norms for a set of symbols and icons selected from a wide variety of sources. These ratings enable the effects of symbol characteristics on user performance to be systematically investigated. The symbol characteristics that have been quantified are considered to be of central relevance to symbol usability research and include concreteness, complexity, meaningfulness, familiarity, and semantic distance. The interrelationships between each of these dimensions is examined and the importance of using normative ratings for experimental research is discussed.
We discuss ways in which EuroWordNet (EWN) can be used in multilingual information retrieval activities, focusing on two approaches to Cross-Language Text Retrieval that use the EWN database as a large-scale multilingual semantic resource. The first approach indexes documents and queries in terms of the EuroWordNet Inter-Lingual-Index, thus turning term weighting and query/document matching into language-independent tasks. The second describes how the information in the EWN database could be integrated with a corpus-based technique, thus allowing retrieval of domain-specific terms that may not be present in our multilingual database. Our objective is to show the potential of EuroWordNet as a promising alternative to existing approaches to Cross-Language Text Retrieval.
This paper gives a global introduction to the aims and objectives of the EuroWordNet project, and it provides a general framework for the other papers in this volume. EuroWordNet is an EC project that develops a multilingual database with wordnets in several European languages, structured along the same lines as the Princeton WordNet. Each wordnet represents an autonomous structure of language-specific lexicalizations, which are interconnected via an Inter-Lingual-Index. The wordnets are built at different sites from existing resources, starting from a shared level of basic concepts and extended top-down. The results will be publicly available and will be tested in cross-language information retrieval applications.
This paper describes how the Euro WordNet project established a maximum level of consensus in the interpretation of relations, without loosing the possibility of encoding language-specific lexicalizations. Problematic cases arise due to the fact that each site re-used different resources and because the core vocabulary of the wordnets show complex properties. Many of these cases are discussed with respect to language internal and equivalence relations. Possible solutions are given in the form of additional criteria.
We give a brief outline of the design and contents of the English lexical database WordNet, which serves as a model for similarly conceived wordnets in several European languages. WordNet is a semantic network, in which the meanings of nouns, verbs, adjectives, and adverbs are represented in terms of their links to other (groups of) words via conceptual-semantic and lexical relations. Each part of speech is treated differently reflecting different semantic properties. We briefly discuss polysemy in WordNet, and focus on the case of meaning extensions in the verb lexicon. Finally, we outline the potential uses of WordNet not only for applications in natural language processing, but also for research in stylistic analyses in conjunction with a semantic concordance.
In this paper the linguistic design of the database under construction within the EuroWordNet project is described. This is mainly structured along the same lines as the Princeton WordNet, although some changes have been made to the WordNet overall design due to both theoretical and practical reasons. The most important reasons for such changes are the multilinguality of the EuroWordNet database and the fact that it is intended to be used in Language Engineering applications. Thus, i) some relations have been added to those identified in WordNet; ii) some labels have been identified which can be added to the relations in order to make their implications more explicit and precise; iii) some relations, already present in the WordNet design, have been modified in order to specify their role more clearly.
This paper describes two fundamental aspects in the process of building of the EuroWordNet database. In EuroWordNet we have chosen for a flexible design in which local wordnets are built relatively independently as language-specific structures, which are linked to an Inter-Lingual-Index (ILI). To ensure compatibility between the wordnets, a core set of common concepts has been defined that has to be covered by every language. Furthermore, these concepts have been classified via the ILI in terms of a Top Ontology of 63 fundamental semantic distinctions used in various semantic theories and paradigms. This paper first discusses the process leading to the definition of the set of Base Concepts, and the structure and the rationale of the Top Ontology.
The present study examines the effect of the goodness of view on the minimal exposure time required to recognize depth-rotated objects. In a previous study, Verfaillie and Boutsen (1995) derived scales of goodness of view, using a new corpus of images of depth-rotated objects. In the present experiment, a subset of this corpus (five views of 56 objects) is used to determine the recognition exposure time for each view, by increasing exposure time across successive presentations until the object is recognized. The results indicate that, for two thirds of the objects, good views are recognized more frequently and have lower recognition exposure times than bad views.
The handbook has three major parts. It begins with an introduction to the topic of corpus linguistics, intended to bring the substantial amount of corpusbased work already done in a variety of research areas to the non-specialist reader's attention. It also provides an outline description of the BNC itself. The bulk of the book however is concerned with the use of the SARA search program. This part consists of a series of detailed task descriptions which (it is hoped) will serve to teach the reader how to use SARA eVectively, and at the same time stimulate his or her interest in using the BNC. There are ten tasks, each of which introduces a new group of features of the software and of the corpus, of roughly increasing complexity. At the end of each task there are suggestions for further related work. The last part of the handbook gives a summary overview of the SARA program's commands and capabilities, intended for reference purposes, details of the main coding schemes used in the corpus, and a select bibliography
This paper discusses the design of the EuroWordNet database, in which semantic databases like WordNet1.5 for several languages are combined via a so-called inter-lingual-index. In this database, language-independent data is shared whilst language-specific properties are maintained. A special interface has been developed to compare the semantic configurations across languages and to track down differences.
Druks and Masterson [Druks J, Masterson J. An object and action naming battery with pairwise matching on various psycholinguistic characteristics. (Submitted).] produced a set of 164 object and 102 action pictures which are matched on a range of variables known to affect the availability of picture names. In the present paper we describe the development of the set of pictures and provide the verbal labels for the pictures together with their printed word frequency values and ratings for age-of-acquisition, familiarity and imageability; we also present semantic categories for the verbal labels. Finally, we give visual complexity ratings for the pictures. The materials can be used for a range of psycholinguistic experiments and also for assessment and remediation with clinical populations.
Wordfrequency is one of the strongest determiners of reaction time (RT) in word recognition tasks; it is an important theoretical and methodological variable. The Kucera and Francis (1967) word fre- quency count (derived from the 1-million-word Brown corpus) is used by most investigators concerned with the issue of word frequency. Word frequency estimates from the Brown corpus were compared with those from a 131-million-wordcorpus (the HALcorpus; conversational text gathered from Usenet) in a standard word naming task with 32 subjects. RT was predicted equally well by both corpora for high-frequency words, but the larger corpus provided better predictors for low- and medium-frequency words. Furthermore, the larger corpus provides estimates for 97,261 lexical items; the smaller corpus, for 50,406items.
Deriving representations of meaning has been a long-standing problem in cognitive psychology and psycholinguistics. The lack of a model for representing semantic and grammatical knowledge has been a handicap in attempting to model the effects of semantic constraints in human syntactic processing. A computational model of high-dimensional context space, the Hyperspace Analogue to Language (HAL), is presented with a series of simulations modelling a variety of human empirical results. HAL learns its representations from the unsupervised processing of 300 million words of conversational text. HAL's high-dimensional context space can be used to (1) provide a basic categorization of semantic and grammatical concepts, (2) model certain aspects of morphological ambiguity in verbs, and (3) provide an account of semantic context effects in syntactic processing. The authors propose that the distributed and contextually derived representations that HAL acquires provide a basis for the subconceptual knowledge that can be used in accounting for a diverse set of cognitive phenomena. ((c) 1997 APA/PsycINFO, all rights reserved)
Studies of lexical processing have relied heavily on adult ratings of word learning age or age of acquisition, which have been shown to be strongly predictive of processing speed. This study reports a set of objective norms derived in a large-scale study of British children's naming of 297 pictured objects (including 232 from the Snodgrass {\&} Vanderwart, 1980, set). In addition, data were obtained on measures of rated age of acquisition, rated frequency, imageability, object familiarity, picture-name agreement, and name agreement. We discuss the relationship between the objective measure and adult ratings of word learning age. Objective measures should be used when available, but where not, our data suggest that adult ratings provide a reliable and valid measure of real word learning age.
Researchers concerned with the development of cognitive functions are in need of standardized material that can be used with both adults and children. The present article provides normative measures for 400 line drawings viewed by 5- and 6-year-old children. The three variables obtained - name agreement, familiarity, and visual complexity - are important because of their potential effect on memory and other cognitive processes. The normative data collected in the present study indicate that young children are different from adults in both the name most frequently assigned and the number of alternative names provided. The alternative names given by the children are either coordinate names or names of objects that are visually similar to the pictured object. In addition, the failure (to name) rate is higher among young children compared to adults. Thus, we conclude that unequivocal interpretation of age-related differences in cognitive functions can be made only when age-appropriate pictorial stimuli are chosen. {\textcopyright} 1997 Academic Press.
This article is a detailed account of COMLEX Syntax, an on-line syntactic dictionary of English, developed by the Proteus Project at New York University under the auspices of the Linguistics Data Consortium. This lexicon was intended to be used for a variety of tasks in natural language processing by computer and as such has very detailed classes with a large number of syntactic features and complements for the major parts of speech and is, as far as possible, theory neutral. The dictionary was entered by hand with reference to hard copy dictionaries, an on-line concordance and native speakers‘intuition. Thus it is without prior encumbrances and can be used for both pure research and commercial purposes.
This paper proposes two new methodologies for the placement of series FACTS devices in deregulated electricity market to reduce congestion. Similar to sensitivity factor based method, the proposed methods form a priority list that reduces the solution space. The proposed methodologies are based on the use of LMP differences and congestion rent, respectively. The methods are computationally efficient, since LMPs are the by-product of a security constrained OPF and congestion rent is a function of LMP difference and power flows. The proposed methodologies are tested and validated for locating TCSC in IEEE 14-, IEEE 30- and IEEE 57-bus test systems. Results obtained with the proposed methods are compared with that of the sensitivity method and with exhaustive OPF solutions. The overall objective of FACTS device placement can be either to minimize the total congestion rent or to maximize the social welfare. Results show that the proposed methods are capable of finding the best location for TCSC installation, that suite both objectives. {\textcopyright} 2006 Elsevier B.V. All rights reserved.
In this paper, we study the problem of adding a large number of new words into a Chinese thesaurus according to their definitions in a Chinese dictionary, while minimizing the effort of hand tagging. To deal with the problem, we first make use of a kind of supervised learning technique to learn a set of defining formats for each class in the thesaurus, which tries to characterize the regularities about the definitions of the words in the class. We then use traditional techniques in Graph theory to derive a minimal subset of the new words to be added into the thesaurus, which meets the following condition: if we add the new words in the subset into the thesaurus by hand, the other new words can be added into the thesaurus automatically by matching their definitions with the defining formats of each class in the thesaurus. The method uses little, if any, language-specific or thesaurus-specific knowledge, and can be applied to the thesauri of other languages.
The rating of English words and their Welsh equivalents provided the opportunity to compare subjective ratings in two languages as well as the opportunity to compare ratings in a deep and a shallow orthography (English and Welsh, respectively). Four variables—age of acquisition (AOA), familiarity, concreteness, and imageability—were rated. AOA and imageability emerged as the two most important extralingual variables (r=.8 and .73, respectively). Although the patterns of ratings were generally consistent within and between languages, some differences did emerge when these patterns were compared with those from other studies. Using similar instructions to rate familiarity and AOA resulted in a low correlation in English (r=2.5) and a high correlation in Welsh (r=2.84). The mean ratings for familiarity, concreteness, and imageability were higher in Welsh than in English (5.23 vs. 3.35, 5.46 vs. 4.41, and 5.29 vs. 4.38, respectively). Both of these findings are explained in terms of differences in orthographic depth, and it is suggested that Welsh may be a more imageable language than English.
In this study, normative data for typicality and familiarity in presented that can be used for research in semantic memory experiments in Portugal. Measures of typicality (Rosch, 1975) and familiarity (Larochelle {\&} Saumier, 1993) were obtained with a sample of university students (n=195) for 16 semantic categories. Experimental evidence is also presented that supports the reliability of the normative data.
Factors that determine unprimed performance in word-stem completion were investigated. Researchers' descriptions of materials used in word-stem completion suggest that normative word frequency, number of alternative completions, and response length are believed to be important. A total of 160 students provided normative responses to each of 914 multiple-completion three-letter word stems, resulting in over 12,000 unique responses. Although the number of alternative responses made correlated well with the number possible, word frequency and length were extremely variable in their relation to response frequency across stems. Creating more precise materials for implicit memory studies appears to require consulting normative response data.
Examined whether a rating-based procedure that has already been used by other investigators can be used for derivation of typicality ratings from children. In Exp 1, 96 kindergartners (aged 4.3-6 yrs) generated typicality for items belonging to each of 4 categories. The same Ss participated in Exp 2, in which they were asked to generate attributes for the members of 4 categories for the benefit of 2 people from another planet who had no knowledge about the particular items. In Exp 3, with 120 university students (aged 19-21.3 yrs), the correspondence between adults' family resemblance and typicality ratings on the same materials was tested. Results show that the procedure cannot be reliably used for this purpose; children rated category items in terms of personal preferences rather than as a function of how representative they considered the items to be of their superordinate category. On the basis of these findings, the authors propose an alternative method based on the family resemblance scores of the category members to derive typicality ratings from young children. (PsycINFO Database Record (c) 2012 APA, all rights reserved)
A number of inconsistencies are evident in the literature examining word-neighborhood size and frequency effects. One reason for the inconsistency may be that there are no standardized materials and criteria used in the different studies. Each experimenter has devised his or her word neighborhoods using different criteria for neighborhood size and frequency. The purpose of the present study was to develop a standardized set of word neighborhoods. 800 orthographic neighborhoods were constructed with 4- and 5-letter words. The word lists were devised relative to the key elements that have been identified in the literature: (1) target-word frequency, (2) number of words in the neighborhood, (3) number of words higher in frequency than the target word, (4) number of letter positions contributing to the neighborhood, and (5) summation of the frequency of all neighbors (providing a standard metric for high- vs low-frequency neighborhoods). ((c) 1998 APA/PsycINFO, all rights reserved)
We have developed a set of naming and recognition tests for evaluating the retrieval of lexical and conceptual knowledge for actions. As a first step, normative information about 280 items was collected for the following variables: (1) the naming responses elicited by each item, (2) the degree to which the image of each item agreed with a target name, (3) the familiarity to each depicted action, and (4) the visual complexity of each item. This information was used to develop administration and scoring procedures for a standardized test of action naming. The effectiveness and reliability of these procedures were evaluated in a second experiment. In a third experiment, five tests were developed to probe the retrieval of conceptual knowledge: (1) independently of the production of a naming response, (2) in response to pictorial and nonpictorial stimuli, (3) in terms of the attributes associated with specific actions, and (4) in terms of similarities and differences between various actions.
Recent studies suggest that performance attendant on visual word perception is affected not only by feedforward inconsistency (i.e., multiple ways to pronounce a spelling) but also by feedback inconsistency (i.e., multiple ways to spell a pronunciation). In the present study, we provide a statistical analysis of these types of inconsistency for all monosyllabic English words. This database can be used as a tool for controlling, selecting, and constructing stimulus materials for psycholinguistic and neuropsychological research. Such large-scale statistical analyses are necessary devices for developing metrics of inconsistency, for generating hypotheses for psycholinguistic experiments, and for building models of word perception, speech perception, and spelling.
We present the lexical-semantic net for German "GermaNet" which integrates conceptual ontological information with lexical semantics, within and across word classes. It is compatible with the Princeton WordNet but integrates principlebased modifications on the constructional and organizational level as well as on the level of lexical and conceptual relations. GermaNet includes a new treatment of regular polysemy, artificial concepts and of particle verbs. It furthermore encodes cross-classification and basic syntactic information, constituting an interesting tool in exploring the interaction of syntax and semantics. The development of such a large scale resource is particularly important as German up to now lacks basic online tools for the semantic exploration of very large corpora.
This paper describes a computer search program based on the Medical Research Council Psycholinguistic Database of English words. The program allows words to be extracted from that database according to word length, number of syllables or phonemes, and various psycholinguistic criteria such as frequency of use, imageability, concreteness, meaning, and so forth. Thus it is possible to create, for example, lists of two-syllable words of high and low familiarity. It is also possible to examine properties of given sets of words created by the researcher. Lists of these words with or without their properties and with or without a statistical analysis of those properties may be produced. Particular spellings but not particular phonemes may be searched for.
Homographs and homophones have interesting linguistic properties that make them useful in many experiments involving language. To assist researchers in the elicitation of homophones, this paper presents a set of 93 line-drawn pictures of objects with homophonic names and a set of 108 questions with homophonic answers. Statistics are also included for each picture and question: Picture statistics include name-agreement percentages, dominance, and frequency statistics of depicted referents, and picture-naming latencies both with and without study of the picture names. For questions, statistics include answer-agreement percentages, difficulty ratings, dominance, frequency statistics, and naming latencies for 60 of the most consistently answered questions.
We present new Spanish norms for object familiarity and rated age of acquisition for 140 pictures taken from Snodgrass and Vanderwart (1980), together with data on visual complexity, image agreement, name agreement, word length (in syllables and phonemes), and five measures of word frequency. The pictures were presented to a group of 64 Spanish subjects, and oral naming latencies were recorded. In a multiple regression analysis, age of acquisition, object familiarity, name agreement, word frequency, and word length made significant independent contributions to predicting naming latency.
Recent studies suggest that performance attendant on visual word perception is affected not only bythe "traditional" feedforward inconsistency (spelling ---7phonology) but also by its feedback incon- sistency (phonology ---7spelling). The present study presents a statistical analysis of the bidirectional inconsistency for all French monosyllabic words. Weshow that French is relatively consistent from spelling to phonology but highlyinconsistent from phonology to spelling. Appendixes Band Clist prior and conditional probabilities for all inconsistent mappings and thus provide a valuable tool for con- trolling, selecting, and constructing stimulus materials for psycholinguistic and neuropsychological research. Such large-scale statistical analyses about a language's structure are crucial for develop- ing metrics of inconsistency, generating hypotheses for cross-linguistic research, and building com- putational models of reading. When
Collected normative data for 254 line drawings from the set used by J. G. Snodgrass and M. Vanderwart (see record 1981-06756-001) to be used in research with Spanish-speaking samples. 261 Spanish-speaking Ss participated in 1 of 6 tasks: name agreement, familiarity, complexity, image agreement, picture-name agreement, and image variability. Each S responded to every drawing. Results are compared to those obtained by Snodgrass and Vanderwart from English-speaking Ss. There were small but significant differences for familiarity and complexity. The English-speaking sample rated the pictures as more familiar; the Spanish Ss judged the pictures as slightly more simple. The evidence justifies the statement that normative data of cognitive stimuli cannot be taken into another language directly, because object names common in one language may not be so in another, or objects that have a specific name in one language may have a generic name in another. (PsycINFO Database Record (c) 2007 APA, all rights reserved)
Determined month-by-month norms for comprehension and production of 396 words from 8 to 16 mo, and production of 680 words from 16 to 30 mo, derived from a norming study of 1,789 children aged 8-30 mo that used the Communicative Development Inventories. The norms are available in the form of a database program, LEX, for MS-DOS-based computers.
Two databases of Spanish surface word forms are presented. Surface word forms are words considered as orthographically or phonologically specified without reference to their meaning or syntactic category. The databases are based on the productive written vocabulary of children between the ages of 6 and 10 years. Statistical and structural information is presented concerning surface word-form frequency, consonant-vowel (CV) structure, number of syllables, syllables, syllable CV structure, and subsyllabic units. LEX I was intended to aid in the study of reading processes. Entries were orthographic surface word forms; words were divided in their components following orthographic criteria. LEX II was designed for spoken language research. Accordingly, words were transcribed phonologically and phonological criteria were applied in extracting the internal units. Information about stress location was also provided. Together, LEX I and LEX II represent a useful tool for psycholinguists interested in the study of people acquiring Spanish as a first or foreign language and of Spanish-speaking populations in general
This paper concerns the Charrette Project, a multimedia electronic archive of a medieval manuscript tradition. In this paper, we argue that the computer's strengths in manipulating complex and varied resources should be an important organizing principle in the conception and construction of electronic text projects. Specifically, we describe the elements of the Charrette archive, its architecture, and its potential for scholarly research and pedagogical applications.
Examined familiarity and relatedness among homograph meanings, particularly subordinate meanings, for 110 common English words. 160 undergraduates completed a meaning collection task, and the number of meanings provided for words varied widely. In a familiarity rating task, 42 Ss rated each meaning on its familiarity. Familiarity ratings were more sensitive than meaning collection to knowledge of subordinate meanings. In a comparison task, 125 Ss made relatedness judgments for all meaning pairs within each word. Results provide a database of complex relationships among word meanings that can be used to investigate the effects of relatedness and other semantic variables. (PsycINFO Database Record (c) 2000 APA, all rights reserved)
Compared sentence completion responses across 130 adults, aged 18-56 yrs old, for 198 highly constrained sentence contexts designed to elicit the best completion in the vast majority of Subjects. For each context, completions and their respective frequency of occurrence are provided. Subjects of all ages produced highly similar terminal words. Results indicate that greater SES and higher levels of education were mildly associated with a greater probability of producing a best completion response. Although increasing age correlated with greater probability of producing a best completion, this very weak association would not preclude use of these stimuli with a wide age range. (PsycINFO Database Record (c) 2000 APA, all rights reserved)
The Century of Prose Corpus is a historical corpus of British English of the period 1680–1780. It has been designed to provide a resource for students of the language of that era. The COPC is diachronic and may be considered a unit in what will eventually become a series of corpora providing access to the whole of the English language from the oldest specimens to the present. This article describes and explains the various features of the COPC.
Two experiments (modeled after J. Deese's 1959 study) revealed remarkable levels of false recall and false recognition in a list learning paradigm. In Experiment 1, subjects studied lists of 12 words (e.g., bed, rest, awake); each list was composed of associates of 1 nonpresented word (e.g., sleep). On immediate free recall tests, the nonpresented associates were recalled 40{\%} of the time and were later recognized with high confidence. In Experiment 2, a false recall rate of 55{\%} was obtained with an expanded set of lists, and on a later recognition test, subjects produced false alarms to these items at a rate comparable to the hit rate. The act of recall enhanced later remembering of both studied and nonstudied material. The results reveal a powerful illusion of memory: People remember events that never happened. False memories—either remembering events that never happened, or remembering them quite differently from the way they happened—have recently captured the attention of both psychologists and the public at large. The primary impetus for this recent surge of interest is the increase in the number of cases in which memories of previously unrecognized abuse are reported during the course of therapy. Some researchers have argued that certain therapeutic practices can cause the creation of false memories, and therefore, the apparent "recovery" of memories during the course of therapy may actually represent the creation of memories (Lindsay {\&} Read, 1994; Loftus, 1993). Although the concept of false memories is currently enjoying an increase in publicity, it is not new; psychologists have been studying false memories in several laboratory paradigms for years. Schacter (in press) provides an historical overview of the study of memory distortions. Bartlett (1932) is usually credited with conducting the first experimental investigation of false memories; he had subjects read an Indian folktale, "The War of the Ghosts," and recall it repeatedly. Although he reported no aggregate data, but only sample protocols, his results seemed to show distortions in subjects' memories over repeated attempts to recall the story. Interestingly, Bartlett's repeated reproduction results never have been successfully replicated by later researchers (see Gauld {\&} Stephenson, 1967; Roediger, Wheeler, {\&} Rajaram, 1993); indeed, Wheeler and Roediger (1992) showed that recall of prose passages (including "The War of the Ghosts") This research was supported by Grant F49620-92-J-0437 from the Air Force Office of Scientific Research. We thank Ron Haas and Lubna Manal for aid in conducting this research. Also, we thank Endel Tulving for bringing the Deese (1959) report to our attention. The manuscript benefited from comments
A set of full-color images of objects is described for use in experiments investigating the effects of in-depth rotation on the identification of three-dimensional objects. The corpus contains up to 11 perspective views of 70 nameable objects. We also provide ratings of the "goodness" of each view, based on Thurstonian scaling of subjects' preferences in a paired-comparison experiment. An exploratory cluster analysis on the scaling solutions indicates that the amount of information available in a given view generally is the major determinant of the goodness of the view. For instance, objects with an elongated front-back axis tend to cluster together, and the front and back views of these objects, which do not reveal the object's major surfaces and features, are evaluated as the worst views.
Data from parent reports on 1,803 children--derived from a normative study of the MacArthur Communicative Development Inventories (CDIs)--are used to describe the typical course and the extent of variability in major features of communicative development between 8 and 30 months of age. The two instruments, one designed for 8-16-month-old infants, the other for 16-30-month-old toddlers, are both reliable and valid, confirming the value of parent reports that are based on contemporary behavior and a recognition format. Growth trends are described for children scoring at the 10th-, 25th-, 50th-, 75th-, and 90th-percentile levels on receptive and expressive vocabulary, actions and gestures, and a number of aspects of morphology and syntax. Extensive variability exists in the rate of lexical, gestural, and grammatical development. The wide variability across children in the time of onset and course of acquisition of these skills challenges the meaningfulness of the concept of the modal child. At the same time, moderate to high intercorrelations are found among the different skills both concurrently and predictively (across a 6-month period). Sex differences consistently favor females; however, these are very small, typically accounting for 1{\%}-2{\%} of the variance. The effects of SES and birth order are even smaller within this age range. The inventories offer objective criteria for defining typicality and exceptionality, and their cost effectiveness facilitates the aggregation of large data sets needed to address many issues of contemporary theoretical interest. The present data also offer unusually detailed information on the course of development of individual lexical, gestural, and grammatical items and features. Adaptations of the CDIs to other languages have opened new possibilities for cross-linguistic explorations of sequence, rate, and variability of communicative development.
An idiom is classically defined as a formulaic sequence whose meaning is comprised of more than the sum of its parts. For this reason, idioms pose a unique problem for models of sentence processing, as researchers must take into account how idioms vary and along what dimensions, as these factors can modulate the ease with which an idiomatic interpretation can be activated. In order to help ensure external validity and comparability across studies, idiom research benefits from the availability of publicly available resources reporting ratings from a large number of native speakers. Resources such as the one outlined in the current paper facilitate opportunities for consensus across studies on idiom processing and help to further our goals as a research community. To this end, descriptive norms were obtained for 870 American English idioms from 2,100 participants along five dimensions: familiarity, meaningfulness, literal plausibility, global decomposability, and predictability. Idiom familiarity and meaningfulness strongly correlated with one another, whereas familiarity and meaningfulness were positively correlated with both global decomposability and predictability. Correlations with previous norming studies are also discussed.
Based on a sample of 145 Flemish first year psychology students at the University of Leuven (Belgium), affective and subjective familiarity norms were obtained for 740 Dutch words. One group of students (N = 64) rated 370 nouns, and a second group (N = 81) rated 370 personality-trait words on seven-point visual analogue scales, both for the positive-negative, and familiar-unfamiliar dimensions. Test-retest and inter-rate reliability coefficients were very high for both wordsets and response-types. The mean ratings and their standard deviations are presented in the Appendix. Gender differences for specific words are tabulated, and the observed association between the affective and the familiarity ratings is discussed.
Measurements of similarity have typically been obtained through the use of rating, sorting, and perceptual confusion tasks. In the present paper, a new method for measuring similarity is described, in which subjects rearrange items so that their proximity on a computer screen is proportional to their similarity. This method provides very efficient data collection. If a display hasn objects, then, after subjects have rearranged the objects (requiring slightly more thann movements),n(n-1)/2 pairwise similarities can be recorded. As long as the constraints imposed by two-dimensional space are not too different from those intrinsic to psychological similarity, the technique appears to offer an efficient, user-friendly, and intuitive process for measuring psychological similarity.
The use of rhyme in learning/memory and cognitive studies is extensive. However, there are very few normative studies for words that rhyme. The current study rectified this problem by collecting rhyme norms for 477 words from 545 subjects. Groups of subjects were given 40 words in serial order and requested, for each word, to generate as many rhymes as possible within a 30-sec interval. The data include several rhyme measures as well as measures of other word attributes that were taken from other sources. In addition, the rhyme responses to the target words were given along with their Thomdike and Lorge (1944) and Ku{\v{c}}era and Francis (1967) normative frequencies. Finally, the data were used to investigate various relationships including the spew hypothesis and the accessibility of rhyme sets.
Over the past two decades, homographs have been used in psychological experiments aimed at testing a variety of theoretical issues concerning memory and language. Often, such research requires prior knowledge of the dominance relations among various meanings of the homographs. Previously available homograph meaning norms are limited because they are now more than 10 years old, and they have typically reported only the two most dominant meanings even though many homographs have three or more common meanings. This paper presents normative data on 120 homographs from a relatively large, heterogeneous sample of subjects (N = 100). Meaning dominance was assessed by having subjects write the first definition that came to mind for each homograph. Definition responses were grouped by similarity, and the resulting meaning categories were verified against dictionary meaning classifications. The number of distinct meanings varied from two to six for the homographs investigated, and frequency of response is reported for all definition categories. {\textcopyright} 1994 Psychonomic Society, Inc.
The subjective familiarity of 40 homophone pairs was examined. The homophones consisted of monosyllabic English words (on one reading) and male first names (on the other)—for example,art andArt. Subjects heard these homophones embedded in two kinds of lists, one with 40 unambiguous words and one with 40 unambiguous names. Ratings were made for familiarity as words and as names. These correlated significantly with the log of printed frequency (.63 for words, .53 for names). In a final task, just the homophones were presented, and the subjects were asked for a comparative rating of whether the word usage or the name usage was more familiar. This direct comparison correlated well (.91) with the difference between the ratings for the name and word familiarities, but less well (.55) with the differences between the printed frequencies of the word and name meanings. This indicates either consistent biases in the judgments or true differences between printed frequencies and subjective familiarity.
For many models oflexical ambiguity resolution, relative frequency of the different meanings of homographs (words with more than one meaning) is crucial. Although several homograph as-sociation norms have been published in the past, none has involved a large number of subjects responding to a large number of homographs, and most homograph norming studies are now at least a decade old. In Experiment 1, associations to 566 homographs were collected from an aver-age of 192 subjects per homograph. Frequency of occurrence for the three most common mean-ings is reported, along with the corresponding associates, and a measure of the overall ambiguity of each homograph. Homographs whose meanings differed in part of speech were more ambigu-ous overall than homographs whose different meanings belonged to a single grammatical class. Homographs whose pronunciation depended on meaning (heterophones) were no more ambigu-ous than nonheterophones, and word frequency was unrelated to overall ambiguity. Estimates of homograph balance across different norming studies were compared, and homographs with two meanings of approximately equal relative meaning frequency (balanced homographs) and homographs with one clearly dominant meaning (polarized homographs) were identified. In Ex-periment 2, reliability of meaning categorizations was measured for a subset of the homographs in the first experiment. Meaning categorizations were shown to be highly reliable across raters. Homographs are words that have more than one mean-ing but share the same orthography. They most often also share phonology (e.g., a dog's bark vs. a tree's bark; a fireplace poker vs. a poker game), but a few English homographs have distinct phonologies for their different meanings. For these heterophonic homographs, pronun-ciation depends on meaning; examples are "bass" (fish vs. guitar) and "wind" (gale vs. to coil). Contrary to in-tuition, homographs are not an obscure class of linguistic items. Rather, homographs could be considered impor-tant topics of study solely because of their abundance in English. Britton (1978) found that 44 {\%} of a random sam-ple of English words had more than one meaning, and that 85 {\%} of a sample of high-frequency English words
In order to develop an additional measure of availability for the nouns from Paivio, Yuille, and Madigan's (1968) list, we used a CD-ROM version of theOxford English Dictionary (OED) to obtain the number of times a word was used to define other words. This variable was added to Rubin and Friendly's (1986) set of measures for these words. In multiple regression analyses, our measure proved to be a useful predictor of free recall. These results suggest that the OED may be useful for providing additional psycholinguistic measures.
A collection of 4,741 word fragments that have a unique completion is described. All word fragments are specified by two letters (e.g.,{\_}{\_}Q{\_}{\_}U{\_}{\_}can only be completed by the word LIQUEURS). The words completing these fragments range in length from five to nine letters. The fragments are unique with respect to a pool of 146,205 words, which helps rule out the possibility that obscure words could be used as a completion to the fragments. The collection of fragments as well as the words that complete them is available in ASCII format on computer disks or in printed form.
There is a growing consensus that significant, rapid progress can be made in both text understanding and spoken language understanding by investigating those phenom- ena that occur most centrally in naturally occurring unconstrained materials and by attempting to automatically extract information about language from very large cor- pora. Such corpora are beginning to serve as important research tools for investigators in natural language processing, speech recognition, and integrated spoken language systems, as well as in theoretical linguistics. Annotated corpora promise to be valu- able for enterprises as diverse as the automatic construction of statistical models for the grammar of the written and the colloquial spoken language, the development of explicit formal theories of the differing grammars of writing and speech, the investi- gation of prosodic phenomena in speech, and the evaluation and comparison of the adequacy of parsing models.
Twenty-four judges estimated the conceptual difficulty level of 870 five-letter words, using a 5-point scale. The words were selected from three word-frequency categories (1, 5–10, and 50–3,562/million) based on the word counts provided by Ku{\v{c}}era and Francis (1967). The ratings were reliable. Tables in this paper list the means and standard deviations of the ratings for each word. Reaction time (RT) for valid word identification was tested in 20 subjects, using four sets of 50 words designed to test the effects of word frequency and word difficulty. RT was longer for more difficult words when word frequency was held constant. A word-frequency effect on RT was present when difficulty was held constant. The relationship of the results to subjective estimates of word familiarity is discussed.
Partly in order to facilitate research on the relation between some standard psychological variables, we gathered normative data on 500 proverbs sampled from theOxford Dictionary of English Proverbs (Wilson, 1970). The scales for which we gathered data are imagery, concreteness, goodness , and familiarity. These norms may be of value to researchers who wish to sample linguistic units larger than the word from a set that contains an extensive number of unfamiliar and familiar items. To illustrate the possible uses to which these data may be put, we presented a causal model of the relation between the four variables mentioned above.
This study was concerned with the impact of stimulus familiarity on young children's ability to recognize spoken words and make explicit judgments about them. In Experiment 1, 5-year-olds made age-of-acquisition (AOA) estimates for a set of words that were very similar to estimates made by older children and adults. In Experiment 2, young children's picture recognition, mispronunciation detection, and vocabulary monitoring performance all varied systematically with these AOA estimates and with a stimulus-type (intact-mispronounced) manipulation. Subjective AOA estimates (whether from children or from adults) proved to be a better predictor of performance than did two objective familiarity measures and subjective imageability. These results point to considerable metalexical knowledge on the part of young children or explicit sensitivity regarding their own vocabulary knowledge. In addition, the results lend some support to the notion that actual AOA contributes to subjective AOA estimates.
The set of names corresponding to the pictures from Snodgrass and Vanderwart (1980) were screen fragmented by means of a series of procedures implemented in Microsoft QuickBasic on a Macintosh microcomputer. Words were screen fragmented by deleting blocks of pixels from their images rather than by deleting individual letters. The screen-fragmentation procedure is particularly useful for the present set of words, in which a large proportion ofthe names of the pictures are short (fewer than five letters). The screen-fragmentation procedure can produce any number of fragmentation levels. In the present implementation, eight levels of fragmented images were produced, to correspond to the eight levels available for the Snodgrass and Vanderwart pictures.
With a view to designing a speaker-independent large vocabulary recognition system, we evaluate a vector quantization approach for speaker adaptation. Only one speaker (the reference speaker) pronounces the application vocabulary. He also pronounces a small vocabulary called the adaptation vocabulary. Each new speaker then merely pronounces the adaptation vocabulary. We have compared two adaptation methods, establishing a correspondence between the codebooks of the reference and the new speakers, on a 20-speaker database with a 104-word application vocabulary. Method I uses a transposed codebook to represent the new speaker during the recognition process, whereas Method II uses a codebook which is obtained by clustering analysis on the NS's pronunciation of the adaptation vocabulary. The adaptation vocabulary contains 136 words. Comparison of the performance of the two methods shows that a new speaker's codebook is not necessary to represent the new speaker. Consequently we have used the first method to perform tests with a 5000-word application vocabulary, and a 4-speaker database. The adaptation is still efficient (the mean improvement is about 14{\%}), even if the relative improvement is 30{\%} compared to 56{\%} obtained in the 104-word application experiment. Further experiments show that the recognition accuracy can be improved by increasing the adaptation vocabulary size and the codebook size. {\textcopyright} 1991.
Principles of lexical semantics developed in the course of building an on-line lexical database are discussed. The approach is relational rather than componential. The fundamental semantic relation is synonymy, which is required in order to define the lexicalized concepts that words can be used to express. Other semantic relations between these concepts are then described. No single set of semantic relations or organizational structure is adequate for the entire lexicon: nouns, adjectives, and verbs each have their own semantic relations and their own organization determined by the role they must play in the construction of linguistic messages. {\textcopyright} 1991.
Normative values for word characteristics were obtained from a sample of 12 college-educated, totally congenitally blind subjects on the basis oftheir ratings of 161 nouns on scales of familiar- ity, concreteness, meaningfulness, and imageability. Thedominantmodality ofimagery foreach image-evoking word and the strongest word associate for each item also were recorded. The same data were collected for a group of sighted subjects, both to provide a comparison group for the blind subjects and totest the comparability of sightedsubjects' ratings with existing norms. Rat- ings for sighted subjects correlated strongly with those norms, although the coefficients were slightly higher for ratings of concreteness and imageability than for ratings of familiarity and meaningfulness. Ratings of blind subjects correlated only slightly lower with existing norms for imagery and concreteness, but considerably lower for familiarity and meaningfulness.
CLIPON is an acronym for Concordanze della Lingua Italiana Poetica dell'Otto/Novecento. The aim of the project described here is to produce lexicons and lemmatized concordances of the literary Italian language of the nineteenth and twentieth centuries. The corpus involves groups of mainly poetic works and authors that have a common denominator as regards schools, currents, culture and chronology.
Describes BRULEX, a computer database of 36,000 French-language words developed for use in psycholinguistic research. The words are sorted according to a variety of criteria, including spelling, pronunciation, word length, number of syllables, grammatical class, frequency of use, and number of homonyms.
The Corpus dei Manoscritti Copti Letterari is a project whose original aim was to reconstruct the Coptic codices from the White Monastery in Upper Egypt. The project was later expanded to include all Coptic literature. In 1980 a new project was launched to transfer the data into machine-readable form and make the information available, in as generic a format as possible, to scholars throughout the world.
This article summarizes the activities of the Istituto di Linguistica Computazionale. We discuss the Italian Multi-functional Lexical Databases; the projects focussing on linguistic analysis and generation; corpora in the MRF, textual databases and linguistic workstations; computer-assisted humanities teaching; and the various cooperative ventures, seminars and conferences offered by the Institute.
The Opera del Vocabolario Italiano was given a mandate in 1964 to create a Historical Dictionary of the Italian Language. The main objective was to provide a tool which would give vital information on the development of the Italian language from its origins to the present day. In 1986 the Center incorporated modern computer technology into the project and this led to a series of decisions which affected the nature and the outcome of the project. This article traces the development of the project, and describes both hardware and software systems used, as well as the nature of the relational database being created and its linguistic applications.
A corpus of 576 words and orthographically legal pseudowords was rated by 150 undergraduates to obtain a subjective estimate of the number of meanings possessed by the stimuli. The information contained in this corpus may be used to supplement current, sources of word-meaning information (e.g., total number of dictionary entries). Experimental evidence is presented that supports the reliability of the normative data.
o collect normative data on familiar spaces and behaviors of college students, two recall tasks were performed. In Experiment 1, 219 undergraduates recalled spaces where they spend time. In Experiment 2,155 undergraduates recalled the kinds of activities they do in familiar spaces. The norms provided here include 40 spaces and 123 different behaviors. Gender differences and transituational behaviors are also reported.
Statistical information on a substantial corpus of representative Spanish texts is needed in order to determine the significance of data about individual authors or texts by means of comparison. This study describes the organization and analysis of a 150,000-word corpus of 30 well-known twentieth-century Spanish authors. Tables show the computational results of analyses involving sentences, segments, quotations, and word length.
Four hundred stimulus behaviors and their mean normative ratings for kindness, intelligence, goodness, and normality were presented for use in person perception and memory studies. Each of the four normative ratings was based on a separate sample of 35 to 39 undergraduate students from the University of Illinois. Rankings of the mean ratings were provided to facilitate a quick comparison of the behavior ratings along each of the four trait dimensions. In addition, a cluster analysis of the behaviors was reported, using the mean kindness, intelligence, and normality ratings as defining variables. Six clusters were distinguished: (1) behaviors that primarily conveyed kindness; (2) behaviors that primarily conveyed unkindness; (3) behaviors that conveyed an unusual amount of intelligence; (4) behaviors that conveyed general intelligence; (5) behaviors that conveyed a general lack of intelligence; and (6) behaviors that conveyed very little information about kindness or intelligence.
The Oxford English Dictionary is the standard reference work for determining the earliest known instance of the occurrence of a word (its date of entry). Partly in order to facilitate research on the relation between the date of entry and other psychological variables, we gathered normative data on 1,046 words sampled from the Oxford English Dictionary. We present data for scales that measure the imagery, concreteness, goodness, and familiarity values for words. These norms may also be of use to researchers who are not explicitly concerned with words' date of entry, but who wish to sample words from a set that contains a large number of unfamiliar as well as familiar words.
Developmental differences in name agreement, familiarity, and visual complexity in response to line drawings of common objects were obtained from children and adults. Sixty-one pictures were taken from the Peabody Picture Vocabulary Test-Revised and 259 pictures were taken from the set normed for adults by Snodgrass and Vanderwart (1980). Although there were some differences between the two sets of pictures, the present results replicated the relative independence of these three measures, which was reported by Snodgrass and Vanderwart for adults. Children and adults showed substantial agreement on the names of the pictures. Although the children's ratings were lower on all measures, the differences were trivial for most pictures. We concluded that judgments of familiarity, complexity, and the names of line drawings of common objects are based primarily on information processing accomplished prior to age 7.
Ss in 2 studies were asked to report on 32 emotional states. Ss were asked to remember instances of experiencing these states and, for each experience, to fill out a questionnaire on appraisal dimensions and action readiness modes. Appraisal patterns as well as patterns of action readiness show distinct relations to various emotional categories, or names; the contributions of both kinds of components to emotion distinction are in part independent and additive. Multiple correlations, predicting action readiness scores from appraisal scores, demonstrated significant relations between particular modes of action readiness and particular appraisal patterns. The results of these studies are interpreted as providing support for the view that emotions can be regarded both as experiences of forms of appraisal and as states of action readiness. Emotions can profitably be analyzed in terms of sets of components, in which action readiness components take an important share. (PsycINFO Database Record (c) 2012 APA, all rights reserved)
A full upper- and lowercase visual similarity matrix is presented for a standard set of com- puter characters, implemented on the Apple-Psych system. The 2,704 (52 × 52) letter pairs were rated by 12 subjects each. From the ratings, generation and veridical similarity values are derived, and they are tabled for use in research on mixed-case letter matching. In addition, the results of multidimensional scaling and cluster analyses are presented, which give complementary, simplified descriptions of the data.
The MRCmachine-usable dictionary contains 150,837 words and up 26 linguistic and psycholin- guistic attributes for each. The attributes are from sources that are publicly available but are difficult to obtain and structure into a single dictionary, Three utility programs are described that permit the selection of words defined by a set of specified attribute values and the selection of attribute values for a set of specified words. These programs permit the construction of word sets for psycholinguistic experiments that control for the attributes specified in the dictionary. The dictionary may also be of use to researchers in artificial intelligence and computer science who require psychological and linguistic descriptions of words.
Analyzed 204 literary metaphors selected from works of poetry and 264 nonliterary metaphors generated by the present authors' 10 dimensions representing ratings of comprehensibility, perceived metaphoric qualities, imagery values, familiarity, and tenor-vehicle relatedness. Analyses of the normative data indicated that (a) the mean ratings of the metaphors were reliable; (b) 634 undergraduate raters varied in their reactions to the metaphors; (c) the 10 dimensions correlated substantially with one another; and (d) literary and nonliterary metaphors showed similar patterns for the descriptive and relational statistics examined. Data indicate the need for metaphor researchers to consider multiple attributes if they are to achieve less confounded or factorial variation of theoretically motivated variables.
Cognitive psychology is finding increasing use for the word fragment completion test, in which words have to be completed from a subset of their letters (e.g., horizon from --r-z--). Researchers often try to restrict their fragments to those that can be completed with only one word, but this is difficult to do and probably never has been achieved. To help resolve this problem, a list is provided of 1,086 three- to eight-letter words, each of which is uniquely specified by a two-letter fragment, where uniqueness is defined on the basis of two sizable word collections.
This paper presents the University of Valencia's computerized word pool. This is a database that includes 16,109 Spanish words, together with 11 psychological variables for limited groups of items. The purpose behind the creation of this database was to have available a large quantity of verbal stimuli in a well-controlled system, ready for automatic selection. The description in-cludes a summary of statistics on each of the 11 psychological variables, together with a correla-tional and factor analysis of them. This statistical analysis produces results close to those ob-tained for equivalent English material.
Supplied normative data for the probability of successfully completing 192 single-solution word fragments. Normative data on the familiarity of 80 college students with the solution words were obtained, using ratings, as were estimates of word frequency from existing norms. Regression analyses were performed to predict fragment completion difficulty from familiarity, frequency, and structural characteristics of the fragments. Familiarity, whether or not first and/or last letters appeared in the fragment, and the ratio of letters to missing letters in the fragment were included in the regression equation as significant predictors of difficulty for this fragment set.
Prior probabilities of graphemes and conditional probabilities for their pronunciation as specific phonemes are given based on a corpus of 17,310 English words. Phonemes are as given in recent editions ofWebster's New Collegiate Dictionary, with minor revisions; graphemes are defined as letters or letter clusters corresponding to single phonemes. Grapheme-phoneme probabilities were derived from a revised table of frequency of occurrence of phoneme-to-grapheme correspondences generated in a study of spelling regularities (P. R. Hanna, J. S. Hanna, Hodges, {\&} Rudorf, 1966). This quantitative descriptive information provides an index of the strength of particular grapheme-phoneme associations in English. Suggestions are made for the utilization of these probabilities as estimates of spelling/sound predictability in reading research.
Examined completion responses for 40 3-letter word stems (e.g., ABO) produced by 100 undergraduates. Data include a list of the different words that were written as stem completions, their frequency of occurrence as completions, and their frequency of occurrence in English according to published norms. Analyses revealed 3 primary factors that determined overall performance on a stem-completion test: word frequency, word length, and meanings per word. Usually, however, only 1 of these factors made a significant contribution to performance. It is suggested that results can be used as a database for selecting target words for the construction of completion tests. The entire list of completion responses is appended.
Sentence-completion norms for sentences using a multiple production measure are presented. A subset of these items were taken from the Bloom and Fischler (1980) sentence-completion norms in order to compare the Cloze measure with the present multiple production measure. For both measures, the sentence constraint correlated negatively with the number of responses generated across subjects. Although the Cloze measure and the multiple production measure were highly correlated, sentence predictability was higher when the multiple production measure was used. These sentence norms provide an alternative to norms derived using the Cloze procedure.
In language-related psychological research, it is often necessary to search for sets of words with certain well-controlled phonological properties. To aid in searches of this kind, this paper presents a matrix of consonant-cluster-free, monosyllabic English words that are classified according to their phonemes. The matrix is of considerable use in the construction of experimental stimuli. Its applications are discussed.
No catalog of words currently available contains normative data for large numbers of words rated low or high in affect. A preliminary sample of 1,545 words was rated for pleasantness by 26–33 college students. Of these words, 274 were selected on the basis of their high or low ratings. These words, along with 125 others (Rubin, 1981), were then rated by additional groups of 62–76 college students on 5-point rating scales for the dimensions of pleasantness, imagery, and familiarity. The resulting mean ratings were highly correlated with the ratings obtained by other investigators using some of the same words. However, systematic differences in the ratings were found for male versus female raters. Females tended to use more extreme ratings than did males when rating words on the pleasantness scale. Also, females tended to rate words higher on the imagery and familiarity scales. Whether these sex differences in ratings represent cognitive differences between the sexes or merely differences in response style is a question that can be determined only by further research.
Presents normative data concerning the emotionality, imagery, concreteness, and meaningfulness of 580 German adjectives. Normative data are based on responses of undergraduates at German universities. (English abstract) (PsycINFO Database Record (c) 2010 APA, all rights reserved)
Describes software that provides a set of pictorial stimuli prepared for computer presentation circumventing the time-consuming nature of individually drawing pictures on the computer. Access of the stimuli from Microsoft Basic, hardware and software requirements, and availability of the software are discussed.
Although the typicality effect has been much studied in the semantic memory literature, typ-icality ratings exist for exemplars from only a very limited number of categories. This lack of ratings frequently limits the range of stimuli that can be used in investigations of the typicality effect. As an aid in stimulus construction, this paper reports typicality ratings of 893 exemplars from 93 different categories.
Developed a confusion matrix using computer-generated continuous-line uppercase letters. Data were gathered from 20 university students. The letters were presented in 1 of 5 possible positions on the circumference of an imaginary circle at 2.75 deg around the fixation point. The resulting matrix is compared with those of J. T. Townsend (see record 1971-28051-001) and G. C. Gilmore et al (1979) and with D. J. Mewhort and M. L. Dow's analysis of the data of Gilmore et al.
There exist surprisingly few normative lists of word meanings even though homographs—words having single spellings but two or more distinct meanings—are useful in studying memory and language. The meaning norms that are available all have one or more weaknesses, including: (1) the collection of free associates rather than meanings as responses to the stimulus words; (2) the collection of single rather than multiple responses to the stimulus words; (3) the inclusion of only the two most frequently occurring meaning categories, rather than all meaning categories, for the stimulus words; (4) omission of the responses typical of each meaning category; (5) inadequate randomization of the presentation order of the stimulus words; and (6) unpaced presentation of the stimulus words. We have compiled meaning norms for 90 common English words of low, medium, and high concreteness using a methodology designed to correct these weaknesses. Analysis showed that words of medium concreteness have significantly more first-response meanings than do words of either low or high concreteness, lending support to the view that concreteness is a categorical, rather than a continuous, semantic attribute.
A frequency count of more than 190,000 words of spoken English is presented. The count is based on a published corpus of spontaneous conversation (Svartvik {\&} Quirk, 1980). A brief description of the count is presented, and the correlations between spoken word frequency and a range of other word variables are reported. It is expected that the frequency count will be useful in the interpretation of certain psychological data.
Independent groups of subjects generated restricted free associations that either rhymed with the cue or were members of the semantic category designated by the cue. The data include both a listing of the responses to each cue and a cross-index of all words appearing as responses to more than one cue. These data will permit researchers to take into account the a priori associative strength between cues and targets at these two levels of processing.
Six dimensions that are relevant to our knowledge of common activities are introduced and quantified. Twelve component actions in each of 30 activities were chosen. Two of the dimensions examined were the familiarity and frequency of performance of the activities themselves. The other four dimensions—sequence, centrality, distinctiveness, and standardness—are characteristics of the actions within the activities. There was high agreement among subjects on the norms for each dimension. Correlational analyses demonstrate the possibility of choosing actions in such a way as to compare different levels of the dimensions as factors in controlled experiments. The importance of considering these dimensions in interpretations of script or schema experiments is stressed.
Reported a systematic investigation of the most popular synonym for each of 279 nouns in a test with 50 Ss drawn from a variety of college departments. A further 50 Ss performed the same task for the synonyms used in a previous rating task devised by the present authors (see PA; Vol 67:3240) for each of the 279 nouns. The preferred synonym, number of Ss giving it, and the number giving the previously used synonym are provided. The ratings from the previous task were correlated with the number of Ss giving the previously used synonym. Though the correlations were significant, they accounted for less than 15{\%} of the variance. The present data should be of value in a number of tasks investigating perception, memory, and judgment in relation to meaning, enabling control of both accessibility of a synonym and rated similarity.
Category typicality norms from 12 natural language categories are presented for kindergarten, third-grade, sixth-grade, and college students. Subjects first selected examples of familiar word concepts and rated them on a 3-point scale in terms of category typicality. Age differences in the percentage of items included as category members were found primarily for the less typical items, with inclusion rates varying as a function of both age and typicality level. The absolute level of typicality judgments increased with age, although correlations between the children's and college students' ratings were generally significant for all three children's groups, with average correlations increasing somewhat with age. It was suggested that the rating data would be useful to developmental investigators interested in children's processing of category information.
Used 2 techniques to gather data on property dominance and property goodness. Sensory properties of verbally depicted items, such as those of color or shape, were indexed for dominance (frequency of output) and typicality (perceptual goodness). 193 college students were each assigned to 1 of 4 conditions. The most dominant property response was computed for 105 nouns by 1 group. A 2nd group rated these properties for typicality, relative to a constituent property (i.e., given one's idea of yellow, how typically "yellow" were specific items?). A 3rd group produced as many properties as possible for each of 65 nouns. Dominance was computed for all 459 properties so produced. The 4th group rated these 459 properties for typicality relative to the parent noun. A multimethod-multitrait analysis indicated that both typicality and dominance were reliable and that both exhibited convergent and discriminant validity. Typicality measured relative to an ideal property exhibited greater discriminant validity from dominance than when measured relative to the parent noun. Selected uses of these norms in studies of semantic memory, metaphor judgment, and concept identification are discussed. (PsycINFO Database Copyright 1983 American Psychological Assn, all rights reserved)
A review of previous word and letter counts in addition to the applications of these counts were reported. A comprehensive count of initial and terminal letters and bigrams was compiled based on the Ku{\v{c}}era and Francis (Computational analysis of present-day American English. Providence: Brown Univ. Press, 1967) corpus of English words. The count included frequency of occurrence and versatility, or number of different words in which letters or bigrams occurred. It was shown how such counts can be used to describe "Englishness" and make predictions as to the information load of letters in words and pseudo words. {\textcopyright} 1982 Academic Press, Inc.
Four continuous word associations to each of 107 homographs were obtained from 50 male and 50 female undergraduates. Included in the word sample were 12 nonhomophonic homographs (heterophones). The data were analyzed to derive two indexes. A dominance score was defined on the basis of the frequency that a particular meaning was associated to each homograph. A stability score was a measure of the likelihood that the continuous associations were consistent with the first associate. Norms were provided for these measures. Comparison of heterophones to homophones indicates that the former are significantly more stable.
Confusion matrices were compiled for uppercase letters and for braille characters presented to observers in two ways: as raised touch stimuli and as visual stimuli that had been optically filtered of their higher spatial frequencies. These and other existing matrices were subjected to a number of analyses, including the choice model and hierarchical clustering. The strong similarity of the visual and tactile matrices from this study lends additional support to the claim that visual recognition of low-pass filtered characters, to a first approximation, can be taken as a model of tactile recognition of small two-dimensional raised patterns. Besides this, the analysis questions the widely held assumption that response bias contributes significantly to the stimulus-response contingencies in a character-recognition task.
400 undergraduates completed booklets made up of words from the Toronto Word Pool. Ss rated the words for imagery or concreteness. Imagery was defined as the ease with which a word aroused a mental image, and concreteness was defined in relation to level of abstraction. The degree to which a word was functionally a noun was estimated in a sentence generation task. The mean and standard deviation of the imagery and concreteness ratings for each item were derived, together with letter and printed frequency counts for the words and indications of sex differences in the ratings. In a follow-up study with 120 undergraduates, norms included a grammatical function code derived from dictionary definitions, a percent noun judgment, indices of statistical approximation to English, and an orthographic neighbor ratio.
First- and second-order approximations to English and orthographic neighbor ratio values are provided for Paivio, Yuille, and Madigan's (1968) 925 nouns. First- and second-order approximations to English are information theory measures of the probability of generating a word on a letter-by-letter basis. The orthographic neighbor ratio is the frequency of a word divided by the sum of the frequencies of all words that can be generated by changing one of its letters. Thus, the orthographic neighbor ratio provides a measure of a sophisticated guessing model in which partial information about a word is obtained and a decision is made on the basis of the relative frequencies of the possible responses. Correlations with existing norms are reported.
Six groups of subjects rated word pairs for the degree to which they exemplified one of six semantic relationships. The relationships that subjects were instructed to rate were antonymy, synonymity, subordination, superordination, coordination, and similarity. Stimulus pairs represented antonyms, synonyms, subordinates, superordinates, and coordinates. The pairs representing each stimulus relationship varied across four levels of typicality, ranging from good examples of the relationship to unrelated pairs. The highest rating in each group was given to the stimulus relationship corresponding to the relationship being judged (e.g., antonyms received the highest rating under antonym judgment instructions). This interaction was strongest for high-typicality pairs and decreased across the levels of typicality. Semantic decision models cannot explain these results unless the models are modified so that decisions are based on relationship similarity, the degree to which a stimulus pair exemplifies the relationship subjects are instructed to judge.
An experiment was conducted to investigate the effect of phonemic or graphemic similarity on recoding of Chinese words. The introduction of acoustic and visual similarity within the list using a retroactive interference design for memory of items showed that both intralist variables were factors causing reduction in recall, with acoustic similarity being more detrimental to short-term retention. An auditory interference task was also shown to have a more severe effect than a visual interference task. Results suggested that dual encoding processes are involved in reading and remembering Chinese.
Several methods to study the recognition and similarity of alphanumeric characters are briefly discussed and evaluated. In particular, the application of the choice-model (Luce, 1959, 1963) to recognition of letters is criticized. A feature analytic model for recognition of alphanumeric characters based on Tversky's (1977) features of similarity is proposed and tested. It is argued that the proposed model: (a) is parsimonious in that it utilizes a relatively small number of parameters, (b) is psychologically more meaningful compared with other approaches in that it is attempting to study underlying processes rather than just reveal a similarity structure, (c) yields predictions that have a high level of fit with the observed data. Possible implications from the use of the model for future research are briefly discussed.
This paper describes a computerised database of psycholinguistic information. Semantic, syntactic, phonological and orthographic information about some or all of the 98,538 words in the database is accessible, by using a specially-written and very simple programming language. Word-association data are also included in the database. Some examples are given of the use of the database for selection of stimuli to be used in psycholinguistic experimentation or linguistic research.
Ratings were obtained from 100 subjects on a seven-point scale of the degree of synonymity of 279 pairs of nouns, half the subjects rating the pairs presented in one order and half in the other. Alternative synonyms were also invited. Mean ratings ranged from 6.76 to 3.34. Forty-six pairs were identical with those presented in a similar American study by Whitten et al. (1979). While the British ratings showed some differences from the American ones, the two sets of ratings were significantly correlated over these pairs. As in the American study, order of presentation significantly affected rated similarity in some pairs, though the American and British studies were not highly consistent in this respect. No distinguishing characteristic of such pairs was apparent.
Heteronyms are words with 2 different possible pronunciations that are associated with 2 (or more) different meanings. They can be used to investigate psychological mechanisms in reading and other cognitive processes. A corpus of English heteronyms has been collected and is tabulated here. In addition, a corpus of English polyphones is tabulated. These are words with different pronunciations that are not associated with different meanings. (10 ref) (PsycINFO Database Record (c) 2009 APA, all rights reserved)
A set of word triads are presented that, as stimulus materials for free association, permit the assessment of the relative salience of two varieties of feature enhancement. There are 10 triads of each of four types that differ with respect to how the words of each triad are related. In the first type, the three words share inherent features that determine their membership in a common natural category, words of a second type are from different categories but share a sensory characteristic, the third type share both membership in a natural category and a sensory feature, and the fourth type are unrelated. The distributions of responses to these triad types are presented for a college sample as a baseline for a population of high linguistic ability. Changes in such distributions from 1st to 5th grade are then explored, and differences are shown in the associations produced by 10th- and 11th-grade high school students of high, medium, and low reading levels.
Age-of-acquisition, imagery, concreteness, familiarity, and ambiguity measures for 1,944 words of varying length and frequency of occurrence are presented. The words can all be used as nouns. Intergroup reliabilities are satisfactory on all attributes. Correlations with pre-vious word lists are significant, and the intercorrelations between measures match previous findings. METHOD Table 1 Distribution of Words by Length and Frequency in the Sample of 1,944 Drawn From Thorndike-Lorge (1944) many alternative meanings are unknown or of low salience to most subjects. However, words may be expected to vary in their degree of effective ambiguity, and it is this that we set out to measure. L;;.9 Word Length (L) L.;;5 Thorndike-Lorge Frequency Word Sample A total of 1,944 words were selected from the Thorndike-Lorge (1944) word count by means of a semirandom procedure, in such a way as to fill the cells of a 3 by 3 matrix of word length and frequency (Table I). The overall strategy was to select every 10th word that could be used as a noun. If this was not suitable, the first appropriate word within that group of 10 was selected. It was decided that, for research purposes, a fairly even distribution of words over length by frequency combinations was desirable. Due to the underrepresentation of infrequent short words and frequent long words, the latter were selected in preference to other frequency by length combina-tions to give a more even distribution. The numbers of words selected in each cell aregiven in Table I. Ratings Procedure F or the age-of-acquisition, imagery, concreteness, and famil-iarity ratings, the following procedure was used. The words were printed, in random order, 20 to a page, and alongside each word was a 7-point rating scale. The pages were then shuffled and assembled into booklets, so that each booklet contained the pages in a different random order. Due to the large number of pages, the booklets were divided into three sections, each containing approximately 33{\%} of the words. The three
Normative data were collected on 300 general-information questions from a wide variety of topics, including history, sports, art, geography, literature, and entertainment. Male and female undergraduates at two different universities made a one-word response to each question either in a response booklet or at a computer console. The reported data include the following for each question: (a) probability of recall for all 270 undergraduates, for males versus females, and for University of Washington subjects versus University of California, Irvine, subjects, (b) latency of correct recall, (c) latency of errors, and (d) feeling-of-knowing ratings for nonrecalled items. Correlations among these dependent variables, along with measures of reliability, are also reported. {\textcopyright} 1980 Academic Press, Inc.
The purpose of this research was to establish norms for the relative frequency of use of
Strings of letters that form words when read both forward and backward (e.g., “deliver” and “reviled”) are termed heteropalindromes. They may be used in both reading research and memory retrieval research. A list of English heteropalindromes that approaches comprehensiveness is reported.
Norms were collected to determine the relative dominance of different meanings of homo- graphic words. Forty-six subjects wrote down the first word that came to mind for each of 320 homographs. Each homograph, the number of times each meaning was given, and the specific associates are made available. In addition, correlations with other norms are presented.
The problem of word ambiguity has generally been overlooked in compiling lists of words measured on various attributes. In this study, rating measures were obtained on the meanings of 387 words, the ambiguity of which had been established empirically. Imagery, age-of-acquisition, concreteness, and familiarity ratings are reported for each meaning, together with an index of meaning dominance. The results suggest that the most dominant meanings tend to be the most imageable, concrete, familiar, and earliest acquired. Generally satisfactory correlations with other norms were obtained.
A list of English palindromes that approaches comprehensiveness is given. A stochastic model for the distribution of palindromes within the language is proposed. The model relates frequencies of palindromes and of heteropalindromes (Jones, 1980) as a function of word length. and it is shown to predict accurately the actual numbers of palindromes that occur
In this article we present a standardized set of 260 pictures for use in experiments investigating differences and similarities in the processing of pictures and words. The pictures are black-and-white line drawings executed according to a set of rules that provide consistency of pictorial representation. The pictures have been standardized on four variables of central relevance to memory and cognitive processing: name agreement, image agreement, familiarity, and visual complexity. The intercorrelations among the four measures were low, suggesting that they are indices of different attributes of the pictures. The concepts were selected to provide exemplars from several widely studied semantic categories. Sources of naming variance, and mean familiarity and complexity of the exemplars, differed significantly across the set of categories investigated. The potential significance of each of the normative variables to a number of semantic and episodic memory tasks is discussed.
A comprehensive count of bigram frequencies and versatilities by position was tabulated for 2-9 letter words recorded by H. Kucera and W. Francis (1967). A total of 577 bigrams were found variously distributed throughout words. Such counts should prove useful in determining the orthographic regularity of specific words. (10 ref) (PsycINFO Database Record (c) 2006 APA, all rights reserved)
Abstract Completion responses were collected for two sets of sentence contexts, which were designed to produce different distributions of probabilities for the primary responses. The subject population consisted of undergraduate college students. For each context, ...$\backslash$n
The extent to which an item is a prototypical exemplar of a category has been found to predict several experimental results (e.g.,reaction times in category classification, free and cued recall of lists, release from proactive inhibition in recall). We present prototypicality ratings for 840 words, equally distributed over 28 categories. The categories were taken from Battig and Montague's (1969) normative tables; only those categories that contained "concrete" items in common usage were employed in the study. Intragroup reliability correlations were high for all categories tested, as were the correlations for prototypicality ratings between the present study and that of Rosch (1975). In addition, correlations between prototypicality ratings, production frequencies, and word frequencies of the items are given.
Ratings of concreteness and picturability and production data for meaningfulness of 310 words were gathered from 207 6th-grade children and 265 college adults. Adults also provided ratings of imagery value. Correlations among the various stimulus attributes indicated that for adults, imagery, concreteness, and picturability were overlapping attributes. For children, however, the attributes of concreteness and picturability did not overlap as much.
Data from an experiment on recognition of tactually presented upper case letters was used to make a confusion matrix like that made by G. C. Gilmore et al (1979) with visually presented letters. The significant product-moment correlation (.88) between the 2 matrices suggests that similar processes are involved in the 2 modalities and that visual models should not rely too heavily on uniquely visual mechanisms. (6 ref) (PsycINFO Database Record (c) 2012 APA, all rights reserved)
In Exp I, 328 adjectives were presented to 324 undergraduates for rating of imagery (I), ease of definition (ED), and animateness (A). The normative value of these indices was tabulated for each adjective. A correlational analysis of these measures and Ku{\v{c}}era-Francis frequency (KF) is also presented. To demonstrate the usefulness of these rating scales, Exp II requested 90 undergraduates to free recall a list of 50 adjectives after they completed an incidental learning task of rating these adjectives for I, ED, or A. Ss recalled more high I than low I adjectives and more difficult- than easy-to-define adjectives regardless of which incidental rating task they performed. Neither the degree of animateness nor the KF value of the adjectives influenced the percentage of recall.
A comprehensive count of bigram and trigram frequencies and versatilities was tabulated for words recorded by Ku{\v{c}}era and Francis. Totals of 577 different bigrams and 6,140 different trigrams were found. Their frequencies of occurrence and the number of different words in which they appeared are reported in this article.
A confusion matrix of the whole block capital letters of the alphabet was obtained to examine the nature of tactile letter recognition. A 17 x 17 matrix of tactile stimulators was placed against the backs of 4 blind Ss. A hierarchical cluster analysis {\&} a nonmetric multidimensional scaling technique were applied to the matrix. Results of the two analyses were consistent with each other, {\&} indicated that at least three independent basic letter features - enclosing shapes, vertical parallel lines, {\&} angle of lines - play important parts in tactile letter recognition. Most confusion may be attributable to displacement of the apparent loci, omission or fusion of loci of stimulation, {\&} failure to detect gaps in the tactile letters. 1 Table, 6 Figures. HA
In lexical decision experiments, subjects have difficulty in responding NO to non-words which are pronounced exactly like English words (e.g. BRANE). This does not necessarily imply that access to a lexical entry ever occurs via a phono- logical recoding of a visually-presented word. The phonological recoding procedure might be so slow that when the letter string presented is a word, access to its lexical entry via a visual representation is always achieved before phonological recoding is completed. If prelexical phonological recodings are produced by using grapheme-phoneme correspondence rules, such recodings can only occur for words which conform to these rules (regular words), since applications of the rules to words which do not conform to the rules (exception words) produce incorrect phonological representations. In two experiments, it was found that time to achieve lexical access (as measured by YES latency in a lexical decision task) was equivalent for regular words and exception words. It was concluded that access to lexical entries in lexical decision experiments of this sort does not proceed by sometimes or always phonologically recoding visually-presented words.
Each of 464 noun pairs was rated for synonymy on a 7-point scale by 100 college students. The purpose of this study was twofold. First, it was designed to provide memory and psycholinguistic researchers with extensive synonym norms. Second, it was designed to evaluate the effects of encoding order on perceived synonymy. The hypothesis that limited semantic-memory access can cause synonym pairs to be rated as more synonymous in one word-order than in the other was tested by presenting each noun pair to 50 judges in one order and to another 50 judges in the reverse order. Mean synonym ratings (averaged across word orders) ranged from 6.79 to 2.24, thus demonstrating the need for normative data on synonyms. A significant number of noun pairs showed strong directional effects such that perceived synonymy was significantly changed by word encoding order. The practical and theoretical importance of these directional effects are discussed. {\textcopyright} 1979 Academic Press, Inc.
The positional frequency and versatility of letters were tabulated for six-, seven-, and eight-letter English words.
In order to provide a reliable measure of the similarity of uppercase English letters, a confusion matrix based on 1,200 presentations of each letter was established. To facilitate an analysis of the perceived structural characteristics, the confusion matrix was decomposed according to Luce's choice model into a symmetrical similarity matrix and a response bias vector. The underlying structure of the similarity matrix was assessed with both a hierarchical clustering and a multidimensional scaling procedure. This data is offered to investigators of visual information processing as a valuable tool for controlling not only the overall similarity of the letters in a study, but also their similarity on individual feature dimensions.
Rated imagery values are given for the Hunt and Hodge (1971) taxonomic-category names, and are shown to correlate positively with category-name m' as reported by Hunt and Hodge. The presence of a correlation is consistent with findings for other materials (Paivio, 1971). However, neither categoryname m' nor the present values correlate with imagery values from the major scale of imagery in the literature (Paivio, Yuille, {\&} Madigan, 1968), possibly because the Paivio et a1. set of items included many nontaxonomic items or the range of values obscured finer grained differences among taxonomic items.
Three hundred and seventy-five children in Grades 2, 3, 4, and 6 were asked to generate instances of 25 different categories within a time period of 1 min per category. Data were tallied so that category instances are ranked as to proportion of subjects making each response at each grade level. Indications of the average number of category instances generated by children at each grade level within each category are provided.
The priming technique was used to investigate the conditions under which a homograph's dominant and/or nondominant semantic sense will be retrieved. Subjects verified whether “A(n) A is a(n) B” when A was an ambiguous word and B was a word corresponding to either a dominant or an unusual semantic sense of word A. When word B most often corresponded to the dominant sense of word A (Experiment I), a Priming by Dominance interaction was obtained in the reaction time (RT) data; viz, the facilitatory effect of priming was greater for the dominant-sense sentences than for the unusual-sense sentences. When the word B equally often corresponded to the dominant and unusual senses of A (Experiment 2), the facilitatory effect of priming was equal for the dominant-sense and unusual-sense sentences. These results were interpreted within the framework of a two-stage model of lexical access (d. Posner {\&} Snyder, 1975; Neely, 1977). An application of this two-stage model to the now rather extensive literature on homographic processing helps clear up the apparent contradictions that have been prevalent in this literature.
Abstract A description is presented of normative data for property responses to 121 words— 17 category labels, three typical and three atypical members of each category , and the words “plant” and “animal.” The production frequency of properties is considered a ... $\backslash$n
This paper gives a brief survey of the Saarbrücken project on Old Icelandic legal texts, sponsored by the German Research Society, within the Special Research Area “Computer linguistics.” The project's main points of interest are (1) producing adequate machine-readable versions and parsed indices of all legal texts in Old Icelandic, (2) graphemic studies of legal manuscripts, and (3) studies of the distribution and valence of the verbs in those texts. A proposal for encoding Old Norse/Old Icelandic demonstrates how texts of different standards (normalized, diplomatic, graphetic) can be encoded as compatibly as possible. The description of a combined normalization-lemmatization process reveals that even little normalization in a diplomatic text will save much manual parsing.
To assist research in anagram solving, this paper presents bigram statistics for 205 5-letter words known to form single-solution anagrams. Imagery, concreteness, age-of-acquisition, familiarity, and meaningfulness values for these words have previously been published. The bigram statistics presented here include the bigram rank and GTZERO (a variable involving the number of nonzero entries in the bigram frequency matrix) measures recently devised and tested by G. A. Mendelsohn (1976). ((c) 1999 APA/PsycINFO, all rights reserved)
Forty groups of subjects were given six lists of 25 nouns each for immediate free written recall. A measure of free recall was thereby obtained for each of 900 nouns in the Paivio, Yuille, and Madigan (1968) norms, each noun's measure based on the recall of 32 subjects. First-order correlations showed recall to be correlated with imagery, concreteness, meaningfulness, Thorndike-Lorge frequency, and Ku{\v{e}}ra-Francis frequency. Partial correlations showed meaningfulness to be essentially unrelated to recall and concreteness only moderately related. In contrast to previous comparisons, which were based on smaller ranges of frequency and were more susceptible to list-specific effects, imagery and frequency were found to be approximately equal in their influence on free recall.
Conducted 7 studies to determine lower bounds for the amount of lexical ambiguity of words used in English text. Two special types of ambiguity are described and quantified, and a refined method for quantifying the ambiguity of individual lexical units is presented. Results of the studies indicate that at least 32{\%} of the words used in English text were ambiguous; it is suggested, however, that this figure is probably conservative. Temporary word definitions established for special purposes occurred in 30{\%} of the sample of texts. (21 ref) (PsycINFO Database Record (c) 2012 APA, all rights reserved)
Presents information derived from college students' ratings of a large number and variety of individual words (and some nonwords) for 7 basic semantic characteristics (concreteness, imagery, familiarity, pleasantness, number of attributes or features, categorizability, and meaningfulness). The normative information is presented in the form of 8 clusters of words that are mutually similar in their semantic properties, and a complete alphabetical listing of all words is given.
A sample of 45 student subjects provided solution scores for 80 five-letter anagrams. These scores were analysed as a function of solution word imagery, con-creteness, familiarity, objective frequency, age-of-acquisition and associative meaningfulness using multiple regression techniques. Two bigram measures together with number of vowels, nature of starting letter (vowel or consonant), anagram pronounceability and anagram-solution similarity scores were also entered into the regression equations. The bigram measures, the starting letter and anagram-solution similarity emerged as having significant associations with the solution scores. Previous reports of imagery effects in anagram are discussed in the light of the present results.
We describe the Age-Dependent Evaluations of German Adjectives (AGE). This database contains ratings for 200 German adjectives by young and older adults (general word-rating study) and graduate students (self-other relevance study). Words were rated on emotion-relevant (valence, arousal, and control) and memory-relevant (imagery) characteristics. In addition, adjectives were evaluated for self-relevance (Does this attribute describe you?), age relevance (Is this attribute typical for young or for older adults?), and self-other relevance (Is this attribute more relevant for the possessor or for other persons?). These ratings are included in the AGE database as a resource tool for experiments on word material. Our comparisons of young and older adults' evaluations revealed similarities but also significant mean-level differences for a large number of adjectives, especially on the valence dimension. This highlights the importance of age in the perception of emotional words. Data for all the words are archived at www.psychonomic.org/archive/.
The confusion matrix for the full lowercase English alphabet is estimated, based upon 25 trials per letter for each of seven subjects. Average correct recognition was controlled to 0.5 by limiting brightness and duration of displays to individually determined levels. Comparison of the obtained data to that reported by Bouma 11971t for eccentric vision supports the conclusion that limited energy foveal recognition is qualitatively different from eccentric vision recognition. Comparison of the obtained data to the uppercase confusion matrix reported by Townsend (1971) supports the inference that recognition performance has more between-letter variability for both recognizability and confusion pairings for the lowercase alphabet than for the upper.
A set of rating data was collected which compared the visual similarity of consonants to the auditory similarity of the same letters. Analysis of the rating patterns raises important questions about the use of similarity data in drawing conclusions about mode of memory coding in a variety of situations.
Solution-word imagery appears to affect difficulty of anagram solving. To assist research in this area, imagery, concreteness, age-of-acquisition, familiarity, and meaningfulness values for 205 5-letter words, known to form single-solution anagrams, are presented. None of the words have repeated letters. In an empirical study with university students, intergroup reliabilities were satisfactory on all attributes. Significant correlations were found with previous word lists, and the intercorrelations between dimensions matched previous findings. (PsycINFO Database Record (c) 2008 APA, all rights reserved).
In order to establish the relative dominance of the various alternative meanings of 20 homographs. 100 subjects were given printed lists of words and asked to write a definition for each. Definitions were categorized and the frequency of definitions corresponding to each alternative meaning noted. Substantial variation in meaning dominance imbalance across homographs was observed. A high degree of correspondence between the dominance levels determined in this manner and those obtained by alternative methods was noted.
Normative values are given for the rated meaningfulness (m′) of 300 consonant-consonant-consonant trigrams (CCCs). The CCCs were selected from the entire range of association values in Witmer (1935). Thus, they represent a greater range than Costantini and Blackwood's (1968) norms and are more current than the Witmer norms.
Tabulations of letter and letter-combination versatility and frequency were made based on the Kucera and Francis (1967) word frequency count. Letter versatility, a new descriptive statistic, was defined as the number of different words in which a letter appears. These tabulations may be useful in the investigation of visual information processing, reading skills, and human memory.
In the first part of this paper bigram frequency counts are given for the first letter- pairs, last letter-pairs and “other” letter-pairs of words of more than three letters. A short discussion of the use and relevance of such tables is given. In the second part, lists of anagram-pairs of words are given for words of length three or more letters, together with approximate percentages of occurrence of such “anagrammatical” words in the English language.
Abstract 1. Presented 200 transitive verbs in Exp I to 141 British undergraduates for ratings of concreteness , ease-of-definition, observability, and acceptability of being combined with an abstract subject or object. The normative value of these indices is tabulated for each ...
A list of 75 CVCVC words and paralogs were assessed for associative reaction time, meaningfulness (a', rated association frequency), and pronunciability. Uncorrected split-half reliability coefficients of correlation were, respectively,.97,.89, and.90. Intercorrelations were consistent with those of a list of 319 CVCVCs previously assessed for associative reaction time, meaningfulness (a'), and pronunciability. Sixty-three of the 75 CVCVCs were added to the list of 319. Intercorrelations among the assessment variables for the combined list of 382 CVCVCs were highly consistent with the intercorrelations based on the list of 319. This larger alphabetically ordered single list of 382 CVCVC words and paralogs containing all three assessment values will facilitate selection of units for the purposes of research.
The age at which words are first learned appears to be more influential in determining the ease of retrieving words from semantic memory than objective frequency, familiarity, imagery, and meaningfulness. To facilitate research on a wider variety of tasks, we present norms for 543 words for age-of-acquisition, imagery, familiarity, and meaningfulness. Most of the words form single-solution anagrams. There are 471 six-letter nouns and 72 five-letter words. Also reported are the means, 80s, and ranges for each dimension and the intercorrelations between dimensions. Intergroup reliabilities ranged from .847 to .982. Recent studies have indicated that the age at which words are first learned is influential in determining the ease of retrieving words from semantic memory. Frequency of usage during childhood was found to predict latency to name category instances more accurately than adult frequency of usage (Loftus {\&} Suppes, 1972). Age-of-acquisition as rated by young adults was found by Carroll and White (1973) to be a more relevant variable than objective frequency in predicting latency to name pictures. Rated age -of-acquisition also predicted the speed and likelihood of solving anagrams more accurately than rated familiarity, objective frequency, imagery, and meaningfulness (Stratton, Jacobus, {\&} Brinley, Note l). The norms reported in this paper provide adult norms on rated age-of-acquisition, rated imagery, rated familiarity, and meaningfulness for 543 words. Two word samples from these norms were used in an earlier study (see Note 1). METHOD
Obtained ratings from 703 undergraduates for a sample of 192 surnames selected randomly from a large metropolitan area phone book. Measures of reliability and the intercorrelations among the various measures indicate that the present information may be useful for investigations of learning and remembering names. While certain unique aspects of name stimuli may have important advantages for researchers of verbal processes, the present ratings should also prove useful to personality and social psychologists when names are of interest. (PsycINFO Database Record (c) 2016 APA, all rights reserved)
Conducted 9 experiments with a total of 663 undergraduates using the technique of priming to study the nature of the cognitive representation generated by superordinate semantic category names. In Exp I, norms for the internal structure of 10 categories were collected. In Exps II, III, and IV, internal structure was found to affect the perceptual encoding of physically identical pairs of stimuli, facilitating responses to physically identical good members and hindering responses to identical poor members of a category. Exps V and VI showed that the category name did not generate a physical code (e.g., lines or angles), but rather affected perception of the stimuli at the level of meaning. Exps VII and VIII showed that while the representation of the category name which affected perception contained a depth meaning common to words and pictures which enabled Ss to prepare for either stimulus form within 700 msec, selective reduction of the interval between prime and stimulus below 700 msec revealed differentiation of the coding of meaning in preparation for actual perception. Exp IX suggested that good examples of semantic categories are not physiologically determined, as the effects of the internal structure of semantic categories on priming (unlike the effects for color categories) could be eliminated by long practice.
Three experiments were conducted to study the effects of enactive imagery (EI) on associative learning. In Experiment I, groups of Ss rated 226 verbs on EI and frequency. In Experiments II and III, Ss learned a 24- and a 16-item list, respectively. The lists consisted of the four possible stimulus-response combinations of high (H) and low (L) EI verb pairs: H-H, H-L, L-H, L-L. In both experiments, EI was found to be a significant factor on the stimulus side, performance being superior when the stimulus was of high EI. In Experiment III, the response EI main effect and the Stimulus by Response EI interaction were also found to be significant. The results indicated that like the imagery evoked by nouns, the EI evoked by verbs facilitates learning.
Block capital letters were displayed to experienced and inexperienced Ss, using a 20 x 20 matrix of vibratory tactors placed against the back. In two separate experiments, a total of five modes of stimulus presentation. three of them employing a linear scanning slit, were studied. The poorest method, stationary flashing of the letter, allows performance that is well above chance, implying that a purely spatial presentation does convey information. Performance is improved when the letter is moved horizontally across the display. The best performance is achieved when the amount of simultaneous stimulation is limited by using a linear scanning slit. In one method, the letter moves behind a stationary slit, with the result that its horizontal dimension is portrayed only in time. In the other two methods, the scanning slit moves across the stationary letter, portraying the letter both in time and in space. The results of all five display modes indicate that Ss can use whichever representation, spatial or temporal, is available, although patternings which most closely approximate sequential tracing by a single moving point lead to the highest recognition accuracy. We interpret these results in terms of the limited spatial resolution of the cutaneous sense. While the perception of a letter presented in either full-field condition is limited by the spatial resolution, the best measure being the two-point limen, the perception of a letter traced sequentially is limited by the localization acuity of the cutaneous sense, the best measure being the "error of localization," which is known to be considerably smaller than the two-point limen. Inasmuch as the slit methods of presentation are a compromise between simultaneous and sequential display, letter-recognition accuracy ia better with slit presentation than with the corresponding full-field mode of display.
CVCVC (319) words and paralogs previously assessed for associative reaction time (RT) by Taylor and Kimble were assessed for rated frequency (a′) and scaled rated meaningfulness (m′) following procedures used by Noble. Reliability of the a′ scale, based on three intergroup correlations, resulted in rs of .92, .90 and .89. Reliability of the m′ scale, based on an internal consistency test resulted in a mean discrepancy between the 128 empirical proportions and their corresponding theoretical proportions of 2.4; that is, the average error of reproducing all the original data from m′ scaled values was 2.4{\%}. The r between m′ and RT was .71. {\textcopyright} 1972 Academic Press, Inc. All Rights Reserved.
Describes 2 experiments with a total of 203 introductory psychology undergraduates. Exp I compared recognition memory for 2 types of homographs. For balanced homographs, there are 2 relatively equiprobable semantic encodings; for polarized homographs, there is 1 dominant encoding. It was predicted, on the basis of encoding variability theory, that recognition would be superior for polarized homographs; however, the opposite outcome was obtained and replicated at 2 retention intervals. In Exp II, successive word association tests were administered at 2 intertest intervals. The semantically scored responses support the assumption of greater variability of semantic encodings for the balanced homographs. A retrieval strategy of semantic recoding is proposed to account for the better recognition of balanced homographs.
12 Ss in each of grades K, 2, 4, and 6 rated the likability of 22 common trait adjectives. The mean ratings and standard deviations are given for each trait. Analysis of variance indicated that only 2 of the 22 traits showed significant differences across grade levels. The data indicated that the evaluative meaning for this set of traits was remarkably stable and that the conventional meaning of common trait words is achieved at an early age
Mean ratings of graphic distinctiveness were obtained for pairs of consonants. The comparisons were between uppercase forms of different consonants, lowercase forms of different consonants, and uppercase vs lowercase forms of the same consonants. The ratings were demonstrated to have satisfactory reliability and to covary moderately well with feature-component measures of letter-pair distinctiveness.
A matrix is presented of the errors of perception made by 135 men and women listening to three male and three female speakers reading aloud different randomized lists constructed from the letters of the alphabet and the digits 1—9, heard in white noise. Data from a short-term memory (STM) experiment, using simultaneous visual presentation and immediate ordered recall of two selected vocabularies of nine letters and the digits 1-9, are cited as evidence of phonemic confusion between letters and digits in STM. Conrad (1964) established a high correlation between the systematic errors made by Hsteners identifying letters of the alphabet spoken one at a time in white noise (auditory confusions), and those made by subjects when visually presented with strings of alphabetic material for immediate ordered recall. Briefly, items which sounded similar were more probably confused not only in listening, but also in short-term memory (STM). Once this relationship between auditory and STM errors had been demonstrated, it was possible, using Clarke's constant ratio rule (Clarke, 1957), to select from the listening matrix subsets of letters of diifering probabilities of auditory confusion. Conrad {\&} Hull (1964) have shown that such subsets selected for minimal auditory confusability, whether of three or nine items in the vocabulary, are significantly better recalled, after visual sequential presentation, than subsets of letters of high auditory confusability. In STM tasks using both letters and digits, Wickelgren (1965) with auditory pre-sentation, and Hintzman (1965) using visual stimuli, have shown that inter-class errors are also related to phonemic similarity. Definition of the extent and relative confusability of letters with digits has been hampered by the lack of a complete matrix of auditory confusions amongst these stimuli. The provision of such a matrix was the purpose of this study. METHOD AND PEOCEDTTKB
To obtain data for the further evaluation of age-of-acquisition as a word attribute in studies of verbal behavior, learning, and memory, estimates were secured from 62 undergraduates (35 males, 27 females) of the age at which they believed they had learned each of 220 picturable nouns (divided into two lists assigned randomly to halves of the sample), according to a 9-point scale. Reliabilities of these ratings were about .98. For comparative purposes, word frequency values for the words were secured from three large word-count studies or, where necessary, from subjective estimates made by 20 adults. Use of these and other variables as predictors of previously obtained picture-naming latencies (Carroll {\&} White, 1973) yielded results supporting the previous finding that age-of-acquisition is a more relevant predictor than word frequency. Some word frequency indices tend to reflect age-of-acquisition, but when this influence is minimized word frequency makes little contribution to the prediction.
The Palermo and Jenkins (1964) word association norms were recodified, with responses alphabetically arranged, and related to the various levels of stimuli.
Used correlation functions obtained in 2 experiments with undergraduate Os (N = 10) as a basis for describing human visual letter recognition. Visual images were filtered by means of autocorrelation for pattern information. This operation gave the relative visibilities or legibilities of the characters. The visual impressions were then cross-correlated with a set of memory records whose outputs described the relative probabilities that the stimulus was a given character. This operation described confusion errors. Finally "response bias" was described in terms of the reliability with which a memory record provides identification of a given stimulus. In these terms response bias represented an attempt by the recognition system to minimize errors in high-information responses, at the expense of producing more low-information responses as errors. (French summary)
Notes that anagram solution time may be affected by bigrams in the solution word. A data base is presented from which solution anagrams may be created based on bigram frequency and versatility.
Project on Linguistic Analysis, Berkeley.
CVCVC (319) words and paralogs previously assessed for associative reaction time (RT) by Taylor and Kimble were assessed for rated frequency (a′) and scaled rated meaningfulness (m′) following procedures used by Noble. Reliability of the a′ scale, based on three intergroup correlations, resulted in rs of .92, .90 and .89. Reliability of the m′ scale, based on an internal consistency test resulted in a mean discrepancy between the 128 empirical proportions and their corresponding theoretical proportions of 2.4; that is, the average error of reproducing all the original data from m′ scaled values was 2.4{\%}. The r between m′ and RT was .71.
To obtain more complete tables of letter sequences varying in order of approximation to English than those generally available, sequences of zero through fourth-order approximation were computer-generated using tables of single-letter, digram, trigram, and tetragram frequencies. Two sets of tables are presented. One consists of 100 randomly selected 10-letter sequences of each of zero to fourth-order material. The other consists of 40 8-letter sequences of each type, selected with the restriction that no letter appear more than once in the sequence.
Ten colors and 10 color words were scaled for meaningfulness by means of Noble's m and a' scale. The most frequent verbal responses to colors and color words were also collected.
Ratings of orthographic distinctiveness were obtained for 139 homonym pairs. Mean ratings on a 9-point scale ranged from 7.75 to 2.44. Reliability of the ratings was high (r = .91). In addition, orthographic distinctiveness was found to be independent of disparity in perceived meanings of the separate homonym forms.
200 male and 200 female undergraduates from a southern university rated the meaningfulness (m') of 84 word categories on a 7-point scale and then subsequently provided 4 instances of each category. The ratings of the category names served as the basis of separate m scales for males, females, and both together. The 84 categories are arranged in decreasing order of overall category-name M. Within each category, the frequency of occurrence of the category instances was also tabulated separately for males, females, and both together. The norms are compared with those of other experiments. (PsycINFO Database Record (c) 2000 APA, all rights reserved)
Perceptual cues mediating recognition of isolated lowercase letters have been investigated in two conditions of marginal reading:from a long distance and in eccentric vision. A high incidence of confusions indicates that observers readily use available cues for arriving at letter responses. Analysis of the confusions leads to perceptual similarities and to common properties that have possibly served as perceptual cues. Dominating similarities are /h k b/, /tilfr/,/eoc/,/aszxe/,/vw/and/gq/. Properties with high cue values are: 1. (1) vertically ascending and descending parts, 2. (2) slenderness, 3. (3) outer vertical and outer oblique parts, and also outer gaps. Inner parts are weak cues at best. Bias effects occur towards letters that occur frequently in the printed language, but they are restricted to confusions. Perceptual cues prevail over bias effects. In eccentric vision, recognition is limited by more factors than just a low visual acuity. {\textcopyright} 1971.
Examined multitrial free recall and subjective organization of 1,005 undergraduates. Ss recalled 1 of 48 lists of 16 unrelated words of high, low, or mixed word frequencies. Recall was greater for high- than for low-word-frequency lists, but no performance differences were found in subjective organization. Analyses of the kinds of subjective organization units employed and their frequencies of usage provided evidence that some subjective organization units occur frequently and should be amenable to classification.
A study was undertaken to acquire a confusion matrix of the entire upper-case English alphabet with a simple nonserifed font under tachistoscopic conditions. This was accomplished with two experimental conditions, one with blank poststimulus field and one with noisy poststimulus field, for six Ss run 650 trials each. Three mathematical models of recognition, two based on the concept of a finite number of sensory states and one being the choice model, were compared in their ability to predict the confusion matrix after their parameters were estimated from functions of the data. In order to ascertain the facility with which estimates of similarity among the letters could lead to a psychological space containing the letters, $\eta$ ij, the similarity parameter of the choice model was input to an ordinally based multidimensional scaling program. Finally, correlation coefficients were computed among parameters of the models, the scaled space, and a crude measure of physical similarity. Briefly, the results were: (1) the finite-state model that assumed stimulus similarity (the overlap activation model) and the choice model predicted the confusion-matrix entries about equally well in terms of a sum-of-squared deviations criterion and better than the all-or-none activation model, which assumed only a perfect perception or random-guessing state following a stimulus presentation; (2) the parts of the confusion matrix that fit best varied with the particular model, and this finding was related to the models; (3) the best scaling result in terms of a goodness-of-fit measure was obtained with the blank poststimulus field condition, with a technique allowing different distances for tied similarity values, and with the Euclidean as opposed to the city-block metric; and (4) there was agreement among the models in terms of the way in which the models reflected sensory and response bias structure in the data, and in the way in which a single model measured these attributes across experimental conditions, as well as agreement among similarity ami distance measures with physical Similarity. {\textcopyright} 1971 Psychonomic Society, Inc.
Norms of free association to common ambiguous English words are reported. Responses were categorized on the basis of sense relevance. On this basis, the sense dominance of the words was quantified, and the degree of ambiguity associated with each word estimated by the information measure U. This publication will be of interest primarily to researchers in verbal learning and psycholinguistics
Bilingual. eduoatiOn programs lave, been established in such. Native American languages as Aleut, Yupik, Tlingit, Haida, AthabaSkan, therokee, Lakota, Navajo, Papago, Pomo, passamagdoddy, Seminole, Tewa, and Zuni. These TrOgrais{\_}include'tle; Choctaw ,Bilingual Education Program, Northern Cleyenne,'Bilingual Education Program, Iakota Bilingual Education:Broject, Rough:Rock ,Demonstration' School BilingualAidultUral Projedt, .Ramal Navajo Tigh School Bilingual Education Program, Papago Bilingual EduCation Program, Seminole Bilingual project; San Juan, Pueblo Tewa Bilingual PrOjeCt, and Wisconsin Native American Languages'PrOject. These-programs are .funded by "three= 'main sources of Federal- fUnds,t-=the, 1965-Elementary and SecOndary{\_}Education Act (ESEA) the 1,968 'ESEA Title ,VII (Bilingual Education Ad4, and Title IV,of the 1972.EdudatiOn Amendments (Indian Education Act).. model ,proposed for the 'description and analysis of :bilingual programs. tries to map all releVant factors .outcya Single,integratedstructura and. to suggest some of the lines ,of interaction (see RC ,009 343). This- report describes 17 of the currently existingNati4e,imerican Bilingual Education programs. pSing the proposed -Model (which is briefly described) as a. guide, the differendeS -among the 17 programs are -disdussed.
An earlier article reported extensive analyses of confusion data compiled from group averages (Townsend, 1971). The present study provided for essentially the same analyses with different long-term data obtained with two individuals, the primary intent being to examine the ability of recognition and scaling models to explain data at the individual level (which the recognition models purport to describe) and to compare the confusion characteristics of the English uppercase alphabet between the two Ss and between the individual Ss and the group-averaged data. The choice and overlap models were superior to the all-or-none model in predicting the empirical confusion matrices and tended to explain the data structure in a similar manner. Multidimensional scaling analysis again supported a Euclidean metric and suggested four or five underlying stimulus dimensions. However, as before, there were no overriding intuitively appealing psychological dimensions corresponding to these, and possible reasons are discussed. The choice and overlap models appeared to fit as well or better at the individual level than at the group level and the all-or-none model to fit worse. In the present study, probability correct was fit even better by the all-or-none model than in the group study and replicated the result of being better here than the overlap and choice models. Individuals and the group were consistent in their sensory confusions as represented by similarity parameters in the choice and overlap models but differed in their response biases. A simple measure of physical similarity explained 50{\%} of the variance of the similarity structure in the confusion data.
The report presents evidence that a lognormal distribution provides a satisfactory model for the relative frequencies of occurrence of 420 conceptual nouns. The nouns had been randomly selected from the Thorndike-Lorge (T-L) source and had been used as the words in a familiarity scale. The model also provides a satisfactory description of the words from the scale which were common to the T-L count and the later Ku{\v{c}}era-Francis (K-F) count and of scaled words in the T-L count which were missing from the K-F count. For common words, there has been a very small shift toward lower frequencies of occurrence in the time between the counts. The shift originated in frequency changes in words from the middle and upper frequency categories of the familiarity scale. The report suggests that differences in types of words are responsible for differences in distributions obtained by Carroll for all words in the K-F count and the distributions presented here. It is suggested, further, that the familiarity scale continues to be useful.
This chapter discusses free-association responses to the primary purposes and other responses selected from the Palermo–Jenkins norms. A useful supplement to free-association norms is the additional determination of associations to responses that are elicited on the original test. These supplemental norms increase the number of association hierarchies available to the investigator and also provide information concerning the independent probabilities of chains of words. Free-association responses allow the independent manipulation of associative directionality of pairs of words, for example, A and B, that is, it becomes possible to choose word pairs on the basis of either the A-B or B-A associative strength. Listing of the associative probabilities of other words, given in response to B, can greatly increase the size of the pool of associative triads, that is, A-B-C chains. The original norms can be used to discover A-B-C word chains or word pairs varying in degree of bidirectionality. However, such a procedure identifies only a limited number of usable word pairs or chains.
32 undergraduates learned a list of 8 paired-associates in which 4 pairs were composed of short-latency associative RT CVCVC response terms and 4 pairs of long-latency RT terms. 2 of the 4 response terms of each group were low in meaningfulness and 2 were high. The list was learned under either a 3- or 6-sec presentation rate. Consistent with predictions and with findings of previous studies using CVC response terms, the short-latency RT terms were learned faster than the long-latency terms and the effect of RT was most pronounced at the 3-sec presentation rate. Fewer trials were required to learn the high- than the low-meaningfulness terms, and fewer trials were required for learning at the 6-sec presentation rate than at the 3-sec rate. For a sample of 65 CVCVCs, intercorrelations among RT, associative frequency, pronunciability, and association value were found to be high. Correlations between log frequency value and the other variables were low.
This paper reports imagery ratings for 338 nouns by a method similar to that of Paivio. For 111 of the nouns a comparison with ratings reported by Paivio with those reported here yielded a correlation of 0.944. Selection of nouns rated for this study was made from stimulus terms employed in various free-word-association studies.
This chapter discusses the 1952 Minnesota word association norms. The Minnesota word association norms are usually known as the Minnesota norms for the Kent–Rosanoff word association test. If the frequency of occurrence of responses to the particular stimulus word is tabulated and then arranged in order of descending frequency, it is customary to describe this series as the associative hierarchy or the associative response hierarchy for the particular stimulus word. There is a general negative relationship between the frequency of popular responses to stimuli and the number of different responses made to the stimuli. The norms have found a wide variety of uses in the experimental laboratory and have also provided a measure of individual differences. The norms provide an index of powerful verbal habits that are shared by members of the college student community.
Word associations to 40 homographic stimuli were scored in terms of the semantic features serving as S's apparent functional stimulus, and hierarchies related to the separate sets of features of homographs were determined. For most homographs the dominant meaning in the associative hierarchy was the more frequently occurring meaning in semantic counts, a finding in agreement with a spew-like principle of perceiving homographic stimuli.
This chapter discusses the complete German language norms for responses to 100 words from the Kent–Rosanoff association test. The collection of German language norms for the Kent–Rosanoff word association test was undertaken as one phase of a larger project dealing with the manner in which linguistic habits can modify aspects of behavior such as perception, learning, recall, and generalization. The normative project was carried out in the following manner: a translation from English to German was made of the Kent–Rosanoff word association test, the translated test was administered to a normative group of German students, and the results were analyzed to determine the frequency of each response to each stimulus word. The test forms were prepared on two mimeographed sheets, with numbered stimulus words arranged in columns of 25 words each, two columns to a sheet. The 100 stimulus words of the Kent–Rosanoff word association test occur quite frequently in the English language are considered, as a whole, to be emotionally neutral.
This chapter discusses free-association responses to words from the original Kent–Rosanoff word association test obtained from students residing in England and in Australia. It describes the subject populations represented in the present norms and highlights critical aspects of the procedure. The English sample consisted of 200 men and 200 women who were drawn from seven universities located throughout England. Approximately 60{\%} were enrolled in first-year courses in one of the social sciences, while the remainder were studying arts, science, or education. The median age of the subjects was 18 years, 5 months. The Australian samples were drawn from the universities of Sydney and Tasmania. The test was administered individually, using the conventional Kent–Rosanoff procedure. Response latencies were recorded. For the Australian sample, certain other responses were combined: verbs and participles, nouns and adverbs, and variances of nouns, for example, sit and sitting, peace and peaceful, and worm and earthworm.
This chapter discusses the concepts of substitution, context, and association. The elicited sentences cannot give a particularly biased estimate of word contexts for the stimulus words; however, the substitution task elicits far more antonyms than can be expected on the basis of actual contextual similarities of antonyms. The collocational study involves no editing and gives statistics on text frequencies regardless of the grammatical class of the forms. In connected discourse, nouns, verbs, indefinite pronouns, determiners, prepositions, auxiliaries, nominative pronouns, and copulas are most frequent, with adjectives and adverbs relatively infrequent. For all stimulus words, except verbs and gerunds, there is a significant positive relation between association and substitution, that is, the use of semantically and grammatically paradigmatic words as associates. Where there is high commonality in substitutions, responses tend to be paradigmatic. Where there is a high primary in the precontext or high commonality in the five most common postcontexts, the responses tend to be syntagmatic.
The meaningfulness of nonsense syllables has generally been considered in terms of Ss' ratings and use of associations and pronounceability. The redundancy of nonsense syllables was quantified by means of their component transitional probabilities, using information-theory measurements. These mathematically derived ratings were in agreement with Ss' ratings of association value and pronounceability used by previous investigators to identify the relative meaningfulness of CVC trigrams. It is hypothesized that the redundancy measures, by measuring the amount of structure in trigrams, are indicative of the potentiality for yielding signification meaning in short verbal units.
This chapter discusses single-word free-association norms for 328 responses from the Connecticut cultural norms for verbal items in categories. There are all sorts of ways in which the associative relation between two or more words can be measured. The stimulus items used in a study described in the chapter are presented with the rank and frequency with which these responses were given to the category name in the Connecticut norms listed beside each word. Stimulus items were selected from only 21 of the 43 categories, and the number of stimuli from each category ranged from 10 to 18 words. The frequency with which a word occurs as a response across the sample population of n = 100 determines the order of its listing in the norms. Thus, out of 100 Ss, 65 respond bird to the stimulus bluejay and this response, as the highest frequency response, is listed first in the associative responses to this stimulus. The normative data are presented in the sequence.
This chapter discusses homographs. As an isolated unit, the identical spelling and pronunciation of the word provides no clue as to which meaning is intended. Although interpretation of the isolated unit can be influenced by the relative frequency with which the word is used to denote one rather than the other, meaning that only the surrounding context makes clear the intended meaning. Such words that have the same form but more than one meaning are called homonyms. If the two meanings of the word are represented by the same spelling, the word is a homograph. If the two meanings of the word are represented by the same pronunciation, the word is a homophone. A word can be both a homograph and a homophone. Some words are homographs but not homophones. Because homonyms are heavily dependent on context for their interpretation, they provide useful material for studying the modification of verbal meaning as a function of experimental variations in context.
Obtained normative data from undergraduates for instances of 100 conceptual categories different from those in the Connecticut category norms from a study by B. H. Cohen, W. A. Bousfield, and G. A. Whitmarsh. Ss were instructed to provide the 1st 4 items that they thought of as representative members of a category. 200 Ss responded to 50 of the categories and another 200 Ss to the other 50 categories. The total frequencies of each response to each category are presented in ranked order, and the frequencies are also presented by sex. The norms extend the number and kinds of categories available for research on category clustering and other conceptual processes. (PsycINFO Database Record (c) 2016 APA, all rights reserved)
This chapter discusses the history of use of the Kent–Rosanoff list of word association stimuli. It describes the two sets of French word association norms and the conditions under which they were obtained. The difference between speakers of English and of Western European languages in the diversification of their responses is brought out by the use of the rank-frequency function. The chapter presents a comparison of few of the main characteristics of the sets of Kent–Rosanoff norms among countries and across languages. A simple and frequently used measure of similarity of responses among normative studies is to count the number of items for which the primary responses are identical. This method can be used to compare norms in different languages, to the extent that confidence can be placed in the translation. The chapter also presents a classification of the primary responses of French and American students and workers in terms of grammatical classes of stimuli and responses.
2 samples, totaling 470 undergraduates, were given booklets containing names of common taxonomic categories and wrote 4 examples of each. Data were obtained for 30 categories; for each category, a list was compiled ranking words according to their frequencies of occurrence as responses to the category name. For 1 sample, rank-order correlations were obtained comparing 3 different methods of tallying the data: (a) an unweighted frequency count, (b) a weighted frequency, and (c) the frequency of a word as the 1st or dominant response. Additional correlations were obtained comparing (a) the 2 samples, (b) males and females, and, where applicable, (c) the combined sample and data of W. A. Bousfield, B. H. Cohen, and G. A. Whitmarsh. (PsycINFO Database Record (c) 2000 APA, all rights reserved)
Mean probabilities of acoustic confusability were computed for 1172 CCC trigrams chosen at random from Witmer?s list of association values, and the results were tabulated. ? 1969, Psychonomic Journals, Inc.. All rights reserved.
Complete normative data are presented for responses to 56 verbal categories by students from the Universities of Maryland (W=270) and Illinois (JV= 172). All 43 categories from the Connecticut norms are included, and complete data are presented for all responses given by each S to each category label within 30 sec. Additional data include (a) number of times each response was given first and mean rank of each response, (6) correlations between the various measures and between the Maryland and Illinois samples for each category, and (c) "category potency" measures and ratings for each category.
Relatively little research has been done on the quantitative characteristics of children's word usage. This spoken count was undertaken to investigate those aspects of word usage and frequency which could cast light on lexical processes in grammar and verbal development in children. Three groups of 30 children each (boys and girls) from middle-class background were used. No child had any apparent speech or hearing handicap; all were from relatively urban areas and came from English-speaking homes. Each child was shown a 20-card form of the Thematic Apperception Test and was asked to tell stories for the pictures displayed on the cards. Each story-telling session lasted one hour and was tape-recorded. For each age group there are three word lists: (1) words ordered by frequency of use, (2) words categorized by part-of-speech classes, and (3) words alphabetically ordered. The codification procedure and the rationale for each frequency list are explained in detail.
Ratings are presented for 650 stimuli from word-association lists on each of five scales: good—bad, pleasant—unpleasant, emotional—neutral, concrete—abstract, and easy to associate to—difficult to associate to. The ratings are shown to be highly reliable, and to agree well with previously collected norms of a similar character. The intercorrelations of the five scales with one another and with word frequency are reported.
The study was devised to determine empirically the pronounciability values of 200 nonsense syllables. The rating method of Underwood and Schulz (1960) was employed and 201 college Ss participated. Obtained data were highly reliable in reference to previous research and a marked relationship with m' values was found. Relationships with association value and speed of learning were also determined. It is anticipated that the obtained values will be useful as methodological aids in the design of verbal learning studies.
In prior experiments S-generated associative devices or natural language mediators (NLMs) linking pairs of items have been shown to facilitate acquisition of paired associates. Since Ss are questioned about NLMs after learning, such repos/rts may be a result of the questioning. To obtain an a priori estimate of NLM probability, several hundred pairs, each composed of CVCs of about equal association value (AV), were shown for 15 sec. while student Ss wrote down any NLM they could generate which linked both the stumulus and response. The AV level was varied between pairs. The proportion of Ss able to generate an NLM is the associability value (AS). As expected, AS and AV are correlated although AS varies considerably among pairs composed of items about equal in AV. Experiments run after the AS scale was obtained demonstrated that AS is valuable as a predictor of learning rate. AS values were highly correlated with the frequency of NLMs in postexperiment reports. It is concluded that the AS measure represents a valuable addition to our understanding of the complexity of verbal learning.
A count of the relative incidence of letters in 431 pleasant (P) words and 702 unpleasant (U) words revealed that some letters tend to occur more frequently in the initial position of P words and other letters more frequently in the initial position of U words. A task requiring Ss to guess for each letter whether it occurred in a P word or in a U word showed that people are able to approximate these objective probabilities of initial-letter occurrence. These findings can explain how it is possible to identify the probable affective meaning of a word seen in a tachistoscope prior to its complete recognition. Faster recognition of P words in tachistoscopic experiments was accounted for in terms of response probability. {\textcopyright} 1969 Academic Press Inc. All rights reserved.
Ratings of pleasantness (PL) on a 7-point scale and of association value (a′) on a 5-point scale are reported for 101 two-syllable nouns. The ratings were obtained from two samples of 100 women each and two samples of 100 men each. Sizable differences were obtained between words on both scales. For women and men respectively, PL and a′ were correlated .570 and .585; PL and printed frequency were correlated .233 and .207; frequency and a′ were correlated .533 and .764. Women's and men's ratings correlated .973 for PL and .899 for a′. {\textcopyright} 1969 Academic Press Inc. All rights reserved.
95 CVCs were rated for pronounceability (p') on a modification of Underwood and Schulz's 9-point scale. The correlation between testretest mean p' ratings of the CVCs was .99. Correlation between mean p' ratings of CVCs by two different groups of raters was .96. Mean p' ratings of the two groups correlated .88 and .95 with Noble's m' ratings of CVCs. Stability of the means in conjunction with the correlations is evidence that the p' scale may prove to be useful in evaluating CVCs used in verbal learning experiments.
COMPILED A SINGLE ALPHABETICAL LISTING BY STIMULUS WORD FROM 20 COLLECTIONS OF NORMATIVE DISCRETE FREE ASSOCIATION DATA. INCLUDED ARE THE PRIMARY RESPONSE TO EACH STIMULUS WORD, THE ASSOCIATIVE PROBABILITY OF THE PRIMARY RESPONSE, THE WORD FREQUENCY OF EACH STIMULUS AND PRIMARY RESPONSE, AND THE NORM SOURCE FOR EACH STIMULUS.
2 groups of 21 Ss ranked 20 CVC trigrams, which represented the full-range of meaningfulness values according to established norms, for either pronunciability or ease of learning, and then recalled the trigrams incidentally. 2 additional groups of 21 Ss learned the trigrams by free learning and then ranked the trigrams for either pronunciability or ease of learning. Correlations between mean ranks, group learning measures, and meaningfulness were fairly high.
Subjects were given the sense impression adjectives from Underwood {\&} Richardson's (195 6a) list of concept formation materials and had to give associations to each word for 60 sec. Noble's (l952) m technique was used to compute association values
All 2100 CVC trigrams were scaled for pronunciability by measuring pronunciation latency (PLat). The resulting distribution of PLat scores was extremely leptokurtic and positively skewed. Scores ranged from a minimum of .531 sec to a maximum of 1.726 sec. Average PLat was .81 sec. The relationship of PLat to Archer meaningfulness was linear; however, the degree of relationship was slight (r=−.37). This finding is interpreted as indicating that PLat is relatively free of bias from such other stimulus attributes as meaningfulness. As such, P Lat is viewed as reflecting a basic processing time for such stimulus materials.
GROUPS OF SS, 17-46 YR. OLD COLLEGE STUDENTS, WERE USED TO SCALE 925 NOUNS ON ABSTRACTNESS-CONCRETENESS (C), IMAGERY (I), AND MEANINGFULNESS (M). CONCRETENESS WAS DEFINED IN TERMS OF DIRECTNESS OF REFERENCE TO SENSE EXPERIENCE, AND I, IN TERMS OF WORD'S CAPACITY TO AROUSE NONVERBAL IMAGES; C AND I WERE RATED ON 7-POINT SCALES. MEANINGFULNESS WAS DEFINED IN TERMS OF THE MEAN NUMBER OF WRITTEN ASSOCIATIONS IN 30 SEC. THE MEAN SCALE VALUES FOR THESE VARIABLES ARE PRESENTED FOR EACH OF THE 925 NOUNS. ALSO REPORTED ARE THE INTERCORRELATIONS OF THE VARIABLES, TOGETHER WITH AN EXAMINATION OF THE WORDS FOR WHICH C, I, AND M VALUES ARE MOST CLEARLY DIFFERENTIATED; AND RELIABILITY DATA, INCLUDING COMPARISONS WITH SCALE VALUES FOR THE VARIABLES FROM OTHER STUDIES. (45 REF.) (PsycINFO Database Record (c) 2012 APA, all rights reserved)
The 343 CCC trigrams having the lowest association values in Witmer's 1935 list were re-evaluated. Reliability of ratings was .89 and the correlation with Witmer's values was. 79. Large changes were found in trigrams such as JFK, QJF, LBJ, LBH, HFC, GXM, DHJ, DXB, etc. There were substantial increases in values of trigrams beginning with D. New norms are presented for experimenters who need more precise control over stimulus materials.
The method of Glaze was used to scale 320 words and paralogs for meaningfulness. One hundred Ss provided data from which three such measures were derived. Employing the most conventional of these measures (percentage of Ss responding in less than 2.5 sec) to select the items to be learned, a validating study demonstrated the usual relationship between association value and speed of learning. Other investigations have employed the materials successfully for purposes of control when the main interest of the experiment was in some other problem. {\textcopyright} 1967 Academic Press Inc.
A sample of 100 CVCs was presented to 200 Ss from widely-separated college populations at the Universities of Georgia and Montana in order to obtain comparative ratings of pronounceability (p). The results indicated high within-population reliability coefficients (r ≥.98) and a high between-population correlation coefficient (r =.975). When the ratings were grouped by fives into 20 pairs of means, the Montana norms regressed linearly on the Georgia norms with r =.993.
Focuses on the statistical relationship established between high frequency, small variety and shortness in length of words. Analysis of written language; Influence of age differences with the relationship between variety and frequency of occurrence of words; Growth of available vocabulary.
CONTAINS 3 LISTS FOR USE IN ANAGRAM STUDIES: (1) A SOURCE LIST GROUPING WORDS ACCORDING TO THEIR ALPHABETIZED FORM, (2) ALL 5-LETTER ENGLISH WORDS WHOSE LETTERS CANNOT BE REARRANGED TO SPELL ANY OTHER WORD, AND (3) AN ALPHABETICAL LISTING OF ALL MULTIPLE- AND SINGLE-SOLUTION 5-LETTER WORDS. (PsycINFO Database Record (c) 2016 APA, all rights reserved)
329 nouns of A-frequency were rated on a 7-point scale for concreteness, specificity, and pronunciability by three different groups of Ss. A fourth group was used to scale the words for m by a 30-sec production method. Means of each word for s, c, and m are presented. The interrelationships among all parameters and the second-order partial correlations between m, s, c and number of letters (L) are discussed. {\textcopyright} 1966 Academic Press Inc.
A TABLE OF WORD FREQUENCIES DERIVED FROM 250,000 WORDS OF RECORDED INTERVIEWS WITH UNIVERSITY STUDENTS AND HOSPITAL PATIENTS IS PRESENTED. DATA FOR SUBSAMPLES OF 100,000 WORDS EACH FROM THE STUDENT PATIENT POPULATIONS ARE ALSO GIVEN TO PERMIT EVALUATION OF THEIR DIFFERENCES. A TOTAL OF 9699 DIFFERENT WORDS, OF WHICH 4097 OCCURRED ONLY ONCE IN THE COMPLETE SAMPLE, ARE LISTED.
NORMATIVE SOLUTION TIMES BASED ON A SAMPLE OF 134 SOLUTION WORDS AND 378 ASSOCIATED ANAGRAMS COMPILED FROM 9 STUDIES ARE PRESENTED, AS WELL AS THE 120 LETTER ORDERS POSSIBLE WITH A 5-LETTER WORD, AND A SKELETON-WORD TEST AND SCORING KEY USED FOR ASSESSING THE DEGREE TO WHICH SS STORE DIGRAM FREQUENCY INFORMATION. (16 REF.)
Word association norms are reported for 200 boys and girls, aged 12 yr. 9 mo.-13 yr. 8 mo. For those words which constituted a retest, analysis showed a high consistency in the primary response. When the primaries were used as S words, Ss frequently responded with the word that had originally produced the primary. In 14 cases, however, a new association was produced and these "chains of association" are presented. ((c) 1997 APA/PsycINFO, all rights reserved)
Word associations to each of the 26 letters of the alphabet were obtained under procedures of single association or continued associations for both upper- and lower-case letters. The results showed a significant relationship between m values and measures of frequency of letters, preferences for letters, and vocal reaction time to letters. The data also showed that m values for each letter were stable within a session. Analyses of the most frequent associations showed a high degree of consistency among the common associations for the single and continued instructional procedures and upper- and lower-case stimulus presentations of the letters. {\textcopyright} 1965 Academic Press Inc. All rights reserved.
Semantic differential (SD) factor scores on the Evaluation, Activity, and Potency dimensions are presented for 1,000 most frequently used English words. Also given are the standard errors of the factor scores, the results of several reliability studies, and a listing (for all words) of 3 types of derived scores: polarizations, n Affiliation contents, n Achievement contents. Test-ing procedures and statistics on the sample of raters are detailed. Some uses of the dictionary are suggested, and an example of its use in a study of motivation is presented including empirical results. Conditions favoring further cumulation of SD data are discussed. THE semantic differential (SD) has proven to be an accurate instrument for recording affective associations of stim-uli, particularly to the extent that such as-sociations are culturally or subculturally denned so that measurements may be aver-aged over groups of individuals (Norman, 1959). In a wide variety of studies, includ-ing many involving cross-cultural samples of raters, it has been demonstrated that affective judgments on bipolar adjective scales reliably resolve into three major dimensions or factors which Osgood has named Evaluation, Activity, and Potency 'This paper is part of a doctoral dissertation submitted to the
Single letter responses to each letter of the alphabet were collected from 1611 Ss. Subsamples were instructed either to respond with associations or to attempt to reproduce letter sequences of the English language. The results differed in several ways from previous norms.
One hundred institutionalized adolescent educable retardates gave oral free-association responses to the 100 words of the Kent-Rosanoff list presented in an order that was expected to minimize antonym response set. Their responses were tabulated to provide preliminary word-association norms for this subject population.
A WORD LIST OF SPOKEN RUSSIAN WAS COMPILED BASED ON AN ACTUAL COUNT OF 10,000 WORDS. THE WORDS WERE COMPOSED OF 50-WORD SAMPLES TAKEN FROM 200 ACTS OF 93 PLAYS PUBLISHED SINCE 1957. IT WAS FOUND THAT JUST 360 WORDS, FROM A TOTAL OF 2,380 WORDS TABULATED, REPRESENTED 73 PERCENT OF ALL OCCURRENCES. THE AUTHOR PREPARED SAMPLE DIALOGUES USING ONLY THESE HIGH-FREQUENCY VOCABULARY ITEMS AS FURTHER PROOF THAT INTELLIGENT COMMUNICATION AT AN ADULT LEVEL WAS POSSIBLE.
Six groups, each consisting of 10 Ss, produced strings of 11, 22, or 33 words, under one of two sets of instructions: “structured” instructions which asked S to produce a grammatically acceptable string; “unstructured instructions” which solicited a random, unconstrained string. Each word in S's production was classified into one of six grammatical classes. For the structured instructions the rank ordering of frequency of the six classes was functors (40{\%}), nouns (25{\%}), verbs (13{\%}), adjectives (12{\%}), pronouns (6{\%}), adverbs (3{\%}). For the unstructured instructions the rank ordering of frequency was nouns (68{\%}), adjectives (12{\%}), verbs (11{\%}), functors (4{\%}), adverbs (1{\%}), pronouns (0.55{\%}). The findings were discussed in relation to three questions.
PRESENTS TABLES, BASED ON A SAMPLE OF 20,000 ENGLISH WORDS, WHICH SHOW SINGLE-LETTER AND DIGRAM FREQUENCY COUNTS BROKEN DOWN TO ACCOUNT FOR ALL WORD-LENGTH AND LETTER-POSITION COMBINATIONS, FOR WORDS 3-7 LETTERS IN LENGTH. THESE TABLES ARE SIMILAR TO THOSE OF PRATT AND OF UNDERWOOD AND SCHULZ, BUT IN ADDITION ALLOW FOR THE ASSESSMENT OF FREQUENCIES FOR ALL WORD-LENGTH AND LETTER-POSITION VARIATIONS IN THE WORD SAMPLE. THE TABLES MAY BE EMPLOYED TO PROVIDE NORMATIVE FREQUENCY DATA FOR STUDIES IN VERBAL LEARNING AND RETENTION, ANAGRAM PROBLEM SOLVING, WORD RECOGNITION THRESHOLDS, LINGUISTIC ANALYSES, ETC. (PsycINFO Database Record (c) 2012 APA, all rights reserved)
Word association norms are presented for 2 lists of 200 words including 100 words of the Kent-Rosanoff list. Ss were 250 boys and 250 girls in each of the Grades 4-8, 10, and 12 in the Minneapolis public schools and 500 male and 500 female students in introductory psychology classes at the University of Minnesota. (PsycINFO Database Record (c) 2007 APA )
This paper presents an analysis of the relations among the associative distributions to 278 English adjectives. These adjectives provide a nearly exhaustive sample of adjectives with Thorndike-Lorge frequencies of 50 occurrences per million or greater. The analysis shows that among these adjectives there are 40 pairs of polar opposites or contrasts. Factor analysis of the intersections of associative distributions among these polar opposites reveals that they are nearly completely orthogonal. There is a sizeable correlation between loadings for the pairs big-little and large-small; nearly all the remaining pairs produce correlations close to zero. Some of the pairs define scales on which it is possible to locate parts of the intraverbal meaning of other adjectives. More than 100 adjectives in the present sample are located on one or more scales produced by the basic contrasts. Not all pairs of contrasts, however, have scalar properties; these words define contrasts but not contrasting scales. Finally, not all adjectives are organized by contrast, though it is probable that a very large number, perhaps the majority, are. Finally evidence is presented for the general linguistic validity of the results of factor analysis of associative meaning. Such evidence points to the determination of intraverbal relations in meaning by partial contextual equivalences plus whatever external contingencies exist between words and the environment at large. {\textcopyright} 1964 Academic Press Inc. All rights reserved.
Multiple word associations to 52 CVCs were obtained from grade-school-aged children. Meaningfulness values (m) for each word, defined as the mean number of associations, were determined. While m values for the set of words increased with grade-age, the rank ordering of these values was essentially the same within each grade-age-sex group. Sex was not related to m values. Values of m when compared with Noble's m' showed significant agreement both for rank-order comparisons and values dichotomized into high and low categories.
If communality of responses is stable, the relative popularity of responses to the Kent-Rosanoff Word Association Test should remain the same for subjects from young adulthood to advanced age. The Kent-Rosanoff was administered individually to 738 subjects from 18 to 87 years of age from various occupations and from various parts of the country. The results indicate that there is a decrease in the strength of communality accompanied by an increase in variability with the advance in age. ((c) 1997 APA/PsycINFO, all rights reserved)
Sequences of 6 letters of the alphabet were visually presented for immediate recall to 387 subjects. Errors showed a systematic relationship to original stimuli. This is held to meet a requirement of the decay theory of immediate memory. The same letter vocabulary was used in a test in which subjects were required to identify the letters spoken against a white noise background. A highly significant correlation was found between letters which confused in the listening test, and letters which confused in recall. The role of neurological noise in recall is discussed in relation to these results. It is further argued that information theory is inadequate to explain the memory span, since the nature of the stimulus set, which can be defined quantitatively, as well as the information per item, is likely to be a determining factor.
There is a lack of indices of rated association for English words, in contrast to a large pool of rated nonsense syllables. To fill this need, 446 English words were randomly selected from Webster's New International Dictionary to represent all English monosyllables, bisyllables, and trisyllables. They were rated for associations on a 7-point scale by 126 Ss. From the ratings three indices of association were obtained. All indices have uncorrected reliabilities in the range 0.95–0.98.
Ratings of the association values on a five-point scale by 95 Ss are reported for the 101 numbers between 0 and 100. The results reveal sizable and relatively consistent differences between numbers which are shown to be related to learning difficulty, as well as a high degree of individual variability. {\textcopyright} 1962.
Alfred Castaneda, Leila Snyder Fahel, Richard Odom, Associative Characteristics of Sixty-Three Adjectives and Their Relation to Verbal Paired-Associate Learning in Children, Child Development, Vol. 32, No. 2 (Jun., 1961), pp. 297-304
An operational definition of abstractness in nouns was constructed by using the human discriminative response to identify two points on a scale of abstractness. This scale, consisting of 490 'abstract' and 571 'concrete' nouns, was found to have adequate reliability. When the scale was manipulated as an independent variable, the effect of abstractness on short-term recognition memory was highly significant, 'abstract' nouns being less well remembered than 'concrete' nouns. Frequency was found to be pertinent variable, independent of abstractness, very frequent nouns being less well remembered than some-what rarer nouns.
A re-evaluation of all possible 3-letter combinations of the Roman alphabet of the form consonant-vowel-consonant with the restriction that the 2 consonants are different and that neither is a y when y is the vowel. A list of 2480 possible trigrams is included. Reliability was determined by 2 measures, and the association values were correlated with the previously reported list of trigrams by Glaze and Krueger. It was concluded that the present list provides a better estimate of the meaningfulness of trigrams than has thus far been available. From Psyc Abstracts 36:01:1CI23A. (PsycINFO Database Record (c) 2006 APA, all rights reserved)
INFORMATION regarding the sequential dependencies among letters in the English language is of interest not only in connexion with linguistics and cryptography, but also because of its value for communication theory and for the psychology of language1,2. ? 1960 Nature Publishing Group.
The authors report a series of original studies and analyze previous work in the area. "{\ldots} the position is taken that the frequency with which verbal units have been experienced is the fundamental variable responsible for the characteristics which have been used to define meaningfulness." The implications of the frequency hypothesis were tested in 16 experiments. These experiments deal with the effects of the frequency of letters and letter-combinations on serial and paired-associate learning, the effect of "pronunciability" on learning, the effect of frequency on letter-sequence habits, and the difference in the effect of the meaningfulness of a word depending on whether the word is a stimulus or a response.
A sample of 100 college students ranked the alphabet according to their preference for the appearance of the capital letter. Rankings are presented for the total sample, and for subgroups based on age and sex. Coefficients of concordance among judges are low, but the rankings for the total sample and the age and sex subsamples appear to be quite reliable.
(1) The first objective is to relate the semantic differential to other associative techniques. The semantic diFerential may be viewed as a restricted association-test of potentially high sensitivity. In essence, the subject (S) is given a concept and asked whether it is more ... $\backslash$n
This is an extension of the 1921 and 1931 Thorndike word books, including the data from these counts and from three other counts of over 4Β million words each. Most of the book comprises Part I, being the list of English words occurring at least once per million words of representative general reading matter. The five columns after each alphabetized entry report (1) numbers stating occurrences per million words, (2) the Thorndike general count of 1931, (3) the Lorge magazine count, (4) the Thorndike count of 120 juvenile books, and (5) the Lorge-Thorndike semantic count. Part II is a list of words occurring at least once per four million words of printed usage but not so often as once per million words. A list of the 500 most frequently encountered words and of the 500 next in order completes the volume. The introduction contains a detailed set of instructions for the use of this material by elementary and secondary school teachers in furthering pupil growth in vocabulary. (PsycINFO Database Record (c) 2009 APA, all rights reserved)
The normal range of reaction in response to any of our stimulus words is largely confined within narrow limits. The frequency tables compiled from test records given by one thousand normal subjects comprise over ninety per cent of the normal range in the average case. With the aid of the frequency tables and the appendix normal reactions, with a very few exceptions, can be sharply distinguished from pathological ones. The separation of pathological reactions from normal ones simplifies the task of their analysis, and makes possible the application of a classification based on objective criteria. By the application of the association test, according to the method here proposed, no sharp distinction can be drawn between mental health and mental disease; a large collection of material shows a gradual and not an abrupt transition from the normal state to pathological states. In dementia pr{\ae}cox, some paranoic conditions, manic-depressive insanity, general paresis, and epileptic dementia the test reveals some characteristic, though not pathognomonic, associational tendencies.
In this work we present SENTIWORDNET 3.0, a lexical resource explicitly devised for supporting sentiment classification and opinion mining applications. SENTIWORDNET 3.0 is an improved version of SENTIWORDNET 1.0, a lexical resource publicly available for research purposes, now currently licensed to more than 300 research groups and used in a variety of research projects worldwide. Both SENTIWORDNET 1.0 and 3.0 are the result of automatically annotating all WORDNET synsets according to their degrees of positivity, negativity, and neutrality. SENTIWORDNET 1.0 and 3.0 differ (a) in the versions of WORDNET which they annotate (WORDNET 2.0 and 3.0, respectively), (b) in the algorithm used for automatically annotating WORDNET, which now includes (additionally to the previous semi-supervised learning step) a random-walk step for refining the scores. We here discuss SENTIWORDNET 3.0, especially focussing on the improvements concerning aspect (b) that it embodies with respect to version 1.0. We also report the results of evaluating SENTIWORDNET 3.0 against a fragment of WORDNET 3.0 manually annotated for positivity, negativity, and neutrality; these results indicate accuracy improvements of about 20{\%} with respect to SENTIWORDNET 1.0.
This landmark publication in comparative linguistics is the first comprehensive work to address the general issue of what kinds of words tend to be borrowed from other languages. The authors have assembled a unique database of over 70,000 words from 40 languages from around the world, 18,000 of which are loanwords. This database (http://loanwords.info) allows the authors to make empirically founded generalizations about general tendencies of word exchange among languages"--Provided by publisher. Notational conventions -- Acknowledgments -- List of authors -- General chapters: I. The loanword typology project and the world loanword database / Martin Haspelmath and Uri Tadmor -- II. Lexical borrowing: Concepts and issues / Martin Haspelmath -- III. Loanwords in the world's languages: Findings and results / Uri Tadmor -- THE LANGUAGES: 1. Loanwords in Swahili / Thilo C. Schadeberg -- 2. Loanwords in Iraqw, a Cushitic language of Tanzania / Maarten Mous and Martha Qorro -- 3. Loanwords in Gawwada, a Cushitic language of Ethiopia / Mauro Tosco -- 4. Loanwords in Hausa, a Chadic language in West Africa / Ari Awagana and H. Ekkehard Wolff, with Doris Löhr -- 5. Loanwords in Kanuri, a Saharan language / Doris Löhr and H. Ekkehard Wolff, with Ari Awagana -- 6. Loanwords in Tarifiyt, a Berber language of Morocco / Maarten Kossmann -- 7. Loanwords in Seychelles Creole / Susanne Michaelis with Marcel Rosalie -- 8. Loanwords in Romanian / Kim Schulte -- 9. Loanwords in Selice Romani, an Indo-Aryan language of Slovakia / Viktor Elšík -- 10. Loanwords in Lower Sorbian, a Slavic language of Germany / Hauke Bartels -- 11. Loanwords in Old High German / Roland Schuhmann -- 12. Loanwords in Dutch / Nicoline van der Sijs -- 13. Loanwords in British English / Anthony Grant -- 14. Loanwords in Kildin Saami, a Uralic language of northern Europe / Michael Riessler -- 15. Loanwords in Bezhta, a Nakh-Daghestanian of the North Caucasus / Bernard Comrie and Madzhid Khalilov -- 16. Loanwords in Archi, a Nakh-Daghestanian of the North Caucasus / Marina Chumakina -- 17. Loanwords in Manange, a Tibeto-Burman language of Nepal / Kristine A. Hildebrandt -- 18. Loanwords in Ket, a Yeniseian language of Siberia / Edward Vajda -- 19. Loanwords in Sakha (Yakut), a Turkic language of Siberia / Brigitte Pakendorf and Innokentij N. Novgorodov -- 20. Loanwords in Oroqen, a Tungusic language of China / Fengxiang Li and Lindsay J. Whaley. Loanwords in Japanese / Christopher K. Schmidt -- 22. Loanwords in Mandarin Chinese / Thekla Wiebusch and Uri Tadmor -- 23. Loanwords in Thai / Titima Suthiwan and Uri Tadmor -- 24. Loanwords in Vietnamese / Mark J. Alves -- 25. Loanwords in White Hmong / Martha Ratliff -- 26. Loanwords in Ceq Wong, an Austroasiatic language of Peninsular Malaysia / Nicole Kruspe -- 27. Loanwords in Indonesian / Uri Tadmor -- 28. Loanwords in Malagasy / Alexander Adelaar -- 29. Loanwords in Takia, an Oceanic language of Papua New Guinea / Malcolm Ross -- 30. Loanwords in Hawaiian / 'Ōiwi Parker Jones -- 31. Loanwords in Gurindji, a Pama-Nyungan language of Australia / Patrick McConvell -- 32. Loanwords in Yaqui, a Uto-Aztecan language of Mexico / Zarina Estrada Fernández -- 33. Loanwords in Zinacantán Tzotzil, a Mayan language of Mexico / Cecil H. Brown -- 34. Loanwords in Q'eqchi', a Mayan language of Guatemala / S{\o}ren Wichmann and Kerry Hull -- 35. Loanwords in Otomi, an Otomanguean language of Mexico / Ewald Hekking and Dik Bakker -- 36. Loanwords in Saramaccan, an English-based creole of Suriname / Jeff Good -- 37. Loanwords in Imbabura Quechua / Jorge Gómez Rendón and Willem Adelaar -- 38. Loanwords in Kali'na, a Cariban language of French Guiana / Odile Renault-Lescure -- 39. Loanwords in Hup, a Nadahup language of Amazonia / Patience Epps -- 40. Loanwords in Wichí, a Mataco-Mataguayan language of Argentina / Alejandra Vidal and Verónica Nercesian -- 41. Loanwords in Mapudungun, a language of Chile and Argentina / Lucía A. Golluscio.
This book promotes the development of linguistic databases by describing a number of successful database projects, focusing especially on cross-linguistic and typological research. It has become increasingly clear that ready access to knowledge about cross-linguistic variation is of great value to many types of linguistic research. Such a systematic body of data is essential in order to gain a proper understanding of what is truly universal in language and what is determined by specific cultural settings. Moreover, it is increasingly needed as a tool to systematically evaluate contrasting theoretical claims. The book includes a chapter on general problems of using databases to handle language data and chapters on a number of individual projects.
TLS explores the conceptual schemes of pre-Buddhist Chinese on the basis of over 8500 A4 pages of text with interlinear translations. TLS is a sustained effort in philological and philosophical fieldwork, designed throughout to make the classical Chinese evidence strictly comparable to that of other cultures, and to make possible meaningful analytic primary-evidence-based disagreement among non-sinologists on classical Chinese concepts and words. TLS is compiled in the hope that careful philosophical reflection on Chinese texts might serve to broaden the empirical basis for philosophical theories and generalisations on conceptual schemes. TLS is based on the conviction that we should improve the clarity and bite of declarations of difference between conceptual schemes by enlarging the basis of literally translated and analysed texts from widely (though never radically) different intellectual cultures. The necessary charitable assumption that if we want to understand others we must count them right in most matters will not prevent TLS from looking for and exploring deep conceptual contrasts to the full. TLS seeks to make precise criteria of translation for classical Chinese, mainly through a detailed description in English of systematic recurrent semantic relations between Chinese words, especially distinctive semantic features. TLS is the first synonym dictionary of classical Chinese in any Western language. TLS focusses on distinctive semantic nuances. TLS is the first interactive dictionary of Chinese. TLS is the first dictionary which systematically organises the Chinese vocabulary in taxonomic and mereonomic hierarchies thus showing up whole conceptual schemes or cognitive systems. These are taken to circumscribe the changing topology of Chinese mental space. TLS is the first dictionary that systematically registers a range of lexical relations like antonym, converse, epithet etc. TLS thus aims to define conceptual space as a relational space. TLS is the first dictionary of Chinese which incorporates detailed syntactic analysis of (over 600 distinct kinds of) syntactic usage. TLS thus enables us to make a systematic study of such basic phenomena as the natural history of abstract nouns in China. TLS is the first corpus-based dictionary which will record the history of rhetorical devices in texts and will thus enable us to study such intellectually crucial things as the natural history of irony in China. All analytic categories and procedures of analysis in TLS are flexible in the sense that they are continuously being revised and improved in the light of new observation and analysis
Languages with binary stress systems frequently tolerate a stress lapse over the final two syllables, but almost none tolerate a word-initial stress lapse. Lunden (to appear) argues that this lapse asymmetry can be explained by the presence of word-level final lengthening, which can then create the perception of prominence alternation in languages that use duration as stress correlate. The results of a production and a perception study with English speakers are presented which compare /ɑ/s that occur under stress lapse to /ɑ/s in non-stress-lapse positions. While word-final unstressed /ɑ/ is always longer than non-final unstressed /ɑ/, it is significantly longer when immediately following an unstressed syllable. Similarly, unstressed word-final /ɑ/ has a higher F1 and lower F2 than non-final unstressed /ɑ/, but word-finally this less-reduced vowel is closer to a full vowel when the final syllable is part of a stress lapse. The perception study finds that these differences have perceptual consequences that can lead to a perceived continued rhythm in stress lapse. The phonetic differences explain why a word-final unstressed vowel can be perceived as relatively strong when following an unstressed syllable but as relatively weak when following a stressed syllable.
Today, people generate and store more data thanever before as they interact with both real and virtual environ-ments. These digital traces of behavior and cognition offercognitive scientists and psychologists an unprecedented op-portunity to test theories outside the laboratory. Despite gen-eral excitement about big data and naturally occurring datasetsamong researchers, threeBgaps{\^{}}stand in the way of theirwider adoption in theory-driven research: theimaginationgap, theskillsgap, and theculturegap. We outline an ap-proach to bridging these three gaps while respecting our re-sponsibilities to the public as participants in and consumers ofthe resulting research. To that end, we introduce Data on theMind (http://www.dataonthemind.org), a community-focusedinitiative aimed at meeting the unprecedented challenges andopportunities of theory-driven research with big data and nat-urally occurring datasets. We argue that big data and naturallyoccurring datasets are most powerfully used tosupplement—not supplant—traditional experimental para-digms in order to understand human behavior and cognition,and we highlight emerging ethical issues related to the collec-tion, sharing, and use of these powerful datasets.
This paper describes the acquisition, preparation, and properties of a corpus extracted from the Proceedings of the European Parliament. This corpus is available in 11 languages, consists of over 200 million words per language, and is preprocessed for use in statistical machine translation. We describe the methods we used for crawling, document alignment, and sentence alignment. We also present a common test set for machine translation and report the results of a number of basic statistical machine translation experiments.
Die vorliegende Dissertation beschreibt Aufbau und Funktionalit{\"{a}}t der bayerischen Dialektdatenbank BAYDAT. Die Datenbank fasst die Erhebungsdaten der Teilprojekte des Bayerischen Sprachatlas (BSA) zusammen, speichert sie zukunftssicher und macht sie zentral nutzbar. Die Arbeit zeigt die Vorgehensweise bei der Aufbereitung der Quelldateien, beschreibt die einzelnen Datenbanktabellen der BAYDAT-Datenbank und widmet sich der Realisierung und Funktionalit{\"{a}}t der Onlineoberfl{\"{a}}che, {\"{u}}ber die die BAYDAT-Datenbank einem weltweiten Nutzerkreis aus Dialektologen und interessierten Laien zur Verf{\"{u}}gung stehen soll. The PhD thesis at hand describes the setup and functionality of the Bavarian dialect database BAYDAT. The database integrates the data of the subprojects of the Bayerischer Sprachatlas (BSA, Atlas of the Bavarian language). The database ensures a future-proof storage of the data and makes the data available at one central point. The thesis describes the methods used in formatting the source data. It also describes the structure of the different database tables. It also contains a description of the development and the functionality of BAYDAT's graphical user interface that will allow linguists and laypersons to access the database online.
India is a multilingual country where machine translation and cross lingual search are highly relevant problems. These problems require large resources- like wordnets and lexicons- of high quality and coverage. Wordnets are lexical structures composed of synsets and semantic relations. Synsets are sets of synonyms. They are linked by semantic relations like hypernymy (is-a), meronymy (part-of), troponymy (manner-of) etc. IndoWordnet is a linked structure of wordnets of major Indian languages from Indo-Aryan, Dravidian and Sino-Tibetan families. These wordnets have been created by following the expansion approach from Hindi wordnet which was made available free for research in 2006. Since then a number of Indian languages have been creating their wordnets. In this paper we discuss the methodology, coverage, important considerations and multifarious benefits of IndoWordnet. Case studies are provided for Marathi, Sanskrit, Bodo and Telugu, to bring out the basic methodology of and challenges involved in the expansion approach. The guidelines the lexicographers follow for wordnet construction are enumerated. The difference between IndoWordnet and EuroWordnet also is discussed.