Browsing by Subject "Hebrew language education"
Now showing 1 - 1 of 1
- Results Per Page
- Sort Options
Item Creating a frequency dictionary of spoken Hebrew : a reproducible use of technology to overcome scarcity of data(2018-05) Pinto, Juan Daniel; Raizen, Esther, 1951-; Horwitz, Elaine Kolker, 1950-Studies using word frequency dictionaries—on topics such as vocabulary acquisition, vocabulary load, extensive reading, and vocabulary testing—have historically centered around corpora and morphological issues specific to European languages, especially English. One of the reasons for this is the lack of resources that often plagues departments of less commonly taught languages. Corpora of spoken language are particularly difficult to obtain—the funding and time necessary often make such a project impossible. This thesis is an effort to provide some of the methodology and tools necessary for educators interested in creating frequency dictionaries for research purposes, for their own classrooms, or even for wider dissemination. In doing so, it will provide an overview of some of the key decisions that must be taken into account for such a project. Throughout this thesis, the creation process behind the Frequency Dictionary of Spoken Hebrew (FDOSH)—a list of the most common words in conversational Modern Hebrew—will be explained. The tools used to create the FDOSH, including corpus resources and customized scripts, are provided as part of a repository of supplementary materials. The goal is to make the entire dictionary-creation process as reproducible as possible while allowing for flexibility and transparency in the tools used. It does this by using well-documented open-source scripts written in an easily readable programming language, Python. Beyond providing these tools, the present project explores the theory and many of the considerations that play an important role in the creation of a frequency dictionary. These include issues such as corpus size, corpus text type, whether the list is intended for general or specialized use, word family levels, and objective criteria. Issues regarding Hebrew’s synthetic morphology and ambiguous non-vocalized writing system are also addressed. The project aims to serve as a catalyst for future research that may build upon the ideas discussed here. The development and open dissemination of tools such as these can only lead to greater cooperation among educators and researchers, to the benefit of all involved.