Estimating Semantic Transparency of Constituents of English Compounds and Two-Character Chinese Words using Latent Semantic Analysis

Hsueh Cheng Wang, Li Chuan Hsu, Yi Min Tien, Marc Pomplun

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

The constituents of English compounds (e.g., butter and fly for butterfly) and two-character Chinese words may differ in meaning from the whole word. Furthermore, the meanings of the words containing the same constituent (e.g., butter in “butterfingers”, or “buttermilk”) may or may not be consistent. Estimating semantic transparency of a constituent is usually difficult and subjective because of these uncertainties and ambiguities. It is rather unexplored why a constituent is considered transparent/opaque by raters, and how its polysemy correlates to its transparency. We propose a computational method for predicting semantic transparency based on Latent Semantic Analysis. We computed the primary meaning of a constituent by a clustering analysis and compared it to the whole-word meaning. The proposed method successfully predicted participants’ transparency ratings, and may explain the cognitive processes in raters when classifying semantic transparency of English compounds and two-character Chinese words.

Original languageEnglish
Title of host publicationBuilding Bridges Across Cognitive Sciences Around the World - Proceedings of the 34th Annual Meeting of the Cognitive Science Society, CogSci 2012
EditorsNaomi Miyake, David Peebles, Richard P. Cooper
PublisherThe Cognitive Science Society
Pages2499-2504
Number of pages6
ISBN (Electronic)9780976831884
StatePublished - 2012
Event34th Annual Meeting of the Cognitive Science Society: Building Bridges Across Cognitive Sciences Around the World, CogSci 2012 - Sapporo, Japan
Duration: 1 Aug 20124 Aug 2012

Publication series

NameBuilding Bridges Across Cognitive Sciences Around the World - Proceedings of the 34th Annual Meeting of the Cognitive Science Society, CogSci 2012

Conference

Conference34th Annual Meeting of the Cognitive Science Society: Building Bridges Across Cognitive Sciences Around the World, CogSci 2012
Country/TerritoryJapan
CitySapporo
Period1/08/124/08/12

Keywords

  • Chinese
  • clustering
  • compound words
  • latent semantic analysis
  • semantic transparency

Fingerprint

Dive into the research topics of 'Estimating Semantic Transparency of Constituents of English Compounds and Two-Character Chinese Words using Latent Semantic Analysis'. Together they form a unique fingerprint.

Cite this