Chemosensory vocabulary in wine, perfume and food product reviews: Insights from language modeling
2025 (English)In: Food Quality and Preference, ISSN 0950-3293, E-ISSN 1873-6343, Vol. 124, article id 105357Article in journal (Refereed) Published
Abstract [en]
Chemosensory sensations are often hard to describe and quantify. Language models may facilitate a systematic understanding of sensory descriptions. We accessed consumer and expert reviews of wine, perfume, and food products (English language; about 68 million words in total) and analyzed their sensory descriptions. Using a novel data-driven method based on natural language data, we compared the three chemosensory vocabularies (wine, perfume, food) with respect to their vocabulary overlap and semantic properties, and explored their semantic spaces. The three vocabularies primarily differ with respect to domain specificity, concreteness, descriptor type preference and degree of gustatory vs. olfactory association. Wine vocabulary primarily distinguishes between white wine and red wine flavors and qualities. Food vocabulary separates drinkable and edible food products and ingredients, on the one hand, and savory and non-savory products, on the other. A salient distinction in all three vocabularies is between concrete and abstract/evaluative terms. Valence also plays a role in the semantic spaces of all three vocabularies, but valence is less prominent here than in general olfactory vocabulary. Our method allows a systematic comparison of sensory descriptors in the three product domains and provides a data-driven approach to derive sensory lexicons that can be applied by sensory scientists.
Place, publisher, year, edition, pages
Elsevier Ltd , 2025. Vol. 124, article id 105357
National Category
Food Science
Identifiers
URN: urn:nbn:se:ri:diva-76154DOI: 10.1016/j.foodqual.2024.105357Scopus ID: 2-s2.0-85208399146OAI: oai:DiVA.org:ri-76154DiVA, id: diva2:1915693
Note
An earlier version of this work were presented at the 15th Pangborn Sensory Science Symposium 2023, Nantes, France. The computations were enabled by resources provided by the Swedish National Infrastructure for Computing (SNIC) at the PDC Center for High Performance Computing, KTH Royal Institute of Technology, partially funded by grants from the Swedish Research Council to T.H. (2021-03440) and J.K. O. (2020-00266), and from the Knut and Alice Wallenberg Foundation to J.K.O. (2016:0229).
2024-11-252024-11-252025-09-23Bibliographically approved