
Wine Informatics Dr. Bernard Chen Ph.D. University of Central Arkansas.


1 Wine Informatics Dr. Bernard Chen Ph.D. University of Central Arkansas

2 Data science Data science is a field of study that combines techniques and theories from distinct areas, such as data mining, the scientific method, mathematics and statistics, visualization, natural language processing, and domain knowledge, to discover useful information in domain-related data.

3 Domain Knowledge in Wine The quality of a wine is usually assured by wine certification, which is generally assessed through physicochemical and sensory tests. Existing data mining research focuses on the physicochemical laboratory tests much more than on the sensory tests.

4 Domain Knowledge in Wine It is very interesting to mine the sensory testing notes for useful information that answers questions such as “What makes a wine a 90+ one?”, “What characteristics are shared by 90+ Napa Cabernet Sauvignon wines?”, “Which groups of wines share similarities?”, and “Which characteristics distinguish wines from France from wines from Italy?”

5 Domain Knowledge in Wine The success of sensory-related wine data science research relies on consistent reviews from prestigious experts. Several popular wine magazines, such as Wine Spectator [13], Wine Advocate [14], and Decanter [15], publish widely accepted sensory reviews of the wines produced every year.

6 Wine Spectator Review Example Kosta Browne Pinot Noir Sonoma Coast 2009 Ripe and deeply flavored, concentrated and well-structured, this full-bodied red offers a complex mix of black cherry, wild berry and raspberry fruit that's pure and persistent, ending with a pebbly note and firm tannins. Drink now through 2018. 5,818 cases made.

7 Wine Spectator Our first dataset is compiled from the list of “Top 100 Wines of 2011” [16] by Wine Spectator, a lifestyle magazine that focuses on wine and wine culture. Their reviews are straight and to the point.

8 Review Example Kosta Browne Pinot Noir Sonoma Coast 2009 Ripe and deeply flavored, concentrated and well-structured, this full-bodied red offers a complex mix of black cherry, wild berry and raspberry fruit that's pure and persistent, ending with a pebbly note and firm tannins. Drink now through 2018. 5,818 cases made.

9 Ann C. Noble’s Wine Aroma Wheel

10 Our own wine wheel Based on the “Top 100 Wines of 2011”, we analyzed all one hundred wine reviews, added all necessary categories and subcategories, and came up with a total of 547 distinct attributes. Looking at the finished list, we noticed many cases where groups of attributes were really just variations of the same thing. An example would be the following three attributes: FRESHLY-CUT APPLE, RIPE APPLE, and APPLE.
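One way to fold such variants into a single attribute is a lookup table from raw phrases to a normalized name. The sketch below is only an illustration: the `SYNONYMS` table and the `normalize` helper are hypothetical names, and only the three APPLE variants come from the slide.

```python
# Hypothetical lookup table: raw attribute phrase -> normalized attribute.
# Only the three APPLE variants are taken from the slide; a real table
# would cover the full attribute list.
SYNONYMS = {
    "FRESHLY-CUT APPLE": "APPLE",
    "RIPE APPLE": "APPLE",
    "APPLE": "APPLE",
}


def normalize(attributes):
    """Map raw attributes to normalized names, dropping duplicates.

    Unknown attributes are kept unchanged (upper-cased); the order of
    first appearance is preserved.
    """
    result = []
    for raw in attributes:
        key = raw.strip().upper()
        canonical = SYNONYMS.get(key, key)
        if canonical not in result:
            result.append(canonical)
    return result


raw_attributes = ["Freshly-cut apple", "ripe apple", "APPLE", "black cherry"]
normalized = normalize(raw_attributes)
print(normalized)
```

A plain dictionary keeps the merging decisions explicit and easy to review, which matters when the groupings are made by hand from the reviews.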

11 Hierarchical Clustering [Figures: dendrogram; Venn diagram of clustered data] From http://www.stat.unc.edu/postscript/papers/marron/Stat321FDA/RimaIzempresentation.ppt
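As a rough illustration of how a dendrogram is built, the sketch below runs agglomerative (bottom-up) hierarchical clustering on binary attribute vectors. It rests on several assumptions that the slides do not confirm: the linkage rule (single linkage here), the Jaccard distance, and the four toy wine vectors are all chosen for illustration only.

```python
def jaccard(a, b):
    """Jaccard distance between two equal-length 0/1 vectors (assumed measure)."""
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    return 0.0 if either == 0 else 1.0 - both / either


def agglomerate(vectors):
    """Single-linkage agglomerative clustering.

    Returns the merge history as (cluster_a, cluster_b, distance) tuples,
    where each cluster is a tuple of item indices. The history is the
    information a dendrogram draws.
    """
    clusters = [(i,) for i in range(len(vectors))]
    merges = []
    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                # Single linkage: distance between the closest pair of members.
                d = min(
                    jaccard(vectors[p], vectors[q])
                    for p in clusters[i]
                    for q in clusters[j]
                )
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        merges.append((clusters[i], clusters[j], d))
        merged = tuple(sorted(clusters[i] + clusters[j]))
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return merges


# Toy data (hypothetical): rows are wines, columns are attributes.
wines = [
    [1, 1, 0, 0],
    [1, 1, 1, 0],
    [0, 0, 1, 1],
    [0, 0, 0, 1],
]
merges = agglomerate(wines)
for left, right, dist in merges:
    print(left, right, round(dist, 3))
```

Cutting the merge history at a chosen distance gives the flat clusters; a different linkage rule (complete or average) would only change the `min` in the inner loop.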

12 Distance Measure

13 Distance Measure Example
WINE  | CHERRY | CHEWY TANNINS | BEAUTY
WINE1 | 1      | 1             | 1
WINE2 | 0      | 0             | 1
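The formula on the previous slide did not survive in this transcript, so the measure actually used is not known here. As a hedged illustration only, the sketch below applies the Jaccard distance, a common choice for 0/1 attribute vectors, to the two wines in the example; the function name `jaccard_distance` is hypothetical.

```python
def jaccard_distance(a, b):
    """Jaccard distance between two equal-length 0/1 attribute vectors.

    Assumed measure, not confirmed by the slides:
    1 - (attributes present in both) / (attributes present in either).
    """
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    if either == 0:
        # Two wines with no attributes at all are treated as identical.
        return 0.0
    return 1.0 - both / either


# Columns: CHERRY, CHEWY TANNINS, BEAUTY (values from the example slide).
wine1 = [1, 1, 1]
wine2 = [0, 0, 1]

distance = jaccard_distance(wine1, wine2)
print(round(distance, 4))
```

Here the two wines share one attribute (BEAUTY) out of three present in either, so the distance under this assumed measure is 1 - 1/3.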

14 Clustering Results

15 [Figure with labels 1 2 3 4 5 6 7 8 9 10 11]

16 Clustering Results
Ref # | Vintage | Type | Varietal
1     | 2008    | RED  | MERLOT (.53) - CABERNET FRANC (.29) - CABERNET SAUVIGNON (.13) - MALBEC (.04) - PETIT VERDOT (.01)
2     | 2008    | RED  | CABERNET SAUVIGNON
3     | 2009    | RED  | PINOT NOIR
4     | 2007    | RED  | CABERNET SAUVIGNON
5     | 2007    | RED  | SANGIOVESE (.90) - CANAIOLO/COLORINO (.10)
6     | 2004    | RED  | TEMPRANILLO

Ref # | World | Country       | Region          | Alcohol | Price | Drink Begin | Drink End
1     | NEW   | United States | Washington      |         | $35   | NOW         | 2020
2     | NEW   | United States | Washington      | 14.5%   | $37   | NOW         | 2018
3     | NEW   | United States | California      | 13.9%   | $45   | NOW         | 2019
4     | NEW   | United States | Washington      | 14.6%   | $32   | NOW         | 2019
5     | OLD   | Italy         | Tuscany         | 14%     | $22   | NOW         | 2022
6     | OLD   | Spain         | Castilla y Leon |         | $15   | NOW         | 2015

17 Clustering Results CLUSTER #3 – 6 Instances – Attribute Information
Attribute        | Number of Wines | Attribute Weight
BLACKBERRY       | 6               | 3
LONG FINISH      | 5               | 2
SPICE            | 4               | 3
FRUIT            | 3               | 1
BLACK CHERRY     | 3               | 3
RED              | 3               | 2
FOCUSED          | 3               | 1
EXCELLENT FINISH | 3               | 2
RIPE             | 3               | 1
TANNINS_LOW      | 3               | 2
TANNINS_HIGH     | 3               | 2
Suggestions This cluster represents the fruity aspect of new-world wines, focusing on powerful notes of blackberry and black cherry, as well as a commanding finish.

18 Conclusion In this paper, we discuss Wine Reviews and how their attributes can play an integral role in grouping different wines together. We show that when using only the attributes of a wine review, we can aggregate wines together that have similar world region, monetary value, vintage, type, and varietal.

19 Thanks Questions?

