Big Data

Why is this method important?
The integration of Big Data and analytical tools allows researchers to capture real-time tourist behaviors and process vast amounts of unstructured data (text, images, video) that traditional surveys cannot handle. These methods are crucial for “nowcasting” tourism demand, understanding sentiment at scale, and automating the extraction of semantic meaning from user-generated content. AI and machine learning provide the computational power necessary to identify complex, non-linear patterns in high-dimensional datasets, offering actionable insights for experience design and destination management.
 
What has our team done so far? 
We have been at the forefront of applying AI and machine learning to big data tourism research. We pioneered the use of search engine query data (e.g., Google Trends, Baidu Index) and web traffic data for high-frequency tourism demand forecasting. In the realm of unstructured data, we employed Convolutional Neural Networks (CNN), a deep learning deep learning algorithm, to classify health-related issues from social media posts to monitor tourist experiences under air pollution. Our team utilized Support Vector Machines (SVM) to measure topic matching in management responses. Recently, we leveraged Large Language Models (LLMs), specifically GPT-4o, to perform Chinese word segmentation and construct a tourism lexicon, enabling the quantification of “Government Attention to Tourism” from policy documents. 

Recommendation

Lee, Eunji; Yang, Yang

2026

JOURNAL OF TRAVEL & TOURISM MARKETING

Yang, Yang; Tan, Karen Pei-Sze; Liu, Yi Vanessa

2026

TOURISM MANAGEMENT

Yang, Lisi; Yang, Yang; Huang, Xijia; Yan, Kai

2026

TOURISM MANAGEMENT

Related Presentations

Spatial Analytics

Workshop on Informatics, Data Science, and Economics in Hospitality and Tourism Research

University of Houston, Houston, TX

COVID19tourism Index and its application in tourism management

University of Perpignan

Perpignan, France

Machine Learning and Artificial Intelligence Research in Tourism and Hospitality

University of Macau

Macau (Online)

Tourist behavior analysis using online user generated data

Kyung Hee University

Seoul, Korea (Online)

Related Resources

Bike sharing and tourism impact tool

Tool link: https://uflyy.github.io/bike-sharing/

The tool is an interactive, data-driven dashboard designed to visualize the synergistic relationship between urban micro-mobility and tourism. Grounded in empirical econometric research, the tool uses Chicago as a case study to demonstrate how bike-sharing systems impact the demand and visitor experience of nearby tourist attractions. It features interactive spatial mapping and a predictive policy simulator, bridging the gap between academic research and smart city tourism management.

Rating Adjustment Tool

Tool link: https://uflyy.github.io/rating-adjustment/

 

The Rating Adjustment Tool is an advanced analytical web application designed to standardize hotel online reviews by correcting for “scaling heterogeneity”—the phenomenon where different types of reviewers interpret and use rating scales differently. Powered by a Hierarchical Ordered Probit (HOPIT) model grounded in peer-reviewed academic research , the tool mathematically controls for systematic response biases tied to traveler demographics (such as age and gender) and trip characteristics (such as travel type and reviewer expertise). By offering individual review adjustments, hotel-level aggregate score calculations, and batch CSV processing, the tool effectively translates subjective, raw user ratings into objective, comparable latent scores and standardized 1–5 metrics, ensuring fairer and more accurate hotel evaluations.

This rating adjustment tool is built on the theoretical framework and empirical results from the following paper:

Leung, X. Y., & Yang, Y. (2020). Are all five points equal? Scaling heterogeneity in hotel online ratings. International Journal of Hospitality Management, 88, 102539.

Green Hospitality Dashboard

Tool Address: https://uflyy.github.io/green-hotels/

How to Use This Dashboard

  • Global Map Aggregation: At the global or country level, cities are represented by aggregated markers scaled by hotel volume. Click a city marker to seamlessly zoom in and load individual hotel properties without clutter.
  • Dynamic Popups: Popups intelligently pan into the view to ensure long lists of Green Features or Hotel Styles are never clipped by the screen edges.
  • Deep-Dive Graphing: Navigate to the Graph View to instantly evaluate structural differences across subgroups using Native Boxplots (min, Q1, median, Q3, max), Standardized Radar Profiling, and 100% Stacked Bar Charts.
  • Top 10 Feature Slicing: When grouping by specific environmental practices (Green Features), the system automatically filters for the top 10 most adopted measures to preserve visual clarity.