SCIENTIA GEOGRAPHICA SINICA ›› 2017, Vol. 37 ›› Issue (9): 1310-1317.doi: 10.13249/j.cnki.sgs.2017.09.003

Special Issue: 地理大数据

• Orginal Article • Previous Articles     Next Articles

Spatial Association Analysis for Urban Service Based on Big Data

Weihua Liao1(), Xin Nie2   

  1. 1. College of Mathematics and Information Science, Guangxi University, Nanning 530004, Guangxi, China
    2.School of Public Administration, Guangxi University, Nanning 530004, Guangxi, China
  • Received:2016-11-14 Revised:2017-03-04 Online:2017-11-20 Published:2017-11-20
  • Supported by:
    National Natural Sciences Foundation of China (71363005), Social Nature Sciences Foundation of China (13CGL109).

Abstract:

With the development of information technology, big data has become a research focus of all sectors. There is an increasing demand for big data in the urban planning management process. Big data acquisition and calculation is a key technology in the process of the smart city construction. This article covers the following major aspects: 1) Distance table linking to urban service physical store table is used to establish spatial association frequent rules model based on the concept of spatial neighbouring point and the property of spatial point entity; the article also introduces the method and procedure of how spatial frequent items and spatial association rules appear in urban service spatial association model; 2) “For xml path” technology is used in SQL Server to build spatial transaction database because transaction database is needed in association rules computing; 3) Python+sqlite3+lxml+BeautifulSoup technology is used to crawl the online data of the companies in Nanning which have all of their public information registered on “Baidu Nuomi” (https://nn.nuomi.com/); 4) Apriori algorithm is applied to analyze spatial frequent items and spatial association rules in urban service industry of 6 distance thresholds between 10 to 1 000 meters with the obtained data. In case study, the top six registered businesses in “Baidu Nuomi” are snacks and fast food, beauty, hotels, bakeries, sweets and drinks, budget hotels. The spatial association rule of {budget hotels, hotels} has a high degree of confidence and a high upgrading degree in the distance threshold of 10 m and 50 m, being a set of strong spatial association rules. This illustrates the Nanning hotel industry has the characteristics of a compact layout, with all kinds of hotels being together. The spatial association rule of {sweet drinks, snacks and fast food} is a set of strong spatial association rules in the distance threshold of 50 m, 500 m and 1 000 m. Snacks and fast food frequency is very high, especially in the succeeding rules with high support degree. In different distance thresholds, as a kind of mass consumer entity service, snacks and fast food restaurants are distributed around various industries. Because the lift degree of these rules is about 1, the snacks and fast food industry has the characteristics of no connection with other industries. This study is an attempt to use ubiquitous web data around us to analyze city management. Researchers can get a steady flow of big data so as to better carry out the studies on city big data in real time with this methods and thoughts.

Key words: big data, association rules, Apriori algorithm, service industry, Nanning City

CLC Number: 

  • F290