閱讀全文 | |
篇名 |
Big Data Trip Classification on the New York City Taxi and Uber Sensor Network
|
---|---|
並列篇名 | Big Data Trip Classification on the New York City Taxi and Uber Sensor Network |
作者 | Huiyu Sun、Siyuan Hu、Suzanne McIntosh、Yi Cao |
英文摘要 | Millions of trips are made every day by taxis and Uber in New York City. We first employ big data technologies to analyze this vast dataset: Apache Spark is used for data processing and classification, Apache Hive is used for data storage, and MapReduce is used for data profiling. Since taxis and Uber are equipped with GPS sensors, we then visualize a mobile sensor network over New York City separated into fine-sized regions each acting as a mobile sensing node. Each location on the network falls into a region and is classified into one of three categories based on which service dominates the particular region: Yellow taxi, Green taxi, or Uber. We utilize logistic regression to classify a region into one of the three categories. Our classification algorithm is then used to analyze the interaction between taxi and Uber, for example to quantify the expansion of Uber. Experiments run on the Spark cluster show our classifier achieves an accuracy of over 85% scored on the 2014 taxi and Uber dataset. Finally, we propose a trip recommendation system for users using classification results together with a web service application. |
起訖頁 | 591-598 |
關鍵詞 | Big data、Classification、Mobile sensor network、NYC taxi、Uber |
刊名 | 網際網路技術學刊 |
期數 | 201803 (19:2期) |
出版單位 | 台灣學術網路管理委員會 |
DOI |
|
QR Code | |
該期刊 上一篇
| Knowledge Structure and Its Impact on Knowledge Transfer in the Big Data Environment |
該期刊 下一篇
| Coverless Steganography Based on English Texts Using Binary Tags Protocol |