Real-time uber data analysis of popular uber locations in kubernetes environment

dc.contributor.authorGunawardena, TM
dc.contributor.authorJayasena, KPN
dc.contributor.editorKarunananda, AS
dc.contributor.editorTalagala, PD
dc.date.accessioned2022-11-10T09:58:19Z
dc.date.available2022-11-10T09:58:19Z
dc.date.issued2020-12
dc.description.abstractData is crucial in today's business and technology environment. There is a growing demand for Big Data applications to extract and evaluate information, which will provide the necessary knowledge that will help us make important rational decisions. These ideas emerged at the beginning of the 21st century, and every technological giant is now exploiting Big Data technologies. Big Data refers to huge and broad data collections that can be organized or unstructured. Big Data analytics is the method of analyzing massive data sets to highlight trends and patterns. Uber is using real-time Big Data to perfect its processes, from calculating Uber's pricing to finding the optimal positioning of taxis to maximize profits. Real-time data analysis is very challenging for the implementation because we need to process data in real-time, if we use Big Data, it is more complex than before. Implementation of real-time data analysis by Uber to identify their popular pickups would be advantageous in various ways. It will require high-performance platform to run their application. So far no research has been done on real-time analysis for identifying popular Uber locations within Big Data in a distributed environment, particularly on the Kubernetes environment. To address these issues, we have created a machine learning model with a Spark framework to identify the popular Uber locations and use this model to analyze real-time streaming Uber data and deploy this system on Google Dataproc with the different number of worker nodes with enabling Kubernetes and without Kubernetes environment. With the proposed Kubernetes environment and by increasing the worker nodes of Dataproc clusters, the performance can be significantly improved. The future development will consist of visualizing the real-time popular Uber locations on Google map.en_US
dc.identifier.citationT. M. Gunawardena and K. P. N. Jayasena, "Real-Time Uber Data Analysis of Popular Uber Locations in Kubernetes Environment," 2020 5th International Conference on Information Technology Research (ICITR), 2020, pp. 1-6, doi: 10.1109/ICITR51448.2020.9310851.en_US
dc.identifier.conference5th International Conference in Information Technology Research 2020en_US
dc.identifier.departmentInformation Technology Research Unit, Faculty of Information Technology, University of Moratuwa.en_US
dc.identifier.doidoi: 10.1109/ICITR51448.2020.9310851en_US
dc.identifier.facultyITen_US
dc.identifier.placeMoratuwa, Sri Lankaen_US
dc.identifier.proceedingProceedings of the 5th International Conference in Information Technology Research 2020en_US
dc.identifier.urihttp://dl.lib.uom.lk/handle/123/19481
dc.identifier.year2020en_US
dc.language.isoenen_US
dc.publisherFaculty of Information Technology, University of Moratuwa.en_US
dc.relation.urihttps://ieeexplore.ieee.org/document/9310851/en_US
dc.subjectUberen_US
dc.subjectKubernetesen_US
dc.subjectMachine learningen_US
dc.subjectSparken_US
dc.subjectreal-timeen_US
dc.subjectBig Dataen_US
dc.subjectDistributed environmenten_US
dc.titleReal-time uber data analysis of popular uber locations in kubernetes environmenten_US
dc.typeConference-Full-texten_US

Files

Collections