A graph-based dataset of commit history of real-world Android apps

Franz-Xaver Geiger, Ivano Malavolta, Luca Pascarella, Fabio Palomba, D. Di Nucci, Alberto Bacchelli

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > Scientific > peer-review

43 Citations (Scopus)


Obtaining a good dataset to conduct empirical studies on the engineering of Android apps is an open challenge. To start tackling this challenge, we present AndroidTimeMachine, the first self-contained, publicly available dataset weaving together spread-out data sources about real-world, open-source Android apps. Encoded as a graph-based database, AndroidTimeMachine covers 8,431 real open-source Android apps and contains: (i) metadata about the apps' GitHub projects, (ii) Git repositories with full commit history, and (iii) metadata extracted from the Google Play store, such as app ratings and permissions.
Original language: English
Title of host publication: Proceedings - International Conference on Software Engineering
Publication status: Published - 2018


