Cross-language Clone Detection for Mobile Apps

dc.contributor.authorStephannie Jimenez
dc.contributor.authorGordana Rakić
dc.contributor.authorSilvia Takahashi
dc.contributor.authorNicolás Cardozo
dc.coverage.spatialBolivia
dc.date.accessioned2026-03-22T19:01:12Z
dc.date.available2026-03-22T19:01:12Z
dc.date.issued2023
dc.description.abstractClone detection provides insight about replicated fragments in a code base. With the rise of multi-language code bases, new techniques addressing cross-language code clone detection enable the analysis of polyglot systems. Such techniques have not yet been applied to the mobile apps’ domain, which are naturally polyglot. Native mobile app developers must synchronize their code base in at least two different programming languages. App synchronization is a difficult and time-consuming maintenance task, as features can rapidly diverge between platforms, and feature identification must be performed manually. Our goal is to provide an analysis framework to reduce the impact of app synchronization. A first step in this direction consists on a structural algorithm for cross-language clone detection exploiting the idea behind enriched concrete syntax trees. Such trees are used as a common intermediate representation built from programming languages’ grammars, to detect similarities between app code bases. Our technique finds code similarities with 79% precision for controlled tests where Type 1-3 clones are manually injected for the analysis of both single- and cross-language cases for Kotlin and Dart. We evaluate our tool on a corpus of 52 mobile apps identifying code similarities with a precision of 65% to 84% for the full application logic.
dc.identifier.doi10.5753/cibse.2023.24696
dc.identifier.urihttps://doi.org/10.5753/cibse.2023.24696
dc.identifier.urihttps://andeanlibrary.org/handle/123456789/73571
dc.language.isoen
dc.sourceUniversidad de Los Andes
dc.subjectComputer science
dc.subjectPolyglot
dc.subjectProgramming language
dc.subjectAbstract syntax tree
dc.subjectCode (set theory)
dc.subjectSyntax
dc.subjectCode generation
dc.subjectSynchronization (alternating current)
dc.subjectNatural language processing
dc.subjectArtificial intelligence
dc.titleCross-language Clone Detection for Mobile Apps
dc.typearticle

Files