Diving into the code
Goal: gather app features from our application corpora and analyze how they change over time.
Remember, we have over 100k versions of apps.
For some people, it's not immediately apparent why this is useful.
As is the case with heavy data mining.
Over 40% of the words in Instagram are Emoji.
Say, didn't Heartbleed happen around that time?
.dex -> .jar
| Procyon decompiler
Unreadable XML -> XML
There are actually no tools that will do this entire stack.
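Since no single tool covers the whole stack, one option is a small driver script that chains the stages. The following is a minimal sketch, not the tooling used in the talk: `run_stages`, the stage names, and the timeout value are hypothetical, and the demo uses harmless placeholder commands where the real converter and decompiler invocations would go. A per-stage timeout guards against a stage that never finishes.

```python
import subprocess
import sys


def run_stages(stages, timeout_s):
    """Run (name, argv) stages in order; stop at the first problem.

    Returns (None, "ok") if every stage succeeds, otherwise
    (stage_name, reason) where reason is "timeout", "failed", or "missing".
    """
    for name, argv in stages:
        try:
            # check=True raises on a non-zero exit code;
            # timeout kills the child process and raises TimeoutExpired.
            subprocess.run(argv, check=True, timeout=timeout_s)
        except subprocess.TimeoutExpired:
            return (name, "timeout")
        except subprocess.CalledProcessError:
            return (name, "failed")
        except OSError:
            # The executable could not be started (for example, not installed).
            return (name, "missing")
    return (None, "ok")


# Placeholder stages (assumption): each one would be replaced by the real
# command line for that step (.dex -> .jar, decompile, XML decoding).
demo_stages = [
    ("dex_to_jar", [sys.executable, "-c", "pass"]),
    ("decompile", [sys.executable, "-c", "pass"]),
    ("decode_xml", [sys.executable, "-c", "pass"]),
]
print(run_stages(demo_stages, timeout_s=600))
```

Returning the name of the failing stage makes it easy to tally, across a large corpus, which step breaks most often.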
Takes ~10 minutes/app and up to ∞
10 min * 100k apps = 1 million minutes
= ~2 years
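The estimate above can be checked with a few lines of arithmetic. This is only a back-of-the-envelope sketch: the function name `estimate_years` and the `workers` parameter are hypothetical, and dividing by a worker count assumes the apps can be processed fully in parallel, which the hardware may not allow.

```python
MINUTES_PER_DAY = 60 * 24
DAYS_PER_YEAR = 365


def estimate_years(minutes_per_app, app_count, workers=1):
    """Rough wall-clock estimate in years, assuming perfect parallelism."""
    total_minutes = minutes_per_app * app_count
    total_days = total_minutes / MINUTES_PER_DAY
    return total_days / DAYS_PER_YEAR / workers


# Figures from the slide: ~10 minutes per app, 100k app versions.
total_minutes = 10 * 100_000
years_single_machine = estimate_years(10, 100_000)
print(total_minutes, round(years_single_machine, 1))
```

On a single machine this comes out to roughly 694 days, which is where the "~2 years" figure comes from.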
Finding key terms within code.
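One simple way to find key terms is to scan the decompiled output for a list of words and count the hits. The sketch below is an illustration only, not the method used in the talk: `count_key_terms`, the file suffixes, and the example terms are all assumptions.

```python
import re
from collections import Counter
from pathlib import Path


def count_key_terms(root, terms, suffixes=(".java", ".xml")):
    """Count case-insensitive occurrences of each term under root."""
    counts = Counter()
    if not terms:
        return counts
    # Longest terms first, so a short term cannot shadow a longer one
    # that starts with the same characters.
    ordered = sorted(terms, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(t) for t in ordered), re.IGNORECASE)
    for path in Path(root).rglob("*"):
        if path.is_file() and path.suffix in suffixes:
            # Decompiled output can contain odd bytes; replace them, don't crash.
            text = path.read_text(encoding="utf-8", errors="replace")
            for match in pattern.finditer(text):
                counts[match.group(0).lower()] += 1
    return counts
```

Calling `count_key_terms("decompiled/some_app", ["password", "OpenSSL"])` (a hypothetical path) returns a `Counter` keyed by the lowercased term, which can then be compared across app versions.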
Useful for storing passwords
Can only test in production
The process was so intensive that the hardware became the limiting factor
The decompiler world is a dark world
Begin data analysis