Abstract
Tor is increasingly used on the Internet as a means of accessing illicit or illegal services. If enacted by employees, such use may lead to negative impact on the organization. By its nature. Tor traffic is encrypted multiple times before being sent across networks to reach a destination. Therefore it may be impossible to detect the nature of a Tor user's online activities. Nevertheless, such users cannot hide the fact that they are using Tor. This paper proposes a novel data payload analysis as a means of classifying Tor traffic using machine learning. To this end, we consider the characteristics of the encrypted data payload for Tor and encrypted nonTor packets from 8 different applications and extract features to train our machine learning model. Our results indicate that, contrary to the commonsense assumption that Tor packets resemble other encrypted packets, such payload content can be used to distinguish between Tor and nonTor packets.
| Original language | English |
|---|---|
| Title of host publication | 2021 National Computing Colleges Conference (NCCC) |
| Place of Publication | Piscataway, N.J. |
| Publisher | IEEE |
| ISBN (Electronic) | 9781728167190 |
| ISBN (Print) | 9781728167190 |
| DOIs | |
| Publication status | Published - 27 Mar 2021 |
Keywords
- machine learning
- payload features
- Tor
- traffic classification
Fingerprint
Dive into the research topics of 'Tor traffic classification based on encrypted payload characteristics'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver