Skip to main navigation Skip to search Skip to main content

Tor traffic classification based on encrypted payload characteristics

Pitpimon Choorod, George Weir

Research output: Chapter in Book/Report/Conference proceedingConference contribution book

Abstract

Tor is increasingly used on the Internet as a means of accessing illicit or illegal services. If enacted by employees, such use may lead to negative impact on the organization. By its nature. Tor traffic is encrypted multiple times before being sent across networks to reach a destination. Therefore it may be impossible to detect the nature of a Tor user's online activities. Nevertheless, such users cannot hide the fact that they are using Tor. This paper proposes a novel data payload analysis as a means of classifying Tor traffic using machine learning. To this end, we consider the characteristics of the encrypted data payload for Tor and encrypted nonTor packets from 8 different applications and extract features to train our machine learning model. Our results indicate that, contrary to the commonsense assumption that Tor packets resemble other encrypted packets, such payload content can be used to distinguish between Tor and nonTor packets.
Original languageEnglish
Title of host publication2021 National Computing Colleges Conference (NCCC)
Place of PublicationPiscataway, N.J.
PublisherIEEE
ISBN (Electronic)9781728167190
ISBN (Print)9781728167190
DOIs
Publication statusPublished - 27 Mar 2021

Keywords

  • machine learning
  • payload features
  • Tor
  • traffic classification

Fingerprint

Dive into the research topics of 'Tor traffic classification based on encrypted payload characteristics'. Together they form a unique fingerprint.

Cite this