Reliable Aggregation on Network Traffic for Web Based Knowledge Discovery

Authors:  S. Yu, S. James, Y. Tian, W. Dou

Book Title:  Reliable Knoweldge Discovery

Date Accepted: September 15 2011

Abstract

Web traffic information, such as website popularity, flow and congestion, can provide useful insights contributing to our understanding of cyberspace.  Due to the volume of information, it is useful to aggregate the data flow at various sources over a given time interval, which we can then use to make our analyses. A key problem, however is that such information can be distorted by the presence of illegitimate traffic, e.g. botnet recruitment scanning, DDoS attack traffic, etc.  An important consideration in web related knowledge discovery then is the robustness of the aggregation method, which in turn may be affected by the reliability of network traffic data. In this chapter, we present some similarity-based aggregation functions which are suited to the aggregation of traffic flows.  As these functions use similarity or the distance between data inputs, we then present some recently developed information theoretical indices which can be used to discriminate between illegitimate and benign traffic.

Advertisements

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google+ photo

You are commenting using your Google+ account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s