Machine Learning for 802.11 Troubleshooting

This paper presents a machine learning-based framework for diagnosing common WiFi network issues, such as contention and low signal-to-noise ratio, using MAC-layer data. The proposed solution offers both passive and active detection mechanisms, achieving high accuracy rates in identifying performance problems. The study emphasizes the need for automated techniques to assist home users in troubleshooting complex WiFi environments, leveraging data-driven analysis and various classification algorithms.

2019 16th IEEE Annual Consumer Communications & Networking Conference (CCNC)

On the Employment of Machine Learning Techniques for Troubleshooting WiFi Networks
Ilias Syrigos∗, Nikos Sakellariou†, Stratos Keranidis‡ and Thanasis Korakis§
Department of Electrical and Computer Engineering, University of Thessaly
Greece
Email: ∗ ilsirigo@[Link], † sakellariou@[Link], ‡ efkerani@[Link], § korakis@[Link]

978-1-5386-5553-5/19/$31.00 ©2019 IEEE

Abstract: The rapidly increasing popularity of 802.11 WLANs, along with the co-existence of multiple heterogeneous devices in the unlicensed frequency bands, has created unprecedented levels of congestion, especially in densely populated urban areas. Under such complex setups, WLAN under-performance issues experienced by end-users are hard to interpret even by experts. In this paper, we develop an intelligent, easy-to-deploy mechanism that takes advantage of MAC-layer exported data and employs machine learning techniques to accurately diagnose the five most common WiFi pathologies (contention, low-SNR, non-802.11 interference, etc.). The collected data are fed to four different classification algorithms, whose hyper-parameters we fine-tune to optimize precision and accuracy. The resulting solution provides two different mechanisms, with the first targeting low-overhead passive detection and the second offering more accurate performance relying on active probing. Detection performance is evaluated through extensive testbed experiments, which show that the K-Nearest Neighbors classifier achieves almost 100% accuracy and precision for the active probing and 95% accuracy and precision for the passive detection across the five considered pathologies.

I. INTRODUCTION

To date, the IEEE 802.11 standard, most commonly known as WiFi, has grown enormously and has found its way into virtually every home and enterprise. Private and public WiFi networks are responsible for transferring a significant percentage, 45%, of the total IP traffic and are expected to grow even more, turning this percentage to 49% by 2020 [1].

In addition, the growing adoption of IoT through the emergence of smart TVs, wearable devices (smartwatches, activity trackers), security cameras, energy monitoring devices and even smart light bulbs has greatly expanded common home WLANs. All these devices, although having different characteristics and often operating under a different protocol (WiFi, ZigBee, Bluetooth), are bound to coexist in dense urban environments and in the limited unlicensed wireless spectrum. However, this coexistence can often become problematic for WiFi users, due to the underlying contention for access to the wireless medium or due to the interference caused by non-WiFi devices (even from microwave ovens). Furthermore, inexperienced users are prone to mistakes when it comes to deployment of Access Points (AP), leading to poor channel conditions for their end-devices. The flawless operation of a home WiFi network becomes even more complex when we also consider inherent impairments of the 802.11 protocol such as the Hidden Terminal and Capture Effect phenomena. All these factors contribute to the frequently encountered performance degradation of end-user applications, which users lacking sufficient expertise, as most do, find difficult to attribute to a specific cause.

Detection and troubleshooting of performance problems in WiFi networks is a particularly difficult and frustrating task. This is due to the complex and dynamic nature of the wireless medium, which demands the collection of specific information from the lower levels of the 802.11 protocol that is hard to interpret even for skilled experts. In enterprise networks, troubleshooting is the responsibility of network administrators who have a clear picture of the network topology, sufficient knowledge of the 802.11 protocol operation and specialized equipment. Thus, they are in a position to draw safe conclusions. Home users, on the other hand, are asked to solve a much more complex problem given the random deployment of home WLANs, along with the lack of expertise and equipment. Given the above and the ever-growing spread of WLANs, it becomes increasingly necessary to develop automated techniques and mechanisms that can detect the causes of performance degradation in today's dense and complex home wireless networks.

In this paper, we extend our previous work [2] on turning low-cost commercial APs into intelligent devices able to detect performance impairments, diagnose the underlying pathologies and potentially dynamically adjust operation in order to improve or even restore maximum performance. We achieve that by following a data-driven analysis [3] over the plentiful and freely available data that drivers of wireless cards export to user level. This MAC-layer data, exposed by AP devices, is collected as a part of the physical rate control mechanism and is exported to user level for debugging purposes. From the data available to us, we choose the part that best characterizes the root causes of performance degradation, specifically transmission attempts and percentage of successful transmissions. We then evaluate the employment and fine-tuning of popular classification machine learning techniques for diagnosing WiFi networks' pathologies.

As in our previous work, the five most important and well-known pathologies are replicated. This occurs in a controlled environment in our testbed by injecting traffic in the network covering a wide range of scenarios, in order to monitor and gather data of a significant size. This data is fed to four different models, each featuring one of the following notable classification algorithms: a) Decision Trees, b) Random Forests,
c) Support Vector Machines and d) K-Nearest Neighbors. The models are then cross-validated and their parameters are fine-tuned, in order to maximize accuracy and, most importantly, precision. Data is modeled in two modes, one suitable for passive monitoring and another suitable for active probing. Thus, we overcome our former work's limitation of having to inject traffic in the network for performing detection by offering the option of a passive mechanism, without excluding a more accurate, active probing option. All models in both data modes are trained on the gathered data and evaluated with regard to their accuracy and precision.

Our work aims at providing an intelligent framework that can be easily deployed on commercial AP devices, able to diagnose the underlying pathologies with high accuracy and precision. It differs from related approaches by offering the ability of passive detection and covering the whole range of WiFi pathologies.

The rest of the paper is organized as follows. Section II discusses related work. In Section III, a brief overview of 802.11 background information is presented. In Section IV, our methodology for obtaining data is given, followed by the evaluation of the classification models in Section V. Finally, we conclude in Section VI, commenting on our findings and proposing possible extensions.

II. RELATED WORK

There are a few similar approaches that have been followed in the literature for detecting causes of poor performance in WiFi networks. As an example, the work by Kanuparthy et al. [4] takes advantage of user-level information for distinguishing between the different 802.11 pathologies, by employing active probing and estimating simple metrics such as transmission rate. WiSlow [5] also exploits MAC-layer information gathered after injecting traffic to the network for discriminating between 802.11 and non-802.11 interference with high accuracy, when interfering devices are close to the suffering node. However, our work differs by providing a mode of passive pathology detection that does not incur any additional overhead and moreover does not impose any significant accuracy penalty.

Machine learning techniques have been heavily employed in recent years, wherever an abundance of data exists. WiFi is not an exception, with several frameworks being developed for either estimating key performance indicators of WLANs or classifying sources of underperformance. In [6], the authors apply the classification algorithms also considered in our work for characterizing and estimating WiFi latency. Artificial Neural Networks are used in [7] for estimating packet delivery rate, while a hidden Markov model estimates the probability of interference between 802.11 nodes from traces collected from multiple sniffers in [8].

In addition, several works have investigated the detection and classification of the various causes of poor performance in WiFi networks with the application of machine learning algorithms. However, most of them have focused on the identification of interference, especially cross-technology, using statistics gathered from commodity hardware or traces from monitoring devices. A notable example of such a work is [9], where the authors feed results of the spectral scan function performed by Atheros wireless cards to Decision Tree and SVM classifiers, in order to distinguish between non-WiFi devices sharing the unlicensed spectrum. The work in [10], based on information provided by commercial cards (sequences of receiver errors), aims at detecting sources of non-WiFi interference by the employment of Artificial Neural Networks and hidden Markov chains. Cross-technology interference detection has also been considered in 802.15.4 (ZigBee) networks [11], [12].

In contrast to the aforementioned body of work, our detection methodology takes a step further and considers all the potential pathologies. It covers 802.11 Contention, non-802.11 interference, low Signal-to-Noise ratio and 802.11-specific anomalies, such as the Hidden Terminal and Capture Effect phenomena, thus providing more insight into the underlying causes of underperformance.

III. 802.11 BACKGROUND INFORMATION

A. 802.11 Related Pathologies

The performance of an 802.11 device is mainly impacted by two factors. The first one is the availability of channel access opportunities, while the second one is the efficiency of the frame delivery, whenever the device is given the chance to transmit. Based on this key observation we can categorize pathologies into two classes: Medium Contention and Frame Loss.

1) Medium Contention: In this category, we consider pathologies that occur when a WiFi device senses the medium as busy and thus defers from transmitting. Pathologies of this type are frequently encountered in dense urban environments, as multiple WiFi networks are concentrated in small areas, while also coexisting with non-802.11 devices (ZigBee, Bluetooth, microwave ovens) that share the same limited unlicensed spectrum. Consequently, we consider two pathologies in this category, 802.11 Contention and Non-802.11 Contention.

2) Frame Loss: This category includes the pathologies that occur when 802.11 devices identify the medium as idle and attempt a transmission that ultimately fails due to the link conditions experienced at the side of the receiving device. As a consequence, a delay in the next transmission attempt takes place, due to the subsequent doubling of the Contention Window (CW). A primary reason for this failure can be the Low SNR conditions experienced on the receiving end. It can be attributed to either low signal power due to fading, path loss etc. or to high noise caused by interfering devices operating outside the sensing range of the WiFi device.

In addition, frame delivery failures may also be attributed to inherent 802.11 impairments such as the Hidden Terminal and Capture Effect phenomena that occur when concurrent transmissions lead to frame collisions. More specifically, the Hidden-Terminal phenomenon occurs when the receiving device lies within the transmission range of two active 802.11 devices that are mutually hidden and cannot sense each other, resulting in frame collisions. In cases that no remarkable difference is observed in the received signal
strength of colliding frames at the intermediate device, the Hidden-Terminal phenomenon appears symmetrically for both flows. However, the most frequently observed case is the Capture-effect phenomenon, where a considerable difference in RSSI values is observed, resulting in a higher probability of successful decoding for the high-power frames. As a result, the link capturing the medium experiences lower collision probability, thus accessing the medium more frequently and causing a higher performance penalty for the affected links. The categorization of the considered pathologies is depicted in Figure 1.

Figure 1. Taxonomy of IEEE 802.11 Pathologies.

IV. METHODOLOGY

We deploy our detection framework as a script running on the AP that implements two different mechanisms for sampling our defined metrics, an active and a passive one, both of them presented in the rest of the section. The samples are then given as input to a classification algorithm, which we have selected according to the analysis provided in Section V.

A. Sampled metrics

As in [2], we define the key metrics that are able to characterize WiFi pathologies based on the two key factors mentioned in Section III that impact 802.11 performance: a) transmission attempts and b) efficiency of the transmissions. We represent transmission attempts with the Normalized Channel Accesses (NCA) metric, defined as:

$$\mathrm{NCA} = \frac{\mathrm{CA}}{\mathrm{MCA}} \qquad (1)$$

where CA denotes the number of channel accesses per second, while MCA represents the maximum number of channel accesses per second, as analytically calculated from a performance model of 802.11. On the other hand, we represent the transmission efficiency with the Frame Delivery Ratio (FDR) metric, defined as:

$$\mathrm{FDR} = \frac{\mathrm{ST}}{\mathrm{CA}} \qquad (2)$$

where ST denotes the number of successful transmissions per second.

When the channel is sensed as idle and frames are correctly received on the receiving end, performance is ideal, thus NCA and FDR values approach 1. However, when channel access opportunities are limited, NCA values drop significantly. Restricted access to the wireless medium can occur in two cases: either due to the channel being sensed as busy, so transmissions are infrequent but still successful, meaning that FDR remains unaffected, or due to frame losses/collisions that subsequently decrease FDR values.

The aforementioned metrics are sampled by logging their values, as these are exported by the driver of the wireless card, ath9k, as part of the Minstrel rate control algorithm, to the debugging file system. During the experimentation described in Subsection IV-D, the logs of the metrics' values were transferred to an external server, where we performed the training and testing phases of the considered classification algorithms.

B. Active probing

The sampling of the previously defined metrics follows the same methodology as in [2], as we employed the Varying Bitrate Probing mechanism. By taking advantage of the relation between the PHY rate of the affected link and the proposed metrics, this active probing mechanism probes the wireless channel with multiple packet trains, where each train consists of several packets of a fixed length that are transmitted with all of the available single-stream modulation and coding schemes (MCS) of the 802.11n protocol {MCS0, MCS1, ..., MCS7}. We have implemented Varying Bitrate Probing on commercial AP devices, where we installed OpenWrt, a lightweight operating system for embedded devices, and created a script for generating saturated traffic transmitted with a specific modulation to a device acting as a station (STA).

C. Passive monitoring

The rate control algorithm used by the ath9k driver, Minstrel, is based on statistics and, more specifically, on the probability of successful transmission for determining the rate that provides the best throughput. Evidently, in order to acquire these statistics, it needs to sample all of the available MCS. This is done by randomly selecting 10% of the transmissions as sampling transmissions for evaluating a random MCS. We, in turn, take advantage of this fact to sample NCA and FDR values without the need for injecting probing traffic on the network. We sample every 60 s, as long as there are transmissions ongoing. This rather large sampling window allows time for more transmissions and subsequently more up-to-date values. It is worth noting that if there is little or no user-initiated traffic, then there are not enough samples. However, we argue that in that case there is no need for troubleshooting, as no problem is noticed.

D. Experimentation in controlled environment

In order to reproduce all of the considered pathologies and generate numerous data samples for feeding classification algorithms, we performed several experimental sessions for each pathology in our testbed. Our setup consisted of an OpenWrt-enabled wireless router, TP-Link TL-WR2543ND, featuring the AR9380 chipset, operating on both the 2.4 GHz and 5 GHz bands, supported by the ath9k driver, where Varying Bitrate Probing was implemented. Traffic was sent to a smartphone device acting as a STA that enabled us to
replicate a great variety of scenarios and topologies. In order to replicate the scenarios of contending and hidden nodes, we employed several wireless nodes available in our testbed. Our evaluation link, between the AP and the smartphone, is configured on channel 36 of the 5 GHz band, which is free from external interference in our testbed.

1) 802.11 Contention: We first considered the case where multiple 802.11 devices operate in the same frequency within each other's range. We established three different wireless links operating on the same channel and randomly activated each one, in order to generate multiple contention scenarios. We also alternated the traffic rate (1, 2, 10, 24 and 50 Mbps) and the MCS of the links for covering as much as possible of realistic WiFi activity. In total, we collected 928 samples. As also stated in [2], we observed that due to an 802.11 performance anomaly, higher MCS suffer more when there is contention from "slower" devices, thus heavily impacting the corresponding NCA values. On the other hand, FDR values remain rather stable and high as there are no frame losses.

2) Non-802.11 Contention: A microwave oven was selected as the non-802.11 device with which we generated non-802.11 contention samples. This is a high RF energy emitting device, found in almost every home, and can have a serious effect on performance as it operates in the 2.44-2.47 GHz frequencies. For this reason, we configured our evaluation link on channel 7 only for this scenario. The operation of the microwave oven can be considered as similar to a "slow" WiFi STA that emits energy with a duty cycle of 0.5. However, as it does not follow the exponential backoff policy, it starts its emissions without sensing the medium, thus causing unrecoverable errors on simultaneous transmissions of WiFi devices. We performed experiments in various locations away from the MW oven, resulting in the gathering of 390 samples. As follows from the above description, NCA exhibited similar performance across MCS, while FDR in higher modulation schemes dropped due to the greater number of collisions.

3) Low SNR: In this set of data sampling experiments, we replicated various low SNR scenarios for both cases of low signal power received on the evaluation link and high RF noise existing close to the receiving smartphone. For the first case, we positioned the smartphone in 20 locations ranging from 1 to 40 meters, resulting in various SNR values. In addition, we alternated the transmission power of the AP device between 0, 5, 10 and 15 dBm. As regards the case of high noise, we employed a wireless surveillance camera at 10 different locations, outside the AP's range. In total, we collected 1087 samples. By inspecting the data, we observed that higher MCS, which are less robust, dropped their FDR values and, as a result of frame losses and the subsequent increase of the CW, a similar behavior was observed for NCA.

4) Hidden Terminal: A more complex pathology to replicate was symmetric Hidden Terminal. In this case, frames coming from the transmitters of the evaluation and the interfering link should have similar RSSI values, in order to result in frame loss for both after a collision. We gathered data from 10 locations and, in a similar manner to the contention scenarios, we varied the traffic rate and the modulation scheme of the interfering link, accumulating a total of 1408 samples. It is worth noting that in order to keep the transmitters outside each other's range, the distance of the evaluation link was long, thus we anticipated the coexistence of the Low SNR pathology. This explains the observation we made from analyzing the data that FDR improves as the modulation scheme gets higher, and so does the NCA, because the smaller duration of frame transmissions leads to avoidance of collisions. However, when we reach very high (less robust) modulation schemes, there is a significant drop in both metrics, similar to the Low SNR case.

5) Capture Effect: A more general case of the Hidden Terminal phenomenon is the Capture Effect, or asymmetric Hidden Terminal, where frames coming from the transmitters of the evaluation and the interfering link have a noticeable difference in RSSI value. The frame with the stronger signal is often correctly decoded by the receiver after a collision, and thus its transmitter continues transmissions with a minimum length CW. On the other hand, the transmitter of the weak signal is heavily affected as collisions, and subsequently frame losses, are increased. We collected 1535 samples, in a similar manner to the symmetric Hidden Terminal pathology, and the data exhibited equivalent behavior for both metrics across modulation schemes, although values were lower due to the heavier impact of Capture Effect on the evaluation link.

In Figure 2 we depict the whole data set of 5348 samples, along with their corresponding pathology.

Figure 2. Data set.

V. EVALUATION OF CLASSIFICATION ALGORITHMS

A. Classification Algorithms

In order to classify an underlying pathology into one of the categories described in the previous section, we employed supervised learning and four popular classification algorithms: a) Decision Trees, b) Random Forests, c) Support Vector Machine Classification and d) K-Nearest Neighbors.

1) Decision Trees: A decision tree builds classification models in the form of a tree structure. It breaks down a data set into smaller and smaller subsets, while at the same time an associated decision tree is incrementally developed. The final result is a tree with decision nodes and leaf nodes. A decision node has two or more branches and a leaf node represents
a classification or decision. The topmost decision node in a tree, which corresponds to the best predictor, is called the root node. Decision trees can handle both categorical and numerical data.

2) Random Forests: Random forests or random decision forests are an ensemble learning method for classification, regression and other tasks, that operate by constructing a multitude of decision trees at training time and outputting the class that is the mode of the classes (classification) of the individual trees. Random decision forests correct for decision trees' habit of overfitting to their training set.

3) Support Vector Machine Classification: A Support Vector Machine (SVM) is a discriminative classifier formally defined by a separating hyperplane. In other words, given labeled training data (supervised learning), the algorithm outputs an optimal hyperplane which categorizes new examples. In two-dimensional space this hyperplane is a line dividing a plane in two parts, with each class lying on either side.

4) K-Nearest Neighbors: The K-Nearest Neighbors algorithm is a supervised classification algorithm: it takes a set of labeled points and uses them to learn how to label other points. To label a new point, it looks at the labeled points closest to that new point (its nearest neighbors) and has those neighbors vote, so whichever label most of the neighbors have is the label for the new point (the "k" is the number of neighbors it checks).

B. Data Modeling and Feature Extraction

The experimental data gathered during the sessions described in Section IV was fed to the classification algorithms, modeled in two distinct ways. The first model fed classifiers with samples as 3-dimensional vectors of features, in the following format:

{NCA, FDR, MODULATION}

along with their labels denoting the pathology, while the second model aggregated the results of Varying Bitrate Probing and fed them as 16-dimensional vectors of features, in the following format:

{NCA_MCS0, FDR_MCS0, ..., NCA_MCS7, FDR_MCS7}

along with their corresponding labels. The rationale behind this differentiation was that although the second model, called Active Model onwards, can certainly offer better accuracy, it requires that the active sampling test (Varying Bitrate Probing) is run every time we need to detect an underlying pathology, inducing saturated traffic of no value for the user in the network. However, in the first model, called Passive Model onwards, samples can be obtained with no need for extra probing traffic, as described in Subsection IV-C.

C. Classifiers Implementation

After processing data for feature extraction, we shuffled and split it into a training and a test set. For the first model we selected 80% of the samples for training and the remaining 20% for testing, while for the second model, where the number of instances was decreased by the feature aggregation, the percentages were 85% and 15% respectively. All classifiers were implemented in Python using the scikit-learn library [13]. We then performed hyper-parameter tuning for both models and all classifiers, in order to optimize classification precision. We chose precision over accuracy in an effort to avoid the accuracy paradox that can often occur in unbalanced training sets. In order to avoid overfitting, we performed a 5-fold cross-validation. The hyper-parameters for each classifier, along with the sets of values that were tested, are given in Tables I, II, III and IV respectively.

Table I
DECISION TREES SETS OF HYPER-PARAMETERS VALUES

criterion                 'gini', 'entropy'
splitter                  'best', 'random'
min_samples_split         2, 3, 4, 10
max_features (Active)     'auto', 'sqrt', 'log2', 4, 10, 14, 16
max_features (Passive)    'auto', 'sqrt', 'log2', 2, 3

Table II
RANDOM FORESTS SETS OF HYPER-PARAMETERS VALUES

criterion                 'gini', 'entropy'
n_estimators              5, 10, 20, 30, 40
max_features (Active)     'auto', 'sqrt', 'log2', 4, 10, 14, 16
max_features (Passive)    'auto', 'sqrt', 'log2', 2, 3

Table III
SVM SETS OF HYPER-PARAMETERS VALUES

kernel                    'rbf', 'linear'
gamma                     'auto', 1e-3, 1e-4
C                         1, 10, 100, 1000

Table IV
K-NEAREST NEIGHBORS SETS OF HYPER-PARAMETERS VALUES

n_neighbors               3, 5, 7
weights                   'uniform', 'distance'
algorithm                 'auto', 'brute'
p                         1, 2

More specifically, for the Decision Trees classifier, we found that for both models "entropy", which uses the information gain, was the best function to measure the quality of a split. The best random split was found to be the best strategy for the splitter, while the minimum number of samples required to split an internal node was 4. The difference between the two models, Active and Passive, was the number of features to consider when looking for the best split: for the former the best value was "auto", which considers the total number of features, while for the latter it was the default value 3.

Regarding the Random Forests classifier, "entropy" was again the best function to measure the quality of a split. For the Active model, the optimized value for the number of features to consider when looking for the best split was 8 and the number of estimators was the default 10. In contrast, for the Passive model, the corresponding values were "auto", representing the square root of the total number of 3 features, and 30 for the number of trees in the forest.

Concerning the SVM classifier, the default "rbf" kernel yielded the best results for both models, while the kernel coefficient "gamma" was set to 0.001 for the Active model and to "auto" for the Passive model, which equals 1 / (total number of features) and in our case equals 0.33.
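The tuning procedure of Subsection V-C can be illustrated with a minimal, standard-library-only sketch. It is not the authors' scikit-learn code: the K-Nearest Neighbors vote, the 5-fold cross-validation and the exhaustive search over the Table IV values of `n_neighbors`, `weights` and `p` are re-implemented by hand, and the `algorithm` entry is left out because it only selects the neighbor search structure. The data are synthetic {NCA, FDR, MODULATION} vectors whose cluster centres are invented for illustration and are not measurements from the testbed; the modulation is encoded as MCS index / 7, which is also an assumption. Function names such as `grid_search` and `cv_precision` are hypothetical.

```python
import itertools
import random
from collections import defaultdict


def minkowski(a, b, p):
    """Minkowski distance: p=1 is Manhattan, p=2 is Euclidean."""
    return sum(abs(x - y) ** p for x, y in zip(a, b)) ** (1.0 / p)


def knn_predict(train, x, n_neighbors, weights, p):
    """Label x by a vote among its n_neighbors nearest training samples."""
    nearest = sorted((minkowski(f, x, p), label) for f, label in train)[:n_neighbors]
    votes = defaultdict(float)
    for dist, label in nearest:
        votes[label] += 1.0 if weights == "uniform" else 1.0 / (dist + 1e-12)
    return max(sorted(votes), key=votes.get)


def macro_precision(y_true, y_pred):
    """Unweighted mean of per-class precision over the classes present in y_true."""
    scores = []
    for c in sorted(set(y_true)):
        hits = [t == c for t, q in zip(y_true, y_pred) if q == c]
        scores.append(sum(hits) / len(hits) if hits else 0.0)
    return sum(scores) / len(scores)


def cv_precision(train, params, folds=5):
    """Mean macro precision over `folds` folds of the already shuffled training set."""
    scores = []
    for k in range(folds):
        held_out = train[k::folds]
        fit = [s for i, s in enumerate(train) if i % folds != k]
        pred = [knn_predict(fit, f, **params) for f, _ in held_out]
        scores.append(macro_precision([label for _, label in held_out], pred))
    return sum(scores) / folds


def grid_search(train, grid, folds=5):
    """Score every combination in `grid`; return (best_params, best_score)."""
    names = sorted(grid)
    best_params, best_score = None, -1.0
    for values in itertools.product(*(grid[n] for n in names)):
        params = dict(zip(names, values))
        score = cv_precision(train, params, folds)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score


# Synthetic placeholder data: the (NCA, FDR) centres below are invented.
rng = random.Random(0)
CENTRES = {"contention": (0.1, 0.9), "low_snr": (0.9, 0.1), "hidden_terminal": (0.1, 0.1)}
samples = []
for label, (nca, fdr) in CENTRES.items():
    for mcs in range(8):
        for _ in range(10):
            features = (nca + rng.uniform(-0.03, 0.03),
                        fdr + rng.uniform(-0.03, 0.03),
                        mcs / 7.0)
            samples.append((features, label))

rng.shuffle(samples)
split = int(0.8 * len(samples))  # 80% training, 20% testing (Passive model split)
train, test = samples[:split], samples[split:]

GRID = {"n_neighbors": [3, 5, 7], "weights": ["uniform", "distance"], "p": [1, 2]}
best_params, best_score = grid_search(train, GRID)
test_pred = [knn_predict(train, f, **best_params) for f, _ in test]
test_accuracy = sum(q == label for q, (_, label) in zip(test_pred, test)) / len(test)
print(best_params, round(best_score, 3), round(test_accuracy, 3))
```

With real samples the same search would be repeated per model (Active and Passive) and per classifier. In scikit-learn, this pattern corresponds roughly to what `GridSearchCV` does with `cv=5` and a precision-based `scoring` argument, although the text does not state which utility was actually used.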

Table V
CONFUSION MATRIX K-NEAREST NEIGHBORS - ACTIVE MODEL

           Low SNR   WiFi    Capture   Hidden   MW
Low SNR    100       0       0         0        0
WiFi       0         100     0         0        0
Capture    0         0       97.6      2.4      0
Hidden     0         0       0         100      0
MW         0         0       0         0        100

Table VI
CONFUSION MATRIX K-NEAREST NEIGHBORS - PASSIVE MODEL

           Low SNR   WiFi    Capture   Hidden   MW
Low SNR    99.3      0       0.4       0        0.3
WiFi       0         100     0         0        0
Capture    1.4       0       90.9      7.4      0.3
Hidden     0         0       6.2       93.8     0
MW         0         3.1     3.1       0.9      92.9
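As a reading aid for the confusion matrices, the short sketch below derives per-class recall and the most frequent confusion from the Passive-model values of Table VI. It assumes that rows are the true pathologies, columns are the predicted ones and every row is expressed as a percentage of that class's test samples (each row sums to roughly 100); the text does not state the orientation explicitly. The helper names `per_class_recall` and `worst_confusion` are hypothetical.

```python
LABELS = ["Low SNR", "WiFi", "Capture", "Hidden", "MW"]

# Table VI (K-Nearest Neighbors, Passive model); assumed layout: rows = true class.
PASSIVE = [
    [99.3, 0.0, 0.4, 0.0, 0.3],
    [0.0, 100.0, 0.0, 0.0, 0.0],
    [1.4, 0.0, 90.9, 7.4, 0.3],
    [0.0, 0.0, 6.2, 93.8, 0.0],
    [0.0, 3.1, 3.1, 0.9, 92.9],
]


def per_class_recall(matrix, labels):
    """Diagonal share of each row: fraction of a true class that was labelled correctly."""
    return {lab: row[i] / sum(row) for i, (lab, row) in enumerate(zip(labels, matrix))}


def worst_confusion(matrix, labels):
    """Return (true_label, predicted_label, percent) for the largest off-diagonal cell."""
    cells = [
        (matrix[i][j], labels[i], labels[j])
        for i in range(len(labels))
        for j in range(len(labels))
        if i != j
    ]
    pct, true_label, predicted_label = max(cells)
    return true_label, predicted_label, pct


recall = per_class_recall(PASSIVE, LABELS)
macro_recall = sum(recall.values()) / len(recall)

for lab, value in recall.items():
    print(f"{lab:8s} recall = {value:.3f}")
print(f"macro-averaged recall = {macro_recall:.3f}")
print("largest confusion:", worst_confusion(PASSIVE, LABELS))
```

Under these assumptions the unweighted mean of the diagonal is about 95.4%, and the largest confusion is between Capture and Hidden. The mean need not coincide with the overall accuracy quoted in the text, because the five classes contain different numbers of samples. Per-class precision cannot be recovered from a row-normalized matrix without the class sizes.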

the penalty parameter "C" of the error term was set to 1 for the Active model and 10 for the Passive model.

Lastly, the hyper-parameters of the K-Nearest Neighbors classifier were also fine-tuned in order to optimize precision. For the Active model, the number of neighbors was set to 3, the nearest neighbors were computed with the "brute force" search algorithm, and a "uniform" weighting function was used. For the Passive model, the number of neighbors was set to its default value of 5. The algorithm was set to "auto", which selects the most appropriate among KDTree, BallTree and brute force according to the data, and we used the "distance" weighting function, which weights points by the inverse of their distance. In both models, we set the power parameter 'p' of the Minkowski metric to 1, which corresponds to the Manhattan distance.

D. Pathology Detection Performance

Having determined the best hyper-parameters for each of the considered classifiers and for both the Active and Passive models, we compared them in terms of accuracy, precision and recall. The overall results are shown in Table VII, where it is evident that the K-Nearest Neighbors classifier is superior to the rest for all metrics in both models. More specifically, it exhibits an accuracy of 99.2% for the Active model and 95.1% for the Passive model. This can be attributed to the fact that the K-Nearest Neighbors algorithm tends to perform very well when there are many data points and few dimensions in the feature vector. It is also known that, as a non-parametric algorithm, it performs better than parametric ones (SVM) when the considered classes are overlapping. The confusion matrices of the K-Nearest Neighbors classifier for each model are presented in Tables V and VI. It is also worth noting that there is some misclassification between the Hidden Terminal and Capture Effect pathologies, which is expected, as these pathologies differ only in their symmetry.

Table VII
EVALUATION RESULTS (values in %)

        Active                             Passive
        Accuracy   Precision   Recall      Accuracy   Precision   Recall
DT        97.8       97.8       97.8         92.8       92.8       92.8
RF        98.1       98.1       98.1         94.6       94.6       94.6
SVM       98.7       98.7       98.7         94.1       94.3       94.1
KNN       99.2       99.2       99.2         95.1       95.1       95.1

VI. CONCLUSION

In this paper we presented the employment, fine-tuning and evaluation of multiple classification algorithms for detecting the causes of WiFi underperformance. The prevalent classifier was chosen as part of a software framework we built for detection based on two modes, a highly accurate Active one and a no-overhead Passive one, by taking advantage of data exported by the driver of the wireless card. The two modes achieved accuracies of 99.2% and 95.1%, respectively. As future work, we plan to employ multi-label classification in an attempt to accurately detect multiple coexisting pathologies.

ACKNOWLEDGMENT

The research leading to these results has received funding by GSRT, under the act of HELIX-National Infrastructures for Research, MIS no 5002781.

REFERENCES

[1] V. Cisco, “Cisco visual networking index: Forecast and methodology 2016–2021,” 2017.
[2] I. Syrigos, S. Keranidis, T. Korakis, and C. Dovrolis, “Enabling wireless lan troubleshooting,” in International Conference on Passive and Active Network Measurement. Springer, 2015, pp. 318–331.
[3] C. Fortuna, E. De Poorter, P. Škraba, and I. Moerman, “Data driven wireless network design: A multi-level modeling approach,” Wireless Personal Communications, vol. 88, no. 1, pp. 63–77, 2016.
[4] P. Kanuparthy, C. Dovrolis, K. Papagiannaki, S. Seshan, and P. Steenkiste, “Can user-level probing detect and diagnose common home-wlan pathologies,” ACM SIGCOMM Computer Communication Review, vol. 42, no. 1, pp. 7–15, 2012.
[5] K.-H. Kim, H. Nam, and H. Schulzrinne, “Wislow: A wi-fi network performance troubleshooting tool for end users,” in INFOCOM, 2014 Proceedings IEEE. IEEE, 2014, pp. 862–870.
[6] K. Sui, M. Zhou, D. Liu, M. Ma, D. Pei, Y. Zhao, Z. Li, and T. Moscibroda, “Characterizing and improving wifi latency in large-scale operational networks,” in Proceedings of the 14th Annual International Conference on Mobile Systems, Applications, and Services. ACM, 2016, pp. 347–360.
[7] M. O. Khan and L. Qiu, “Accurate wifi packet delivery rate estimation and applications,” in Computer Communications, IEEE INFOCOM 2016-The 35th Annual IEEE International Conference on. IEEE, 2016, pp. 1–9.
[8] U. Paul, A. Kashyap, R. Maheshwari, and S. R. Das, “Passive measurement of interference in wifi networks with application in misbehavior detection,” IEEE transactions on mobile computing, vol. 12, no. 3, pp. 434–446, 2013.
[9] S. Rayanchu, A. Patro, and S. Banerjee, “Airshark: detecting non-wifi rf devices using commodity wifi hardware,” in Proceedings of the 2011 ACM SIGCOMM conference on Internet measurement conference. ACM, 2011, pp. 137–154.
[10] N. Inzerillo, D. Croce, D. Garlisi, F. Giuliano, and I. Tinnirello, “Error-based interference detection in wifi networks,” in GLOBECOM 2017-2017 IEEE Global Communications Conference. IEEE, 2017, pp. 1–6.
[11] A. Hithnawi, H. Shafagh, and S. Duquennoy, “Tiim: technology-independent interference mitigation for low-power wireless networks,” in Proceedings of the 14th International Conference on Information Processing in Sensor Networks. ACM, 2015, pp. 1–12.
[12] F. Hermans, L.-Å. Larzon, O. Rensfelt, and P. Gunningberg, “A lightweight approach to online detection and classification of interference in 802.15.4-based sensor networks,” ACM SIGBED Review, vol. 9, no. 3, pp. 11–20, 2012.
[13] F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg et al., “Scikit-learn: Machine learning in python,” Journal of machine learning research, vol. 12, no. Oct, pp. 2825–2830, 2011.
