DCCLA: Automatic Indoor Localization Using Unsupervised Wi-Fi Fingerprinting

P. Brézillon, P. Blackburn, and R. Dapoigny (Eds.): CONTEXT 2013, LNAI 8175, pp. 73–86, 2013. © Springer-Verlag Berlin Heidelberg 2013

DCCLA: Automatic Indoor Localization Using Unsupervised Wi-Fi Fingerprinting

Yaqian Xu1, Sian Lun Lau2, Rico Kusber1, and Klaus David1

1 Chair for Communication Technology (ComTec) Faculty of Electrical Engineering and Computer Science, University of Kassel

Wilhelmshöher Allee 73, 34121 Kassel, Germany [email protected]

2 Department of Computer Science and Networked Systems Faculty of Science and Technology, Sunway University

5, Jalan Universiti, Bandar Sunway, 46150 Petaling Jaya, Malaysia [email protected]

Abstract. People spend most of their time in a few significant places and often indoors in a small number of select rooms and locations. Indoor localization in terms of a user’s current place, related to a user’s daily life, routines or activities, is an important context. We implemented an automatic approach DCCLA (Density-based Clustering Combined Localization Algorithm) to automatically learn the Wi-Fi fingerprints of the significant places based on density-based clustering. In order to accommodate the influence of the signal variation, clustering procedure separately works on a list of RSSIs (Received Signal Strength Indicators) from each AP (Access Point). In this paper, the approach is experimentally investigated in a laboratory setup and a real-world scenario in an office area with adjacent rooms, which is a key challenge to distinguish for place learning and recognition approaches. From these experiments, we compare and identify the most suitable parameters for the unsupervised learning.

1 Introduction

The location of a user is an important and useful context for many emerging location-based applications. For outdoor localization, the location information is usually obtained by using the Global Positioning System (GPS) technology and for indoor localization with the help of specialized infrastructure (e.g., infrared device [1] or ultra-wideband (UWB) network [2]), or by utilizing existing infrastructure such as cellular networks or Wi-Fi networks.

Despite the increasing amount of research, indoor localization is still a not completely resolved challenge. Wi-Fi network-based techniques usually rely on explicit pre-deployment effort, such as extensive site survey with data annotation to build a Wi-Fi fingerprint database [3] and/or radio propagation model [4]. These techniques, while providing localization accuracies of about 2-3m, are seen as expensive and time consuming to get running in the first place.

https://www.researchgate.net/publication/221573504_Precise_Indoor_Localization_Using_Smart_Phones?el=1_x_8&enrichId=rgreq-0ee60d21200e51434292d5f576e6e775-XXX&enrichSource=Y292ZXJQYWdlOzI2MDA5NTA2MjtBUzoyMDI5MTExMjU2NDMyNjhAMTQyNTM4OTE5MDUyNg==

74 Y. Xu et al.

Provided a technique could learn locations without the need of data annotation, i.e. in an unsupervised manner, one can eliminate the need of the pre-deployment process. As shown in our paper here, to eliminate the purpose-designed site survey for generating a Wi-Fi fingerprint database, the available data can be collected by a user’s smartphone in his daily life.

We observed that a user spends most of his time in a few specific places, such as “home”, “office”, “leader’s office”, “café” and so on, which are usually significant to him. If a user stays in a place for a while, the received signal strength indicators (RSSIs) from one Wi-Fi access points (APs) are usually similar, presenting a high-density distribution. The specific characteristic offers us the possibility to build the fingerprints in an unsupervised manner by an autonomic approach.

It turns out to be an approach that learns the places a user goes to, and then recognizes them when he returns to these places. The notion of “place” in indoor localization usually provides a representation of locations that relates to a user’s daily life, routines or activities. The solution is specifically called “place learning or/and recognition”, which is one category of indoor localization.

In this paper, we demonstrate an automatic indoor localization approach called DCCLA (Density-based Clustering Combined Localization Algorithm) [5] to build place fingerprints from Wi-Fi received signal strength indicators (RSSIs) from surrounding access points (APs) in an unsupervised manner, i.e. without explicit pre-deployment effort.

Our unsupervised approach has two advantages. Firstly, a Wi-Fi fingerprint consists of a set of clusters, which belong to different APs observed in one location. Compared to a list of representative or calibrated RSSI values used in most similar approaches, a cluster is an RSSI range with respect to a place, which is more informative than a set of individual values. Secondly, the clustering procedure separately works on a list of RSSIs from each AP, which makes the approach more robust to the unstable presence of APs in the Wi-Fi measurements.

In many cases, learned places are useful context, for example, for a control system which turns on heating and lighting in a room, according to a user’s preferences when he returns to this room. Without the need of data annotation, the unsupervised place learning and recognition technique can provide room level recognition, if it is able to learn and recognize two places from two adjacent rooms.

The goal of this paper is to present the algorithm and to investigate the performance in an office environment with adjacent rooms, which is a key challenge to distinguish for unsupervised place learning and recognition approaches. Evaluations will be carried out in a laboratory setup with known APs and also in a real-life environment using all detectable APs. We investigate which combinations of DCCLA parameters provide the best correct recognition probability.

This paper is organized as follows. Section 2 presents related work that utilizes similar approaches and a comparison with our technique. Based upon this, we elaborate the DCCLA algorithm in Section 3. In Section 4, the measurements and evaluations performed are explained. In Section 5, the conclusion is presented.

DCCLA: Automatic Indoor Localization Using Unsupervised Wi-Fi Fingerprinting 75

2 Related Work

2.1 Place Learning

Generally, place learning is the process of analyzing various sensor data to discover or learn the significant places, which relate to a user’s daily life, routines or activities. The comMotion system [6] is one of the earliest applications about learning places. It detects and defines the indoor places by comparing the locations where the GPS signals disappear and re-appear later. If two measurements are within a certain distance, the location is considered to be a place.

In recent years, detecting places from the radio environment is regarded as a promising option for place learning. For example, the BeaconPrint algorithm [7] scans the GSM and Wi-Fi information to learn and recognize the places, under the assumption that the user is in a place if the scans remain fairly stable during a pre-defined time window. The basic idea of these radio beacon algorithms is to detect the places from successive scans and applying radius- or time-based clustering.

The approaches using successive scans with time sequences are sensitive to the Wi-Fi signal noise during short times. Density-based clustering [8] is proposed to handle the signal noise. ARIEL [9] incorporates a clustering procedure to partition the collected Wi-Fi scans into clusters, each corresponding to one stationary zone. Dousse et al. apply a density-based clustering algorithm OPTICS [10] directly on the raw Wi-Fi vectors, identifying the representative Wi-Fi clusters for a place [11]. Both of them apply density-based clustering on the Wi-Fi scans, consisting of a list of APs. However, calculating the similarity between scans is complex and sensitive to the signal variation.

In order to eliminate the effect of Wi-Fi signal variation between scans, the DCCLA (Density-based Clustering Combined Localization Algorithm) approach proposed by ComTec performs density-based clustering on the RSSIs from each AP, separately. The clusters from different APs related to the same place are later combined together, generating a Wi-Fi fingerprint. Because the clustering procedure separately works on a list of RSSIs from each AP, the unstable presence of APs and signal variation in Wi-Fi scans has less effect on the clustering results than it usually has in the above-mentioned approaches [9] [11].

2.2 Fingerprinting-Based Technique

Place learning is one category of indoor localization. Most current place learning approaches apply fingerprinting-based technique. The basic idea of the fingerprinting-based technique is to discover the signal characteristics in certain locations to form the “Fingerprint (FP)” of these locations. The localization is then realized when the real-time measurement matches a certain FP. Because of the differences of learned data and learning procedures, FP learning can be implemented with supervised learning or unsupervised learning.

Supervised Learning: For supervised learning, pre-deployment effort is required to build the desired fingerprint database. During the collection phase, explicit input is needed to produce labeled data for the training of localization models. For example,

https://www.researchgate.net/publication/220141410_A_User-Centered_Location_Model?el=1_x_8&enrichId=rgreq-0ee60d21200e51434292d5f576e6e775-XXX&enrichSource=Y292ZXJQYWdlOzI2MDA5NTA2MjtBUzoyMDI5MTExMjU2NDMyNjhAMTQyNTM4OTE5MDUyNg==

https://www.researchgate.net/publication/221568284_Learning_and_Recognizing_the_Places_We_Go?el=1_x_8&enrichId=rgreq-0ee60d21200e51434292d5f576e6e775-XXX&enrichSource=Y292ZXJQYWdlOzI2MDA5NTA2MjtBUzoyMDI5MTExMjU2NDMyNjhAMTQyNTM4OTE5MDUyNg==

https://www.researchgate.net/publication/221653977_A_Density-Based_Algorithm_for_Discovering_Clusters_in_Large_Spatial_Databases_with_Noise?el=1_x_8&enrichId=rgreq-0ee60d21200e51434292d5f576e6e775-XXX&enrichSource=Y292ZXJQYWdlOzI2MDA5NTA2MjtBUzoyMDI5MTExMjU2NDMyNjhAMTQyNTM4OTE5MDUyNg==

https://www.researchgate.net/publication/262398661_ARIEL_Automatic_Wi-Fi_based_room_fingerprinting_for_indoor_localization?el=1_x_8&enrichId=rgreq-0ee60d21200e51434292d5f576e6e775-XXX&enrichSource=Y292ZXJQYWdlOzI2MDA5NTA2MjtBUzoyMDI5MTExMjU2NDMyNjhAMTQyNTM4OTE5MDUyNg==


https://www.researchgate.net/publication/261165423_Place_Learning_via_Direct_WiFi_Fingerprint_Clustering?el=1_x_8&enrichId=rgreq-0ee60d21200e51434292d5f576e6e775-XXX&enrichSource=Y292ZXJQYWdlOzI2MDA5NTA2MjtBUzoyMDI5MTExMjU2NDMyNjhAMTQyNTM4OTE5MDUyNg==


76 Y. Xu et al.

in RADAR [3] from Microsoft, empirical Wi-Fi information was collected at predefined locations to generate a complete Wi-Fi fingerprint database for the selected area.

Such approaches learn the FPs from known locations and respectively collected data with data annotations, mostly in terms of semantic names. In many cases, such explicit pre-deployment effort is seen as costly and not practical. Time and resources are needed to build the required fingerprint database.

Unsupervised Learning: The aim of unsupervised learning is to omit the pre-deployment effort for the generation of FPs in the supervised learning. An example is the Redpin approach [12]. It omits the pre-deployment effort by detecting the stable state when Wi-Fi scans keep stable. Then users are immediately prompted to confirm and correct the FP, when an unknown or a wrong one is detected. This approach is seen as unsupervised, because no pre-deployment effort is required to identify a location. However, it requires users' explicit input to annotate the detected location; otherwise the location database is not updated.

Another popular unsupervised approach is based on a technique “Dead Reckoning (DR)”. DR is the process of detecting a location based on previously determined locations and known or estimated speeds [13]. A recent project UnLoc [14], developed by Duke University, bypasses the need of pre-deployment effort by identifying the significant landmark in DR. However, the method still relies on known information of some specific locations, e.g., the location of a door, or staircase, or elevator. The landmarks must be first modeled and identified before the unsupervised Wi-Fi FPs can be created.

A pure unsupervised solution without pre-deployment effort and external landmark information is usually based on radius-based clustering or density-based clustering as we introduced. It is usually applied in place learning applications, e.g. DCCLA works based on the assumption that a significant place is a location where people socially stay for more than a certain time period (e.g. 10 minutes) [15]. The periodically collected Wi-Fi data present a high-density distribution in the significant places, while the data present low-density distribution or vary quickly in the non-significant places. The basic idea of DCCLA is to discover the high-density clusters for each detectable AP. To the best of our knowledge, DCCLA is the first approach to separately apply density-based clustering on the RSSIs from each AP, to eliminate the effect of the signal variation between scans. In the Section 3, we will introduce the DCCLA algorithm in detail.

3 Algorithm Description

The algorithm of DCCLA includes three phases: collection, learning and recognition phase.

Collection Phase: The smartphone periodically scans the surrounding APs. The scans, consisting of RSSI points from all detectable APs, are stored in a Wi-Fi database. To collect RSSI points, smartphones do not need to connect to any APs.

Definition 1: (RSSI point) An RSSI point (Pik) is a record, including the timestamp (tk) of scanning this point, the MAC (Medium Access Control) address (MACi) of the Wi-Fi AP (APi) and the corresponding RSSI value (RSSIik).

https://www.researchgate.net/publication/220926316_Redpin_-_Adaptive_zero-configuration_indoor_localization_through_user_collaboration?el=1_x_8&enrichId=rgreq-0ee60d21200e51434292d5f576e6e775-XXX&enrichSource=Y292ZXJQYWdlOzI2MDA5NTA2MjtBUzoyMDI5MTExMjU2NDMyNjhAMTQyNTM4OTE5MDUyNg==

https://www.researchgate.net/publication/236872236_Indoor_Positioning_Technologies?el=1_x_8&enrichId=rgreq-0ee60d21200e51434292d5f576e6e775-XXX&enrichSource=Y292ZXJQYWdlOzI2MDA5NTA2MjtBUzoyMDI5MTExMjU2NDMyNjhAMTQyNTM4OTE5MDUyNg==

https://www.researchgate.net/publication/254462856_No_need_to_war-drive_Unsupervised_indoor_localization?el=1_x_8&enrichId=rgreq-0ee60d21200e51434292d5f576e6e775-XXX&enrichSource=Y292ZXJQYWdlOzI2MDA5NTA2MjtBUzoyMDI5MTExMjU2NDMyNjhAMTQyNTM4OTE5MDUyNg==

https://www.researchgate.net/publication/232724876_An_experimental_investigation_of_indoor_localization_by_unsupervised_Wi-Fi_signal_clustering?el=1_x_8&enrichId=rgreq-0ee60d21200e51434292d5f576e6e775-XXX&enrichSource=Y292ZXJQYWdlOzI2MDA5NTA2MjtBUzoyMDI5MTExMjU2NDMyNjhAMTQyNTM4OTE5MDUyNg==


; ; ,

Definition 2: (scan) For a timestamp tk, more than one AP may be detected. Thus, a scan (Sk) consists of RSSI points from all detectable APs related to a timestamp tk. The length of a scan is the number of the RSSI points in the scan. , … , , , ,

Definition 3: (Wi-Fi database) A Wi-Fi database (WD) is a list of scans collected during a time period [tk, tm]. , … , , ,

Learning Phase: The RSSI points are automatically learned to be the Wi-Fi FPs of the significant places. In our work, we separately perform density-based clustering on a set of RSSI points from each Wi-Fi AP in a WD. Then we combine clusters related to the same place together, generating a Wi-Fi FP. Details of the two sub-phase algorithms, cluster learning algorithm and fingerprint learning algorithm, are described in Section 3.1 and 3.2.

Recognition Phase: As the user visits a place, a snapshot of the momentary scan Sk will be compared to the Wi-Fi FPs in the fingerprint database obtained from the learning phase. If there is a matching Wi-Fi FP, the recognition algorithm returns the matched result. This indicates that the user returns to a place related to this Wi-Fi FP. The user is located. Details of the Wi-Fi fingerprint recognition algorithm are described in Section 3.3.

3.1 DCCLA Cluster Learning Algorithm

As we observed in previous work [5], the collected RSSI points from one Wi-Fi AP perform a high-density distribution in a significant place. The density-based clustering algorithm in [8] is modified to learn the high-density range for each AP.

Definition 4: (neighborhood of Pik) An RSSI point (Pim) with the same MAC address as Pik within a neighborhood range (NR) is a neighbor of Pik, denoted by n(Pik). The collection of all neighbors of Pik is called neighborhood of Pik, denoted by N (Pik). Here we define Pik itself belongs to N (Pik) as well. | 0 , ,

The neighborhood range (NR) is one parameter, used to delimit the range of the neighborhood starting from the RSSI point (Pik). The unit of NR is dB.

Definition 5: (neighborhood density) Neighborhood density of Pik ( ) is the number of neighbors of Pik the NR.

, (1)

We take (NR=2dB) as an example. For an RSSI point Pik with RSSI value (RSSIik=-62dB), all the RSSI points, who have the same MAC address (MACi) and

https://www.researchgate.net/publication/232724882_Novel_Indoor_Localisation_using_an_Unsupervised_Wi-Fi_Signal_Clustering_Method?el=1_x_8&enrichId=rgreq-0ee60d21200e51434292d5f576e6e775-XXX&enrichSource=Y292ZXJQYWdlOzI2MDA5NTA2MjtBUzoyMDI5MTExMjU2NDMyNjhAMTQyNTM4OTE5MDUyNg==


78 Y. Xu et al.

RSSI values [-62dB,-61dB,-60dB], are the neighbors of Pik. The number of the RSSI points in the neighborhood range [-62dB, -60dB] is the neighborhood density of Pik.

Two parameters are introduced to determine the criterion of high neighborhood density. One is the neighborhood range (NR) as we introduced above. Another is the minimum number of RSSI points (MinPts). It is a natural number which is introduced to determine if the Neighborhood density of Pik is high enough to create a cluster.

For an RSSI point (Pik), its neighborhood density is high enough to create a cluster if the following requirement is satisfied: , (2)

Definition 6: (cluster) A cluster ( ) is a collection of RSSI points. Its representative form is a consecutive RSSI range with high RSSI density, associated with , which can be indicated by: , , , , , ,

RSSIib represents the beginning and RSSIie represents the end RSSI value of the high-density range (e.g. 00: : 00: 62: 6: 00, 56 , 62 ). We extract the cluster with a consecutive RSSI range because it is more informative than a set of individual RSSI values. A set of RSSI points with one MAC address may be grouped into several clusters (e.g. { , …, }) without overlapping. is the oth cluster related to the MAC address MACi.

After a cluster is created, the algorithm check if this cluster can be extended by checking if the requirement of cluster-extension is satisfied.

Definition 7: (cluster-extension) A cluster ( ) can be extended if the following two conditions are met:

•

•

The neighborhood of Pim is merged into the cluster ( ), as the cluster is extended. The basic idea to generate a cluster is the following. For each RSSI point Pik, the

algorithm calculates its neighborhood Density ( ). If is lower than MinPts, the algorithm continues to check the next unchecked RSSI point. Otherwise, the neighborhood of Pik is either used to create a new cluster if Pik does not belong to any existing cluster, or merged to an existing cluster if Pik belongs to the existing cluster. The set of RSSI points not belonging to any cluster is defined as noise. The pseudo code of the DCCLA cluster learning algorithm is described as follows. Input: a Wi-Fi database (WD). Output: a set of all learned clusters { , … , }.

1) Separate the WD into datasets, each dataset with a unique MAC address .

2) Order each dataset to form a list with increasing RSSI values.

3) Label each RSSI point ( ) on each as unchecked


4) for each ordered , do 1) while there exist an unchecked , do

1) Calculate the neighborhood density of ( )

2) if is smaller than MinPts , then 1) Label as checked. 2) continue the while loop

3) else if belongs to an existing cluster , then 1) Merge neighborhood of to .

4) else 1) Create a new cluster .

5) end if 6) Label as checked.

2) end while 5) end for As indicated above, the range of a cluster is not restricted by the parameter

neighborhood range (NR). The cluster range is automatically adapted to the high-density distribution of RSSI points in the Wi-Fi database. Consequently, the density-based clustering algorithm is suitable for indoor localization where signal variations in different scenarios are mostly not the same.

3.2 DCCLA Fingerprint Learning Algorithm

After the clustering process, the algorithm learns the Wi-Fi fingerprints.

Definition 8: (Wi-Fi fingerprint) A Wi-Fi fingerprint ( ) is a set of clusters, which belong to different APs related to a scan Sk collected at the time tk. , … , , , , ,

In most offices or residential areas, more than one AP can be detected. The RSSI points in a scan Sk with the same timestamp tk and different MAC addresses are collected in the same place. We develop the fingerprint learning algorithm based on the above fact. In the fingerprint learning sub-phase, the algorithm compares all scans in the WD with the learned clusters. For each scan , … , with the timestamp tk, if they respectively belong to a set of clusters , … , , the algorithm

combines these clusters to generate a Wi-Fi fingerprint , corresponding to a significant place where this scan is collected. The pseudo code of the DCCLA fingerprint learning algorithm is described as follows.

Input: a Wi-Fi database (WD) and a set of clusters { , … , } learned from this WD.

Output: a fingerprint database (FD).

80 Y. Xu et al.

1) for each scan in the WD, do 1) Label all learned clusters { , … , } as

unmatched. 2) for each RSSI point (P

ik) in , do

1) Compare with { , … , }.

2) if Pik belongs to a learned cluster ( ), then 1) Label as matched.

3) else 1) break the for loop.

3) end for 4) if the number of matched clusters equals to the

length of , then 1) Combine the set of matched clusters { … , }

together as a fingerprint . 2) if does not exist in a FD, then

1) Add to FD. 3) else

1) continue the for loop. 5) else

1) continue the for loop. 2) end for One place may have more than one corresponding set of fingerprints because of the

variation of detectable APs. Ideally, the fingerprints in each distinguishable place are unique, so as to be able to separate places from one another. All Wi-Fi fingerprints are stored in the fingerprint database (FD). In the update phase, the learning phase is repeated on a newly collected Wi-Fi database. But only new Wi-Fi fingerprints are added to the FD.

3.3 DCCLA Fingerprint Recognition Algorithm

As the user visits a place, a snapshot of the momentary scan , … , is compared to the Wi-Fi fingerprints in the fingerprint database (FD). If, and only if, each RSSI point in belongs to each cluster of a Wi-Fi fingerprint ( ), the recognition algorithm returns the matched result. The pseudo code of the DCCLA cluster learning algorithm is described as follows. Input: a momentary scan , … , and a fingerprint database (FD). Output: a matched Wi-Fi fingerprint ( ). 1) for a momentary scan , … , , do

1) Label each fingerprint ( ) in the FD as unchecked


2) while there exists an unchecked in FD, do 1) if the length of equals to the number of

clusters in , do 1) if each RSSI point ( ) in falls into

each cluster in , do 1) Return .

2) else 1) continue the while loop.

2) else 1) continue the while loop.

3) end while 2) end for This is known as 100% degree of matching. For example, if there are three RSSI

points measured in a scan, {tk, MAC1, -45dB}, {tk, MAC2, -50dB}, {tk, MAC3, -57dB}, a 100% matched fingerprint contains three clusters with respective MAC address and cluster range where the RSSI value of each RSSI point belongs to.

A matched scan indicates that the user has returned to a place corresponding to this fingerprint. The user’s current place is recognized. Currently, in order to reduce the localization errors, we utilize the 100% degree of matching to recognize the matched Wi-Fi fingerprint.

4 Experimental Evaluation

We select two scenarios in an office area with adjacent rooms for the intended investigations. The decision to select a small-scale office area is that it is generally a challenge to distinguish adjacent rooms in an unsupervised manner. In the first scenario, we use four APs with known locations to investigate the influence of the number of APs. For the second scenario, all detectable APs (up to 17 APs) in reality are included. The office area is located on the second floor of a three-story building. The area consists of five office rooms next to each other. Room 2408 has an area of 38.32m2, and each of rooms 2410, 2411, 2412, and 2414 has an area of 18.41m2. The layout is shown in Fig. 1.

Five Motorola Milestones smartphones with Android version 2.2.1 are used as measuring devices. The collection locations are shown in Fig. 1. These smartphones are placed at the height of a table (about 70cm). We assume that most users are used to putting their smartphones on the table (meaning a similar height).

The RSSIs are collected at a collection interval of 5 seconds. In contrast to the previous investigation [15], we now allow the presence of people throughout the measurements. In this way, possible influence on signal variation due to occupancy, which will be the case in a real life scenario, is included in the collected data.


82 Y. Xu et al.

Fig. 1. The office area, and tsmartphones used for the evalu

For the evaluations, traiTraining data from each roTest data from each room DCCLA approach. We d“Correct” means a smartphplace. “False” means a smadifferent place. “Correct Rethe “Correct”s divided by t

∑The CRP metric indicat

without having it mistaken

4.1 Influence of the Nu

From our previous investigtwo adjacent rooms using Wwe investigate the CRP of Fig. 1 presents the placemen

We use the first 30 minusubsequent 30 minutes of increasing the number of APand MinPts=120 are usedarithmetic average CRP is th

The increase of APs useFrom the CRP of 59.17% (to 97.63% and 99.25% resthan the result from ourinterference caused by oc

the collection locations, i.e. the positions of the APs as weluation

ining and test data are extracted from the collected doom are used to generate fingerprints of respective roo

are used to evaluate the recognition performance of define an evaluation metric for the recognition phahone is in a place and the algorithm recognizes it is in artphone is in a place but the algorithm recognizes it is iecognition Probability (CRP)” is defined as the numbethe number of the “Correct”s and the “False”s. ∑∑ ∑ 100%tes how well the DCCLA can recognize a place correas another place.

umber of APs

gation [15], we know that the CRP of two locations frWi-Fi RSSI from a single AP was 76%. In this evaluatif the DCCLA in an environment with more than one Ant of the APs used in our investigation. utes of the collected Wi-Fi RSSIs as training data and Wi-Fi RSSIs as test data. Measurements are repeatedP from one to four APs. Based on the results in [15], NR

d as parameters for the DCCLA learning process. Then calculated for all rooms. The results are listed in Fig. es for localization results in an improvement in the CR(only one AP), the CRP with three and four APs increaspectively. The CRP using one AP in this paper is low previous paper (76%) [15]. This is due to possiccupancy and changes in the environment during

ll as

data. oms.

the ase. this in a r of

(3)

ctly

rom ion, AP.

the d by R=2 The 2.

CRP. ases wer ible the




DCCLA: Automatic Indoor

Fig.

measurement period. In ourwithout disturbance of surincrease of the number of A

4.2 Parameters for the

In the previous paper [15], Furthermore, the selected eand signal variation due tothese factors cannot be negwith different combinationsDCCLA learning algorithmrange (NR) and the minimu

We use the data collectAP4 in Fig. 1). The valueslearning phase. Selected NRMinPts values range from 1test data from every room to produce the CRP. Fig. 3 MinPts=110 are the paramethan 4. This is because whtendency to overlap locatiapproach to fail to differenevaluation for a range of NAPs) to see whether the opevaluations were still valid.the best CRP is obtained fearlier, a lower number of locations in adjacent roomgenerate clusters and fingerthe physical location of theFig. 4, a total of 4 APs doeNR is lower than 7. This is d

r Localization Using Unsupervised Wi-Fi Fingerprinting

2. Average CRP using one to four APs

r previous investigation, the measurements were performrrounding people. From this result, it is shown that

APs improved the CRP of the DCCLA approach.

Generation of Fingerprints Using the DCCLA

we identified parameters based on the use of a single Aenvironment had minimal disturbance such as interfereo people movements and environment changes. In realglected. In this subsection, we compare the CRP obtais of parameters to build the fingerprint database using m. The parameters to be investigated are: neighborhom number of RSSI points (MinPts).

ted from three APs with known locations (AP2, AP3, s of NR and MinPts are changed in each repetition of

NR values are 1, 2, 3, 4, 5, 6, 7, 8, 9, and 10 dB. Selec10 to 240 with an increment of 10. For each repetition, are tested with the generated fingerprints from each roshows the results of the comparison. The values NR=3 eters with the best CRP. The CRP declines if NR is m

hen the NR was too large, the generated clusters have ions from two adjacent rooms. This causes the DCCntiate these places. Based on these results, we repeat

NR values (from 1 to 10) and number of APs (from 1 tptimal values (NR and number of APs) from the previ. The value of MinPts is kept as 110. The results show tfor three APs and an NR between 3 and 7. As obserAPs produces a lower number of clusters for recogniz

ms. By increasing the number of APs, it is possiblerprints that can distinguish these adjacent rooms. Howeve AP should also be taken into consideration. As shownes not provide higher recognition as 3 APs if the valuedue to the additional AP2 being close to AP1 (see Fig. 1)

83

med the

AP. ence lity, ined

the ood

and the

cted the

oom and

more the

CLA the

to 4 ious that

rved zing e to ver, n in e of ).


84 Y. Xu et al.

Fig. 3. Comparison of differe

Fig. 4. The CRP f

4.3 Inclusion of All Det

Besides the four APs used APs from neighboring depaexisting APs is from 2 to 17when Wi-Fi RSSIs from all

We repeat the evaluationall existing APs to build theevaluate fingerprint datasetMinPts are computed for co

As compared to the resuCRP is 100% for all values highest MinPts with 100% CIn other words, the selectionapproach to recognize the se

nt DCCLA learning parameters, NR (x-axis) and MinPts (y-ax

for different NR (x-axis) and numbers of APs (y-axis)

tectable Wi-Fi Access Points from the Surroundings

in the previous evaluation, there are also other detectaartments. From the collected data, the number of detecta7. We investigate the performance of the DCCLA approl available APs are to be used. n in the previous subsection, and use collected RSSIs fre fingerprint dataset. Test data from each room are usedt of each room. The CRP for each combination of NR omparison. The result is shown in Fig. 5. ult in Section 4.2, the CRP has increased (see Fig. 3). Tof NR if we keep the MinPts lower than 100. Similarly,

CRP is 180, when the value of NR is kept between 7 and n of NR and MinPts within these ranges enables the DCCelected places in the office area without false recognition.

xis)

able able oach

rom d to and

The the 10.

CLA

DCCLA: Automatic Indoor

Fig. 5. The CRP using all av

5 Conclusion

In this paper, we have preLocalization Algorithm) apapproach that is designed without any explicit user experimentally investigatedexperiments, we have ideoffice area with adjacent radjacent rooms can be corRecognition Probability) bhave given us promising approach. Index Expanded. Acknowledgments. This pWirtschaft und Technologiinternetbasierte Telekommauthors are responsible facknowledge the contributio

References

1. Want, R., Hopper, A., FaTransactions on Informati

2. Mahfouz, M.R., Zhang, CAccuracy Indoor 3-D PMicrowave Theory and T

3. Bahl, P., Padmanabhan, Vsystem. In: INFOCOM (2

r Localization Using Unsupervised Wi-Fi Fingerprinting

ailable APs for different values NR (x-axis) and MinPts (y-axi

esented the DCCLA (Density-based Clustering Combipproach in detail. It is an unsupervised indoor localizatto automatically generate Wi-Fi RSSI-based fingerprinput or control. The performance of the approach

d in a laboratory setup and a real-world scenario. From ntified the ideal parameters suitable for the small-scrooms. It is shown that even in a small-scale office arrrectly distinguished and recognized with a CRP (Corrbetween 97% and 100% with at least 3 APs. The res

techniques that can improve and optimize the DCC

paper is funded by the German “Bundesministeriumie by funding the project” pinta (Pervasive Energie du

munikationsdienste), reference number 01ME11027. Tfor the content of this publication and would likeons of their colleagues.

alcao, V., Gibbons, J.: The active badge location system. Aion Systems 10(1), 91–102 (1992) C., Merkl, B.C., Kuhn, M.J., Fathy, A.E.: Investigation of HPositioning Using UWB Technology. IEEE Transactions echniques (2008)

V.N.: RADAR: An in-building rf-based user location and track2000)

85

is)

ned tion rints h is the

cale rea, rect ults

CLA

m fr urch The

e to

ACM

High-on

king

86 Y. Xu et al.

4. Martin, E., Vinyals, O., Friedland, G., Bajcsy, R.: Precise indoor localization using smart phones. In: Proceedings of the International Conference on Multimedia (2010)

5. Lau, S.L., Xu, Y., David, K.: Novel indoor localisation using an unsupervised Wi-Fi signal clustering method. In: Future Network & Mobile Summit (2011)

6. Marmasse, N., Schmandt, C.: A user-centered location model. In: Personal and Ubiquitous Computing (2002)

7. Hightower, J., Consolvo, S., LaMarca, A., Smith, I., Hughes, J.: Learning and recognizing the places we go. In: Beigl, M., Intille, S.S., Rekimoto, J., Tokuda, H. (eds.) UbiComp 2005. LNCS, vol. 3660, pp. 159–176. Springer, Heidelberg (2005)

8. Ester, E., Kriegel, H., Sander, J., Xu, X.: A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In: 2nd International Conference on Knowledge Discovery and Data Mining (1996)

9. Jiang, Y., Pan, X., Li, K., Lv, Q., Dick, R.P., Hannigan, M., Shang, L.: ARIEL: Automatic Wi-Fi based Room Fingerprinting for Indoor Localization. In: UbiComp 2012 (2012)

10. Ankerst, M., Breunig, M.M., Kriegel, H., Sander, J.: OPTICS: Ordering Points to Identify the Clustering Structure. In: ACM SIGMOD International Conference on Management of Data (1999)

11. Dousse, O., Eberle, J., Mertens, M.: Place learning via direct WiFi fingerprint clustering. In: IEEE 13th International Conference on Mobile Data Management (2012)

12. Bolliger, P.: Redpin - adaptive, zero-configuration indoor localization through user collaboration. In: The 1st ACM International Workshop on Mobile Entity Localization and Tracking in GPS-less Environments (2008)

13. Mautz, R.: Indoor Positioning Technologies, ETH Zurich, Department of Civil, Environmental and Geomatic Engineering, Institute of Geodesy and Photogrammetry (2012)

14. Wang, H., Sen, S., Elgohary, A., Farid, M., Youssef, M., Choudhury, R.R.: No need to war-drive: unsupervised indoor localization. In: MobiSys 2012 (2012)

15. Xu, Y., Lau, S.L., Kusber, R., David, K.: An Experimental Investigation of Indoor Localization by Unsupervised Wi-Fi Signal Clustering. In: Future Network and Mobile Summit (2012)




























DCCLA: Automatic Indoor Localization Using Unsupervised Wi-Fi Fingerprinting

Documents