Enabling Effective Data Mining of RFID Data

Apr 10, 2018

vishnunath3000
  • 8/8/2019 Enabling Effective Data Mining of RFID Data

    1/28

    Roma Chauhan, Rajeev Gupta

    Computer Science Department, Institute of Management Education

    Sahibabad

    What we are going to cover

    • Introduction to RFID System
    • Applying RFID: Supply Chain
    • RFID Data Characteristics and Issues
    • Why RFID Data Mining?
    • RFID Data Mining
    • RFID Data Mining Challenges
    • Conclusions

    Bar Code

    Barcode Disadvantages

    • To keep up with inventories, companies must scan each bar code on every box of a particular product.
    • Going through the checkout line involves the same process of scanning each bar code on each item.
    • A bar code is a read-only technology, meaning that it cannot send out any information.

    RFID

    • RFID is a technology that uses radio-frequency waves to transfer data between a reader and a movable item in order to identify, categorize, track...
    • RFID is fast and reliable, and does not require line of sight or physical contact between the reader/scanner and the tagged item.
    • It uses radio waves to identify objects automatically.

    • RFID tags are intelligent bar codes that can talk to a networked system to track every product that you put in your shopping cart.
    • Many manufacturers use the tags to track the location of each product they make, from the time it is made until it is pulled off the shelf and tossed into a shopping cart.

    Why RFID

    • Read/Write
        • The ability to add information directly to tags enables each unique asset to carry its own unique history.
    • Non-contact Reads
        • The ability to read tags at a distance, under a variety of environmental conditions, without physical manipulation of the asset.
    • Fast Read
        • The ability to simultaneously read large numbers of items (1000-1750 tags/sec).
    • Automation: Requires less human intervention.
    • Authenticity
        • Each RFID chip is unique and cannot be replicated.

    Components of an RFID System

    • Tag: The tag contains a microchip for storing data and an antenna tuned to receive the radio-frequency waves emitted by a scanner, allowing wireless transmission of data to the reader.
    • Scanner: A reader that contains a radio-frequency module, a control unit, and a coupling element to interrogate the tags via radio-frequency communication. Readers are usually connected through middleware to a back-end database.
    • Middleware: Interacts with the back-end database. Middleware is responsible for cleaning the data (for example, eliminating false reads) as well as aggregating and filtering it. Also, by monitoring multiple readers, middleware can detect the movement of RFID tags as they pass from the read range of one reader to another.
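
The movement detection attributed to middleware above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Read` record, the reader names, and the function `detect_movements` are hypothetical and do not come from any particular middleware product.

```python
from typing import NamedTuple


class Read(NamedTuple):
    epc: str      # tag identifier (hypothetical values below)
    reader: str   # name of the reader that saw the tag (hypothetical)
    ts: float     # read time in seconds


def detect_movements(reads):
    """Report (epc, from_reader, to_reader, ts) whenever a tag appears at a new reader."""
    last_reader = {}  # epc -> reader that saw the tag most recently
    moves = []
    for r in sorted(reads, key=lambda r: r.ts):
        prev = last_reader.get(r.epc)
        if prev is not None and prev != r.reader:
            moves.append((r.epc, prev, r.reader, r.ts))
        last_reader[r.epc] = r.reader
    return moves


reads = [
    Read("tag-1", "dock", 0.0),
    Read("tag-1", "dock", 1.0),
    Read("tag-2", "dock", 2.0),
    Read("tag-1", "shelf", 5.0),
]
moves = detect_movements(reads)
print(moves)
```

Repeated reads at the same reader produce no event; only the change from one reader to another is reported, which is the kind of aggregation the slide assigns to middleware.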

    RFID Technology

    • Radio Frequency Identification (RFID)
        • Technology that allows a sensor (reader) to read, from a distance and without line of sight, a unique electronic product code (EPC) associated with a tag.

    RFID System (Tag, Reader, Database)

    Electronic Product Code (EPC)

    • The EPC is a coding scheme created as an eventual successor to the bar code.
    • The EPC was created as a low-cost method of tracking goods using RFID technology. It is a standard naming scheme developed by the Auto-ID Center.
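
The slides do not show what an EPC looks like. As a hedged illustration, the sketch below assumes the pure-identity URI form used for serialized trade items (SGTIN), in which a company prefix, an item reference, and a serial number are separated by dots. The function name `parse_sgtin_uri` is hypothetical, and the URI is a commonly used sample value, not data from the presentation.

```python
def parse_sgtin_uri(uri):
    """Split an SGTIN pure-identity EPC URI into its three dot-separated fields."""
    prefix = "urn:epc:id:sgtin:"
    if not uri.startswith(prefix):
        raise ValueError(f"not an SGTIN pure-identity URI: {uri!r}")
    fields = uri[len(prefix):].split(".")
    if len(fields) != 3:
        raise ValueError(f"expected 3 fields, got {len(fields)}")
    company_prefix, item_reference, serial = fields
    return {
        "company_prefix": company_prefix,  # identifies the manufacturer
        "item_reference": item_reference,  # identifies the product type
        "serial": serial,                  # identifies the individual item
    }


epc = parse_sgtin_uri("urn:epc:id:sgtin:0614141.112345.400")
print(epc)
```

The serial field is what distinguishes an EPC from a bar code in practice: two boxes of the same product share a bar code but carry different EPCs.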

    Events during RFID tracking

    Applying RFID

    • RFID-enabled supply chain: Inventory management, reduced theft, easy and effective tracking of items.
    • Retail: Active shelves monitor product availability.
    • Access control: Toll collection, credit cards, building access.
    • Airline luggage management: Reduce lost or misplaced luggage.
    • Medical: Implant patients with a tag that contains their medical history.
    • Pet identification: Implant an RFID tag with pet owner information.

    Movement of an object within a supply chain before it is finally purchased by the customer

    RFID Data Characteristics and Issues

    • RFID datasets have the following characteristics:
        • Inaccuracy
        • Massive data streams
        • Temporal & spatial features
        • Integration
        • Scalable systems
        • Heterogeneous datasets
        • Inferences
        • Manageable
        • Security
        • Mapping the physical and interpreted worlds

    Inaccuracy

    • RFID data is dirty and not very accurate because of duplicate and missed reads by the reader.
    • In its raw form, RFID data is noisy and unreliable because of false reads.

    Massive data streams

    • RFID datasets are streams of data generated continuously and recorded automatically by RFID readers.
    • Such huge volumes of data require scalable storage schemes for efficient storage.
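
Duplicate reads are the easiest of these problems to illustrate. The sketch below is one simple approach, not a method taken from the slides: it keeps a read only if the same tag was not already kept at the same reader within a short time window. The tuple layout `(epc, reader, ts)`, the function name `drop_duplicates`, and the 2-second window are all assumptions chosen for the example.

```python
def drop_duplicates(reads, window=2.0):
    """Keep a read only if the same (epc, reader) was not kept in the last `window` seconds."""
    last_kept = {}  # (epc, reader) -> timestamp of the last read that was kept
    kept = []
    for epc, reader, ts in sorted(reads, key=lambda r: r[2]):
        key = (epc, reader)
        if key not in last_kept or ts - last_kept[key] >= window:
            kept.append((epc, reader, ts))
            last_kept[key] = ts
    return kept


raw = [
    ("A", "r1", 0.0),
    ("B", "r1", 0.2),
    ("A", "r1", 0.5),  # duplicate of the read at 0.0
    ("A", "r1", 1.0),  # duplicate of the read at 0.0
    ("A", "r1", 3.0),  # outside the window, kept
]
clean = drop_duplicates(raw)
print(clean)
```

Filtering of this kind also reduces the volume of the stream before it reaches storage, which ties the two headings on this slide together.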

    Temporal & spatial features

    • Every RFID reading depends on a time stamp (temporal). RFID datasets are also spatial in nature, because tagged objects move across locations during their life cycle. This adds another layer of complexity.

    Integration

    • Integrating RFID data with a company's back-end operations and business processes poses many challenges.
    • Implementing RFID infrastructure within an organization imposes a large cost on the company.

    Scalable Systems

    • With the enormous growth of RFID databases, the system should be scalable enough to handle it.
    • The system should be able to grow in one or more dimensions as the volume of RFID data and the number of transactions increase, without affecting performance.

    Heterogeneous datasets

    • Organizations that adopt RFID technology must handle data from thousands of readers distributed across geographically dispersed locations. An RFID solution can be deployed across multiple sites, companies, or even countries.

    • Inferences
        • RFID datasets always carry embedded implicit information, such as changes of state and containment relationships among objects. Making proper inferences from RFID datasets also requires the context of other information.
    • Manageable
        • Managing huge datasets requires good support for administration and testing, which is a prerequisite for the successful deployment of an RFID solution in large-scale, distributed applications.
    • Security
        • The vast amount of potentially sensitive information involved in RFID systems makes security concerns critical. One example is the placement of wrong RFID tags on objects.
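
The containment relationships mentioned under Inferences are not stored in the reads themselves; they have to be derived. The sketch below shows one naive heuristic, offered purely as an assumption: an item is taken to be inside a case if that case was seen at every (reader, time) where the item was seen, and exactly one case qualifies. It assumes timestamps are already bucketed into read cycles so that co-reads match exactly, and that the set of case EPCs is known from other context. All names are hypothetical.

```python
from collections import defaultdict


def infer_containment(reads, cases):
    """Map item EPC -> case EPC when exactly one case co-occurs with all of the item's reads."""
    sightings = defaultdict(set)  # epc -> set of (reader, ts) where it was seen
    for epc, reader, ts in reads:
        sightings[epc].add((reader, ts))

    contained_in = {}
    for epc in list(sightings):
        if epc in cases:
            continue
        seen = sightings[epc]
        candidates = [c for c in sorted(cases) if seen <= sightings.get(c, set())]
        if len(candidates) == 1:
            contained_in[epc] = candidates[0]
    return contained_in


reads = [
    ("case-1", "dock", 0), ("item-a", "dock", 0), ("item-b", "dock", 0),
    ("case-1", "store", 10), ("item-a", "store", 10),
    ("item-b", "shelf", 12),  # item-b left the case
]
contained = infer_containment(reads, cases={"case-1"})
print(contained)
```

The need to pass in `cases` separately is an instance of the point made above: the inference only works with context that the RFID reads alone do not provide.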

    RFID Data Mining: Why

    • Massive RFID data can be mined to produce predictive analysis and to determine patterns in real time, which can help improve supply chain processes.
    • Tracking data for objects moving through the supply chain contains valuable knowledge needed to understand processes such as inventory management and quality control in complex production systems.
    • Vehicle tracking data can be used by transportation managers for incident detection or road network planning.

    • Massive RFID data brings great opportunities for mining techniques.
    • The huge datasets can be mined to produce predictive analysis and to determine patterns in real time.

    Issues in Data Cleaning

    • Lack of completeness
        • RFID readers capture only 60-70% of all tags that are in the vicinity.
        • Data is smoothed to compensate for the loss of intermediate messages.
    • Temporal nature of data, or tag dynamics
        • RFID tags are in motion, which makes them more difficult to handle.
        • The motion of a tag also causes messages to be dropped.
        • RFID data streams are very fast and very large in number.
        • Hence filtering is important before sending them to
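
The smoothing step named under lack of completeness can be illustrated with a fixed sliding window. This is a minimal sketch under assumptions, not the method used by the authors: readings are grouped into read cycles ("epochs"), and a tag counts as present in an epoch if it was observed in any of the last `window` epochs. The function name `smooth` and the sample data are hypothetical.

```python
def smooth(epochs, window=2):
    """epochs: list of sets of EPCs observed per read cycle.

    Returns, per epoch, the tags considered present after smoothing.
    """
    smoothed = []
    for i in range(len(epochs)):
        start = max(0, i - window + 1)
        # A tag is present if any epoch in the window observed it.
        present = set().union(*epochs[start:i + 1])
        smoothed.append(present)
    return smoothed


observed = [{"A", "B"}, {"A"}, {"A", "B"}, set(), set()]
result = smooth(observed, window=2)
print(result)
```

In the sample, tag B is missed in the second epoch but is filled in by the window. The same window also keeps both tags "present" for one extra epoch after they leave, which shows the trade-off the slide hints at: a larger window hides more missed reads but reacts more slowly to tags that really move away.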

    Challenge for RFID data mining

    • The major problem with RFID data is that its volume grows, in terms of both time and location, as a product moves through the supply chain. Wal-Mart is expected to generate 7 terabytes of RFID data per day.
    • Moreover, the data is redundant and needs to be consolidated and transformed to occupy less space in the database.
    • To obtain the desired result, one must ensure that no useful information is lost during this process.
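
One common way to consolidate redundant reads is to collapse consecutive reads of a tag at the same location into a single stay record. The sketch below assumes raw reads of the form `(epc, location, ts)` and produces `(epc, location, time_in, time_out)`; the record layout and the function name `compress_to_stays` are assumptions made for illustration.

```python
def compress_to_stays(reads):
    """Collapse consecutive same-location reads of a tag into (epc, location, time_in, time_out)."""
    stays = []
    open_stay = {}  # epc -> index in `stays` of that tag's current stay
    for epc, loc, ts in sorted(reads, key=lambda r: r[2]):
        idx = open_stay.get(epc)
        if idx is not None and stays[idx][1] == loc:
            # Same tag, same location: extend the current stay.
            _, _, t_in, _ = stays[idx]
            stays[idx] = (epc, loc, t_in, ts)
        else:
            # First read of the tag, or the tag moved: open a new stay.
            stays.append((epc, loc, ts, ts))
            open_stay[epc] = len(stays) - 1
    return stays


raw = [
    ("A", "dock", 0), ("A", "dock", 1), ("A", "dock", 2),
    ("A", "shelf", 5), ("A", "shelf", 9),
]
stays = compress_to_stays(raw)
print(stays)
```

Five reads become two records. Where the tag was and for how long is preserved, while the individual read times inside each stay are discarded; whether that counts as "useful information" depends on the queries the data must support.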

    Characteristics of RFID Data

    • Spatio-temporal, dynamics and correlations
        • Data contains time, location, and status.
    • Rich semantics
        • Objects carry a lot of information related to their context, status, and background knowledge.
    • Uncertainty and heterogeneity
        • Missing readings and repeated readings
        • Dirty data
    • Streaming, batching, and massive volume
        • Data is generated automatically and rapidly in the form of streams.
        • Objects must be checked in a batch.

    Challenges

    • The gigantic size of RFID data and the diversity of queries pose great challenges to traditional systems and analysis technology, since processing may involve retrieval and reasoning over a large number of interrelated tuples through different stages of object movement.

    Data Cube Challenges

    • The path nature of RFID data makes it hard to incorporate into a traditional data cube while preserving its structure.
    • We can count the number of objects that stayed at a particular location for a defined period of time. But it is difficult to determine the distance an object covers while moving from one store location to another.
    • Cleaned RFID data moves beyond the raw tuple form. It relies on a path database whose entries have the form (code, time in, time out).
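
The count described above is straightforward once the data is held as paths. The sketch below assumes that "code" in (code, time in, time out) denotes a location code and that each object's path is a list of such entries; both are interpretations, since the slide does not spell them out. The sample paths and the function name `count_stays` are hypothetical.

```python
from collections import Counter

# Hypothetical path database: EPC -> ordered list of (location code, time in, time out).
paths = {
    "tag-1": [("factory", 0, 4), ("truck", 5, 7), ("store", 8, 20)],
    "tag-2": [("factory", 0, 1), ("truck", 2, 7), ("store", 8, 9)],
}


def count_stays(paths, min_duration):
    """Count, per location, the objects that stayed at least `min_duration` time units."""
    counts = Counter()
    for stages in paths.values():
        for location, t_in, t_out in stages:
            if t_out - t_in >= min_duration:
                counts[location] += 1
    return counts


counts = count_stays(paths, min_duration=2)
print(dict(counts))
```

Nothing in these entries records how far an object travelled between two locations, which is why the distance question raised above cannot be answered from the path database alone.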

    Thank You.