The Scientific Journal of Cihan University – Slemani PP: 01-17 Volume (4), Issue (1), june 2020 ISSN 2520-7377 (Online), ISSN 2520-5102 (Print) DOI: http://dx.doi.org/10.25098/4.1.22 1 Abstract: Ontology data is one of the well-known dataset on web and it is the main dataset of semantic web. Apriori algorithm is one of the best-known algorithms of association rule mining. The result of ontology will be effective by using association rule mining with it. Ontology is rich sources of data to feed Association rule mining algorithms. This paper focuses on how Ontology and data mining can combine. First of all, the semantic web data (ontology, RDF, RDFS or RDFa) need to be prepare and valid. Then, from the semantic web data, traditional dataset will be extracted from ontology using SPARQL as a query language for semantic web. Next, we are going to mining the OWL data using one of the association rule mining (ARM) algorithm. Apriori is one of association rule mining algorithm, which can use to mine frequent item-sets. Furthermore, the proposed model transforms semantic web to traditional dataset using SPARQL query language. Then, to obtain important information from the traditional dataset, therefore new relationships produced between semantic web and data mining, will be generated. Keywords :Ontology, Semantic web, Association rule mining, Aprioir, SPARQL. ال ملخص: ساسيملف ارنت وهي النتت ابيانات شهرة على شبكات الكثر ملفاجي واحدة من انتولوت ار بياناعتب ت لشبكة ا لسيمانتك. نات. وإنلبيانون تعريف امتعلقة بقانبية اللحساول احل واحدة من اشهر اللحسابيةول الحلوري لر أبريعتب ت تيجة انتولوجي ت بدوون جمعيم قانستخدا فعالة با لتغذية جعلومات غنيا بالمنتولوجي مصدرات ار بياناعتب. تلبيانات ة تعريف ا معية قون تعريف انولحل الت اكة بيانااد شب يجب اعدت. اولبيانا تعريف انتولوجي معبحث على كيفية دمج اة. يركز هذا اللحسابي ا لسيمانك و تفعيل ت ها. ثمندية من اتقليت اللبياناص تعريف ا استخ كلغة اسسباركلستخدام اللسيمانتك بات اكة بيانا تولوجي من شبم تع ل شبكة السيم انتك.فة الى ذلكضات. البيانا عدة فقرات من استخدم لتعريفوري تم. أبريرستخدام اوول با نقوم بتعريف ا ثم ا لنموذجمطلوب الدية باستختقليت اللبياناك الى تعريف السيمانتتحويل شبكة ا يقوم بم. وبعد ذلك كلغة استعسباركل دام ال ي تم الحلى مع صول ع لوماتلبيانات.ك و تعريف السيمانتين شبكة اقات جديدة بدية لذلك يتم بناء عتقليت اللبيانامة من ملف ا ها پوختە: دئ( تاکانى ا ۆ نت ۆ ل ۆ جى) يە ک ێ ک ە ل ەسراوتا نا دا ە کان ل ە ت ۆڕ ى ئ ي نت ە رن ێ ت و فا ي لى س ە ر ە کىت( ۆڕ ى س ي مانت ي ک ەئ( .) ە پريۆ ۆ رى) يە ک ێ ک ە ل ە باوتر ي ن چار ە س ە ر ەاتمات م ي ک يە کان ک ە تا ي ب ە ت ە ب ەتاکان. ئنى دا ناساند ە نجامىئ( ۆ نت ۆ ل ۆ جى کار) ي گ ە ر د ە ب ێ ت ب ە ب ە کاره ێانى ن ي اساى ک ۆ م ەڵە ىنى د ناساندتاکان ا . ئ ە م تو ێ ژ ي ن ە ويە ە ج ە خت ل ە س ە ر چ ۆ ن يە تى ت ي ک ەڵئ( کردنى ۆ نت ۆ ل ۆ جى ل) ە گ ەڵتاکان. سنى دا ناساند ە ر ە تا ئاماد ە کردنىت( ۆڕ ى س ي مانت ي کاکردنى دواتر ه و کار) ەڵێجاندنى نتا تقلنى دا ناساند ي ديە ي کان ل ەئ( ۆ نت ۆ ل ۆ جى ل) ەت( ۆڕ ى س ي مانت ي ک ب) ە ب ە کاره يانى ن زمانى س ي مانت ي ک . دواتر ه ەڵ د ە ست ي ن ب ەئ( ناساندنى ۆڵ ب) ە ب ە کاره يانى نئ( ە ىIntegration of OWL data with Apriori Algorithm Payman Othman Rahim 1 , Wria Mohammed Salih Mohammed 2 1,2 Department of Computer Science, College of Science, University of Sulaimani, Sulaimani, Iraq [email protected]1
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
The Scientific Journal of Cihan University – Slemani PP: 01-17 Volume (4), Issue (1), june 2020
ISSN 2520-7377 (Online), ISSN 2520-5102 (Print)
DOI: http://dx.doi.org/10.25098/4.1.22 1
Abstract:
Ontology data is one of the well-known dataset on web and it is the main dataset of semantic
web. Apriori algorithm is one of the best-known algorithms of association rule mining. The result of
ontology will be effective by using association rule mining with it. Ontology is rich sources of data
to feed Association rule mining algorithms. This paper focuses on how Ontology and data mining
can combine. First of all, the semantic web data (ontology, RDF, RDFS or RDFa) need to be
prepare and valid. Then, from the semantic web data, traditional dataset will be extracted from
ontology using SPARQL as a query language for semantic web. Next, we are going to mining the
OWL data using one of the association rule mining (ARM) algorithm. Apriori is one of association
rule mining algorithm, which can use to mine frequent item-sets. Furthermore, the proposed model
transforms semantic web to traditional dataset using SPARQL query language. Then, to obtain
important information from the traditional dataset, therefore new relationships produced between
semantic web and data mining, will be generated.
Keywords : Ontology, Semantic web, Association rule mining, Aprioir, SPARQL.
:ملخصال
لسيمانتك. الشبكة تعتبر بيانات الانتولوجي واحدة من اكثر ملفات البيانات شهرة على شبكات الانترنت وهي الملف الاساسي
بدو تلانتولوجي تيجة اتعتبر أبريوري للحلول الحسابية واحدة من اشهر الحلول الحسابية المتعلقة بقانون تعريف البيانات. وإن ن
الحلول انون تعريفمعية قة تعريف البيانات. تعتبر بيانات الانتولوجي مصدرا غنيا بالمعلومات لتغذية جفعالة باستخدام قانون جمعي
ها. ثم تك و تفعيللسيمانالحسابية. يركز هذا البحث على كيفية دمج الانتولوجي مع تعريف البيانات. اولا يجب اعداد شبكة بيانات ا
انتك. شبكة السيملتعلام تولوجي من شبكة بيانات السيمانتك باستخدام السباركل كلغة اساستخلاص تعريف البيانات التقليدية من الان
المطلوب لنموذجا ثم نقوم بتعريف الاوول باستخدام الارم. أبريوري تستخدم لتعريف عدة فقرات من البيانات. اضافة الى ذلك
لومات صول على معتم الحيدام السباركل كلغة استعلام. وبعد ذلك يقوم بتحويل شبكة السيمانتك الى تعريف البيانات التقليدية باستخ
هامة من ملف البيانات التقليدية لذلك يتم بناء علاقات جديدة بين شبكة السيمانتك و تعريف البيانات.
The Scientific Journal of Cihan University – Slemani PP: 01-17 Volume (4), Issue (1), June 2020
ISSN 2520-7377 (Online), ISSN 2520-5102 (Print)
DOI: http://dx.doi.org/10.25098/4.1.22 16
shows in Table 6. Figure 6, the result of confident and support can be shown. The results are
retrieving data from ontology dataset that is present in the Figure 6.
6- Conclusion:
In conclusion, the application of Association rule mining technique is used to determine
important pattern from Ontology data has been proposed. Our algorithm of Association rule mining
is Apriori algorithm, which is used to detect the confidents of OWL data. Apriori algorithm is one
of the well-known algorithm which is unsupervised algorithm that can be used to show the
relationship and the important of data item according to other data-items. OWL ontology which
includes RDF, RDFS and other concepts has been used as the dataset, also, the query language
which is used to retrieve traditional data from the OWL dataset is SPARQL which is most famous
query language for semantic web. It can be seen that the combination of semantic web and data
mining can have an adequate result. Particularly Apriori algorithm which can have frequent item-
sets. Apriori also works according to confident of data items.
Bibliography:
AbedjanEmail, Z., & Naumann, F. (2013, July). Improving RDF Data Through Association Rule Mining. Datenbank Spektrum, 13(2).
Aggarwal, C. C., & Han, J. (2014). Frequent Pattern Mining. Springer. Agrawal, R., Imielinski, T., & Swami, A. (1993). Mining Association Rules between Sets of Items in
Large Databases. SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data . 22, pp. 207-216. IBM Almaden Research Center.
Baratia, M., Baia, Q., & Liub, Q. (2017, October 1). Mining semantic association rules from RDF data . 133, 183-196 .
Berendt, B., Hotho, A., & Stumme, G. (2002). Towards Semantic Web Mining. International Semantic Web Conference, 264–278.
Borgelt, C. (2012, December). Frequent item set mining . WIREs Data Mining And Knowledge discovery, 2(6), 437–456.
Bringas, P. G., Hameurlain, A., & Quirchmayr, G. (2010). Database and Expert Systems Applications: 21st International Conference. Springer.
Dean Allemang, J. H. (2008). Semantic Web for the Working Ontologist: Effective Modeling in RDFS and OWL . Morgan Kaufmann.
DECKER, S., MELNIK, S., HARMELEN, F. V., FENSEL, D., KLEIN, A. M., BROEKSTRA, J., . . . HORROCKS, I. (2000, September). The Semantic Web: The Roles of XML and RDF. IEEE Internet and Computing, 4(5), 63-74.
Domingue, J., Fensel, D., & Hendler, J. A. (2011). Handbook of Semantic Web Technologies. Springer.
DuCharme, B. (2013). Learning SPARQL: Querying and Updating with SPARQL 1.1 (Second edition ed.). O'Reilly Media.
Gayo, J. E., Prud'hommeaux, E., Boneva, I., & Kontokostas, D. (2018). Validating RDF Data. Morgan & Claypool .
Gayo, o. E., Prud'hommeaux, E., Boneva, I., & Kontokostas, D. (2018). Validating RDF Data (Vol. 7). Morgan & Claypool.
Giri, K. (2011, March). Role of Ontology in Semantic Web. DESIDOC Journal of Library & Information Technology, 31(2), 116-120.
Han, J., Pei, J., & Kamber, M. (2012). Data Mining: Concepts and Techniques . Waltham, MA: Elsevier Inc.
The Scientific Journal of Cihan University – Slemani PP: 01-17 Volume (4), Issue (1), june 2020
ISSN 2520-7377 (Online), ISSN 2520-5102 (Print)
DOI: http://dx.doi.org/10.25098/4.1.22 17
Hart, G. (2013). Linked Data: A Geographic Perspective . CRC Press. Hidber, C. (1999, June 3). Online Association Rule Mining . Proceeding SIGMOD '99 Proceedings of
the 1999 ACM SIGMOD international conference on Management of data, 28(2), 145-156 . Juneja, D. S., & Dixit, A. (2019). Improving Search Results Based on Users’ Browsing Behavior Using
Apriori Algorithm. Software Engineering , 73-82. kaur, M., & Grag, U. (2014, December). ECLAT Algorithm for Frequent Item sets Generation.
International Journal of Computer Systems, 1(2). Kaur, S., & Kaur, B. (2015, May). Semantic Web Mining - A Review. International Journal of
Computer Applications, 117(12), 16 - 19. Koh, Y. S., & Rountree, N. (2010). Rare Association Rule Mining and Knowledge Discovery:
Technologies for Infrequent and Critical Event Detection . SCOPUS. Lacy, L. W. (2006). Owl: Representing Information Using the Web Ontology Language . Victoria,
Canada: Trafford Publishing. Luo, Q. (2003). Advancing Computing, Communication, Control and Management . Spinger. Maedche, A., & Staab, S. (2001). Ontology learning for the Semantic Web. 16(2). Maimon, O., & Rokach, L. (2005). Data Mining and Knowledge Discovery Handbook . springer. Mohammed, W. M. (2016). Mining XML data using K-means and Manhattan. 7(7). Mohammed, W. M., & Saraee, M. M. (2016). Mining Semantic Web Data Using K-means Clustering
Algorithm. British Journal of Mathematics & Computer Science, 13(1), 1-14. Mohammed, W. M., & Saraee2, M. M. (2016). Mining Semantic Web Data Using K-means
Clustering. British Journal of Mathematics & Computer Science, 1-14. Mohammed, W., & Saraee, M. M. (2016). Sematic Web Mining Using Fuzzy C-means Algorithm.
16(4), 1-16. ReynaudEmail, J., Toussaint, Y., & Napoli, A. (2019, May 23). Using Redescriptions and Formal
Concept Analysis for Mining Definitions in Linked Data. International Conference on Formal Concept Analysis, 11511, 241-256.
Shimada, K., Hirasawa, K., & Hu, J. (2006). Association Rule Mining with Chi-Squared Test Using Alternate Genetic Network Programming . Advances in Data Mining. Applications in Medicine, Web Mining, Marketing, Image and Signal Mining, 202-216.
Shukla, A., Akanksha, & Yadav, P. (2013). International Journal on Recent and Innovation Trends in Computing and Communication , 1(12), 919 - 922.
Singh, J., Ram, H., & Sodhi, D. J. (2013, January). Improving Efficiency of Apriori Algorithm Using Transaction Reduction. International Journal of Scientific and Research Publications, 3(1).
Singh, S., & Aswal, M. S. (2018). Semantic Web Mining: Survey and Analysis. Journal of Web Engineering & Technology, 5(3), 20-31.
Uschold, M. (2018). Demystifying OWL for the Enterprise . Morgan & Claypool Publishers . Venkata, N. p., Kappara, P., & Ichise, R. (2011). LiDDM: A Data Mining System for Linked Data .