Showing posts with label Research Paper. Show all posts
Showing posts with label Research Paper. Show all posts

Monday, 30 March 2020

Pemanfaatan Semantik Web Pada Perpustakaan : Isu Interoperabilitas Dan Solusinya


Perkembangan teknologi informasi berubah secara pesat dalam kurun lima dasawarsa terakhir. Teknologi Pra-web dikembangkan sebagai alternatif metode berkomunikasi pada tahun 1960 dan teknologi tersebut terus dikembangkan menjadi teknologi web 1.0. Namun website 1.0 tersebut kurang populer bagi netizen karena fitur-fiturnya tidak interaktif. Pada era 90-an, konsep pengembangan website mempertimbangkan aspek interaktivitas sehingga teknologi web 2.0 mendorong para netizen berperan aktif dalam berbagai proyek kolaboratif. Selanjutnya, teknologi web 2.0 terus berkembang dan teknologi website saat ini mencapai versi 3.0 atau lebih dikenal dengan istilah semantik web (Miller 2014, hal.388).
Teknologi semantik web mengadopsi  model hubungan antar entitas, yakni: semantik mampu menghubungkan berbagai entitas datum yang saling memiliki keterkaitan. Ilustrasinya sebagai berikut: sebuah perpustakaan membagikan informasi dalam format text/gambar/video melalui website berbasis semantik. Selanjutnya, netizen atau pemustaka yang berafiliasi ke perpustakaan tersebut dapat menambahkan metadata berupa tag pada yang diunggah oleh perpustakaan tersebut. Mesin pencari (search engine) berbasis semantik akan mendeteksi dan mencocokkan tag tersebut dengan kata kunci pencarian sehingga tingkat keakurasian selama proses temu balik informasi berbasis semantik akan meningkat.
Meskipun teknologi semantik web memberikan berbagai manfaat bagi perpustakaan dan para netizen. Namun terdapat aspek teknis berpotensi menghambat penggunaan teknologi semantik web pada layanan perpustakaan. Makalai ini bertujuan membahas keunggulan semantik web dan tantangannya. Makalah ini turut merekomendasikan solusi bagi tantangan tersebut.

Keunggulan Teknologi Semantik Web
Model hubungan antar entitas pada semantik web meningkatkan keakurasian proses temu balik informasi. Hart, Hogan, Umbrich dan Decker (2008, hal. 2) menyatakan bahwa model dokumen sentris (document-centric) pada web tradisional hanya mampu mencocokkan antara frasa yang identik pada sebuah dokumen dengan kata kunci pada mesin pencarian. Sebaliknya, model hubungan entitas (entity-centric) pada semantik web memiliki interkonekvitas dengan berbagai entitas varian informasi, seperti: orang, tempat dan berita. Model tersebut tidak hanya mencocokkan frasa pencarian yang identik dalam sebuah dokumen tetapi mengalisis keterkaitan hubungan antara frasa tersebut dengan informasi lain yang terkait. Proses temu-balik informasi berbasis semantik akan menghasilkan informasi yang akurat dan efektif. Semakin jelas bahwa model hubungan entitas merupakan basis pencarian informasi secara cerdas karena model tersebut menganalisis informasi yang relevan dengan kata kunci pencarian.
Peningkatan aspek keamanan merupakan kelebihan lain dari semantik web. Menurut Berners-Lee, Hendler dan Lassila (2001, hal.  31), semantik web menyediakan fitur tanda tangan digital (digital signature) dalam perangkat lunak agen. Fitur ini akan mengenkripsi blok data yang tidak dilengkapi dengan sumber yang terpercaya tertentu. Fitur ini akan memungkinkan pengguna untuk memverifikasi data sebelum mereka mengakses informasi dari isi web lain. Hasilnya, fitur keamanan pada semantik web akan melindungi netizen dari situs-situs yang tidak dipercaya.

Tantangan Penggunaan Teknologi Semantik Web.
Pemanfataan semantik web akan menghadapi beberapa tantangan. Menurut Benjamin, Contreras, Chorcho dan Gomez-Perez (2002, hal 5-10), tantangan semantik web akan mencakup berbagai bidang, seperti: ketersediaan konten web semantik, pengembangan ontologi, skalabilitas, multilinguality, visualisasi dan stabilitas bahasa web semantik. Tantangan-tantangan tersebut akan menghambat penggunaan semantik web.
Namun salah satu isu yang menonjol adalah interoperabilitas metadata pada semantik web. Prosedur kerja teknologi semantik melibatkan dua elemen penting, yakni: struktur onkologi dan bahasa program eXtensive Markup Language (XML). Struktur ontologi berfungsi sebagai alat pemandu bagi mesin pencari untuk mengambil informasi secara akurat dan XML merupakan alat pengunduh metadata. Walaupun XML mampu bekerja pada multiplatform dan mampu menggambarkan objek menggunakan tag. Namun kelemahan XML adalah ketidakmampuan untuk menemukan hubungan antara entitas. RDF Schema mengatasi kelemahan ini dengan menggunakan bahasa ontologi (Bygstad, Ghinea, & Klæboe 2009, hal 974-975). Meskipun semantik web memiliki kemampuan untuk terhubung data dengan menggunakan skrip bahasa XML, metadata pada semantik web memiliki format yang bervariasi. Selain itu, model eksklusif menciptakan kesulitan untuk bertukar data. Akibatnya, model yang eksklusif akan menghambat proses interoperabilitas pada semantik web (Stan dan Maret 2012, hal.6),

Solusi bagi Interoperabilitas Semantic Web
Tantangan interoperabilitas dapat diselesaikan dengan memperluas metadata namespace Resource Description Framework (RDF). Metadata dapat ditambahkan ke dalam HTML 5, XHTML dan isi XML dengan menggunakan atribut RDF sehingga RDF dapat menghubungkan berbagai jenis metadata (Miller 2014, hal.298)..

SIMPULAN
Salah satu perubahan signifikan pada teknologi website adalah perubahan platform website yang bersifat HTML statis ke HTML dinamis. Perubahan platform tersebut meningkatkan keakuratan hasil pencarian informasi karena website semantik menggunakan model hubungan entitas dengan bahasa program XML. Keunggulan model adalah peningkatan keakurasian pencarian informasi dan peningkatan keamanan data.  Walaupun semantik web memiliki beberapa keunggulan, tetapi teknologi tersebut menghadapi tantantangan berupa interoperabilitas. Adapun solusi interoperabilitas adalah memperluas ruang nama metadata pada RDF dan RDF akan menghubungkan berbagai format metadata sehingga teknologi semantik dapat bertukar data lintas platform.

DAFTAR PUSTAKA
Benjamins, R. and Contreras, Jesús and Corcho, Oscar and Gómez-Pérez, A 2002,”The six challenges of the Semantik Web”. In Horrocks & Hendler (ed.), Refered proceedings of the First International Semantik Web Conference, ISWC2002, Springer Verlag, pp. 1-15.

Berners-Lee, T, Hendler, J and Lassila, O 2001,"The semantik web." Scientific American Magazine. vol. 284 no. 5 pp.28-37.

Bygstad, B, Ghinea, G & Klæboe, G 2009, “'Organisational challenges of the semantik web in digital libraries: a Norwegian case study”', Online Information Review, vol. 33, no. 5, pp. 973-985.

Harth, A, Hogan, A, Umbrich, J, Decker, S & Ireland, SF 2008,”Building a Semantik Web Search Engine: Challenges and Solutions”, Refered proceedings of the 3rd XTech Conference, Science Foundation Ireland, pp. 1-14.

Miller, JB. 2014,”Chapter 12 XML Primer”. In the Library and Information Science Text Series: Internet Technologies and information services 2nd. Santa Barbara, Libraries Unlimited.

Miller, JB. 2014,”Chapter 16 Libraries and the internet: Learning from the past, exploring the future”. In the Library and Information Science Text Series: Internet Technologies and information services 2nd. Santa Barbara, Libraries Unlimited.

Stan, J and Maret, P 2012,”Semantik metadata management in web 2.0”. In  Akerkar, Dumitru & Burdescu (ed.), Refered proceeding of International Conference on Web Intelligence, Mining and Semantiks (WIMS), June 2012, Romania. Association for Computing Machinery, pp.1-6.

Identifying Impacts Of Mass Digitization: A Case study of Google project

The access of electronic books by library users has increased among university students (Shelburne 2009). This circumstance encourages libraries to provide the digital collections. As the result, libraries digitized their book collections on a massive scale.Many libraries cooperate with commercial organizations in the digitizing book project, due to a lack of budget (Nielsen 2008). However, in the view of some experts, the project infringes the copyright law as these commercial partners use the digitizing books for commercial purpose. This essay will first describe the trend toward mass digitization and then identify problems due to effect of mass digitization on the right holders’ protection, particularly the copyright of orphan books. It the put forward arguments that government should protect the copyright holders of orphan books due to mass digitization.
On one hand, the digitizing books give advantages for libraries and its users. One of the advantages is that these books make accessing information easier. Digitized books have a compatibility with modern devices, such as smartphones and tablets. As the result, they can be accessed wherever and whenever by users. In addition, digitizing books have an added value, compared with printed books. The reason for this is that these books provide audio visual features, such as audio books, downloadable text and audio, electronic braille, so the disabled have an opportunity to access information (Myhill 2002). Furthermore, another benefit of digitizing books is to provide an opportunity for libraries to preserve information more economically and effectively. The digitizing books would need a less storage spaces as libraries deposit these books on the computer server. Compared with a conventional storage, computer servers might deposit millions of digitizing books in on square meter and this might prevent digitizing books from damage by insectsor natural disasters. This leads to reduction in the expenditure of book maintenance. Thus, most libraries digitize their printed book collections to improve accessibility to information in books and to preserve their collections.
On the other hand, the mass digitization project caused problem of copyright infringement by threatening right holders. This problem was caused by cooperation among public libraries and commercial organizations. In fact, the first project of mass digitization, which involved a commercial institution, was called the Google 5 projects and five of the largest libraries, namely Harvard University, University of Michigan, New York Public Library, Stanford University and Oxford University were involved. This project became concern among scientists and practitioners of Library and Information sciences due to effect mass digitization on copyright law. According to Hanh (2006), the project digitization of books began in 2004 and this project succeeds to digitize approximately 15 million of books.  In this project, Google, which was a commercial partner, would give one copy of digitizing books to these libraries. Meanwhile, Google also would redistribute these books through the company website. The site would provide snippets, which displayed only three lines, for digitizing books, so these features would make retrieval information on digitizing books easier for users.The increasing number of users, who were access digitized books, encouraged Google to obtain financial benefit. Thus,the Google 5 earns money from advertisements and subscriptions. However, the Google 5 project was sued by Author Guild America and Association of America Publisher. These plaintiffs point out that those authors had a privilege on derivative advantages, which included electronic rights from their books. Although they had privilege, they would have lost their financial gain because the Google 5 did not ask permission to use their books.
Another problem of mass digitization was orphan books licensing. A preliminary study of United States of Copyright Department indicates that orphan books licensing would become obstacle for mass digitization. According to Office (2006), orphan book can be defined a book which required permission from copyright holders, but the right holders cannot be located or found by the parties who wishes to use this book. The obstacle of orphan books came from extending the length of copyright holders from 20 years to 70 years from the author death, so parties who want to get author permission cannot get licensing to redistribute orphan books because the difficulty of getting license of orphan books is to find the right holders, who might be die but the their copyright have not already expired. In addition, the number of orphan books increased year by year due to copyright holders’ extension. According to Clair (2006), the extension implied on increasing number of orphan books with around 90 per cent of books published from 1923 to 1950. Furthermore, the cost of re-licensing orphan books was expensive. The expenditure to re-locate original author of orphan books and to register these books were included into re-licensing cost.
Turning to the question of why government should protect the copyright holders of orphan books due to mass digitization. There are two reasons to answer this question. The first reason is the right holders of orphan books contribute to enlighten societies by disseminating knowledge through their books. The second reason is the more challenging issue on the orphan books protection because the right holder would meet the books piracy due to mass digitization. The  books piracy is concerned by some experts. According to Kevles (2013), the advantages of mass digitization project for equality accessing information cannot be a justification to occupy the copyright of orphan books illegally. The difficulties to find the right holders of orphan books is not a justification for use their books without their permission. Although the right holdersmight bedie , they still deserves to receive the copyright protection for their books.
In conclusion, this essay has described the trend of mass digitization cope with the problem and its solution. It cannot be denied that mass digitization gives advantages for societies. The project of mass digitization provides indexing of digitize books, so societies can use these books for education and research purpose. However, the mass digitization caused problem for threatening right holders.This situation might be happen since the Google 5 project tends to earn money from advertisement and subscription, while the project did not ask permission to right holders, whose books were digitized by Google. Another problem of mass digitization is orphan books licensing. The extension of license books published from 1923 to 1950 made obstacle for the project to get permission. To cope with these problems, it is suggested thatgovernment should protect the copyright holders of orphan books because the right holders of orphan books have a contribution to enlighten societies and they have met a weak position to against books piracy due to mass digitization.
 Top of Form
REFERENCESBottom of Form
Clair, G.S (2006). Mass Digitization Projects: Celebration and Challenges: 3. Retrieved from www.ulib.org/conference/2006/4.pdf

Kevles, B. (2013). Will Google Books Library Project End Copyright ?. AAL Spectrum May: 36. Retrieved from http://www.aalnet.org/
Offices. (2006). Report on Orphan Works: A Report of Register of Copyright: 15. Retrieved from http://www.copyright.gov
Shelburne, W. A. (2009). "E-book usage in an academic library: User attitudes and behaviors." Library Collections, Acquisitions, and Technical Services 33(2-3): 59-72.