Certain Investigations on the Performance Analysis of Clustering, Indexing and Versioning Techniques in XML Data Management

by Palanisamy S

Institution: Anna University
Year: 2015
Keywords: Binary Large Objects; Character Large Objects; eXtensible Markup Language; Relational Database Management Systems; Standard Generalized Markup Language
The growth and evolution of the Internet offer a wide range of information-sharing facilities that enable interactive, distributed, and collaborative computing. Because hardware and software platforms are heterogeneous, it is essential to provide a platform-independent means of data exchange between web and collaborative applications. eXtensible Markup Language (XML) is a markup language derived from Standard Generalized Markup Language (SGML) and is used as a medium to transport and represent data in World Wide Web (WWW) applications. It can be used to store structured, semi-structured, and unstructured data, and it is easy to represent data in XML format. XML representations of data are stored as files in a file system, as tables in Relational Database Management Systems (RDBMS), in native form in relational databases as Character Large Objects (CLOBs) or Binary Large Objects (BLOBs), and as objects in object-oriented databases. Data stored in native XML form in relational databases requires a simple and efficient technique for access and manipulation.

In this research work, a novel clustering technique called similarity-based clustering is proposed to cluster the data stored in a single large XML file, BLOB, or CLOB. It is based on the content values of the elements stored in a table or file: elements with similar content are moved to nearby locations. It is shown that similar elements can then be accessed as a group for computing, summarization, or manipulation. The proposed clustering method is combined with the primary, secondary, and full-text search indexing techniques supported by RDBMS to access data stored in native form. The proposed approaches use SQL Server 2005 and Oracle Berkeley DB XML.

References: p. 162-175.
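The idea of moving elements with similar content next to each other can be illustrated with a minimal Python sketch. This is not the thesis's algorithm: the abstract does not define its similarity measure, so the sketch assumes the simplest possible one (case-insensitive equality of one child element's text). The function names `cluster_by_content` and `group_sizes`, the `SAMPLE` document, and the `subject` key are all hypothetical.

```python
import xml.etree.ElementTree as ET
from itertools import groupby

# Hypothetical sample document; tag names are illustrative only.
SAMPLE = """<library>
  <book><title>A</title><subject>XML</subject></book>
  <book><title>B</title><subject>SQL</subject></book>
  <book><title>C</title><subject>xml</subject></book>
  <book><title>D</title><subject>SQL</subject></book>
</library>"""


def _content_key(record, key_tag):
    # Assumed similarity measure: normalized text of one child element.
    return (record.findtext(key_tag) or "").strip().lower()


def cluster_by_content(xml_text, key_tag):
    """Reorder the root's child records so that records whose key_tag
    content matches end up in adjacent positions."""
    root = ET.fromstring(xml_text)
    records = list(root)
    # A stable sort keeps the original order inside each group.
    records.sort(key=lambda rec: _content_key(rec, key_tag))
    for rec in list(root):
        root.remove(rec)
    root.extend(records)
    return root


def group_sizes(root, key_tag):
    """Summarize the clustered records group by group. This relies on
    similar records being adjacent, which clustering guarantees."""
    keyed = groupby(root, key=lambda rec: _content_key(rec, key_tag))
    return {key: len(list(group)) for key, group in keyed}


if __name__ == "__main__":
    clustered = cluster_by_content(SAMPLE, "subject")
    print([book.findtext("title") for book in clustered])
    print(group_sizes(clustered, "subject"))
```

Once similar records are adjacent, a per-group summary needs only a single sequential pass, which is the kind of group access the abstract describes. A real similarity measure would replace `_content_key`.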