Sunday, 8 January 2012

Web 2.0 - Open Access



In this post I will discus Open Access benefits and issues if any. With the development of the Internet and The World Wide Web access to information changed drastically. Starting with Web 1.0 where HTML can be read and linked, moving to Web 2.0 where users can get involved into interaction by commenting, personalising web pages etc. Furthermore, semantic web makes it even more proactive as makes websites machine readable which means information can be found, combined and acted upon by computers. However, with so much information on the Web being placed the issue of information value arises. Should users be charged for information they seek or should it be free?
When the first social networks emerged excitement was so high that users didn’t mind paying a fee to access them. Take for instance website Friends Reunited. However, as more social networks free of use started to emerge popularity of paid websites plunged. Willingness to pay for information is very low; especially for electronic information. Due to vast amount of it available on the internet users will choose the one which is free against the paid one. In the information management and library services situation is complicated as well. Even if the cost may be determent the value stays unknown. The certain books, documents which have known cost can be used to look for information, however outcome is hard to predict and as a result value of this information can not be established. Raban (2007) argues that “the value of information perceived by the user may change with changing market or social influences, it may change over time or due to the accumulation of experience.”
Let us take a closer look at the journals; their prices were increasing steadily for a long time which results in “not only must libraries cope by cancelling subscriptions and cutting into their book budgets, but researchers must do without access to some of the journals critical to their research.” (Suber 2003) As Karen Albert points out library organisations were concerned about subscription price increases from 1980. Furthermore, it has been stressed out that high prices might be influenced by profit-seeking publishers. “Dual pricing levels force libraries to routinely pay more than ten times the price charged individuals for the same subscription.” (Albert 2006) The first electronic journals didn’t solve the problem of the high subscription prices. The libraries are prevented from using electronic journals in the same full and free way that they may now use print journals due to licensing and software locks. (Suber 2003) And that’s when steps in Open Access (OA).
With the beginning of Internet service new ways of publishing and accessing information became known. In 2003 Berlin declaration on Open Access stated the goals as follow:
Our mission of disseminating knowledge is only half complete if the information is not made widely and readily available to society. New possibilities of knowledge dissemination not only through the classical form but also and increasingly through the open access paradigm via the Internet have to be supported. We define open access as a comprehensive source of human knowledge and cultural heritage that has been approved by the scientific community.
In order to realize the vision of a global and accessible representation of knowledge, the future Web has to be sustainable, interactive, and transparent. Content and software tools must be openly accessible and compatible.
In short the open access to electronic information is free of charge to everyone and is available for unrestricted reading, downloading, copying, sharing, storing, printing, searching, linking and crawling. (Suber 2003) As Budapest Open Access Initiative states there are two ways to achieve it: self-archiving which is known as “green” road and open-access journals as “gold” road. Open-access journals contain peer reviews and are freely available to the readers. Self-archiving makes content available for free and contain unrefereed preprints, refereed postprints, or both. (Suber 2003) However, free access to information does not mean that publishing and making it accessible is free. If article is going to be published as electronic version rather than hard copy the certain cost will still apply. Take for instance “green” road; the researches can be placed in the archives which belong to institutions like universities, colleges etc. These archives are maintained by open-source software which has its costs as server space and time of technician. The “gold” road has its price as well; distributing the files to reviewers, tracking progress, nagging dawdlers, facilitating communication, and collecting data. (Suber 2003) There are two options to cover the costs: author-pays vs. user-pays. (Esposito 2004) Both of these categories can have their sponsors. User-pays category can be covered by libraries and author-pays by research institutions.
If to publish an article author has to find proxy to cover publishing costs and make it freely available without expecting to make money out of it one might raise a question what are the benefits of Open-Access. The articles are written to benefit further researches and advance professional statuses. Furthermore, authors do not seek benefit financially. It goes back to XVII century when early journals could not afford to pay authors and this tradition of writing for impact instead of payment continued. (Albert 2006) The evidence is still being collected however the statement can be made that freely available papers have greater research impact. (Antelam 2004) On-line published researches with open access have more potential to be downloaded, read and cited than those published in print journals. Impact rather then financial reward was always the main motivation for researchers. On another hand it has to be mentioned that librarians are likely to continue to pay for user-pays journals that their constituencies (faculty and students) want, and authors are likely to continue to submit papers to user-pay journals if such journals will advance them professionally. (Esposito 2004) The journals with a reputation will motivate authors to publish their academic research in these journals along to making the researches freely available to access on line. It seems that for some time user-pay category will be paying and for physical journals and for open-source software maintenance which indicates that expenses will not be reduced straight away. As Esposito pictures this “The faculty, who must continue to read in their disciplines and want access to the publications they publish in themselves – and whose professional advancement is tied to these very same journals – will continue to press libraries to spend, spend, spend.” Once again willingness to pay for information is based on authority. The articles published in the academic journals are more trustworthy as they have been reviewed by peers before publishing which doesn’t apply to electronic versions. Self-archiving doesn’t have peer reviews. It leads to question how can one decide which information published on the internet is reliable. The blogs especially has the issue of reliability. However, there is a space for publishers to step in and offer wider readership and recognition for the authors. For example, the publisher may automate the metatagging process, create an abstract of the text of the blog and use it to search other web sites or create a database by discipline of scholars and e-mail them whenever a posting in their areas appeared, inviting them to comment on the blog. (Esposito 2004) There are more services to be considered before authors’ and readers’ needs are satisfied.
As mentioned earlier the main benefit of the open-access is: information is freely available to wider audience via internet. Another few vital advantages of open-access have been discussed in The Open Access Open Data conference which took place in Cologne, Germany on 13-14 December 2010. (Giglia 2011) One of the speakers Rainer Kuhlen offered a “knowledge ecology” concept which suggests that publishing models are only acceptable when they acknowledge the status of knowledge as a commons, allowing free and open access for everyone. As a result, the commercial use of publicly produced knowledge should be exception and open and free access the default. Alma Swan stressed out that open-access connects readers worldwide and fosters international and interdisciplinary collaboration. When discussing open data issues Celina Ramjoue described European Commission’ vision for 2030 “..point in the direction of an infrastructure that supports seamless access, use, reuse and trust of data. It suggests a future in which the data infrastructure becomes invisible, and the data themselves have become infrastructure – a valuable asset, on which science, technology, the economy and society can advance.” Toby Green and Jan Brase addressed findability problem. It was suggested integration of books, journals, datasets and tables in a single search engine within the OECD library and called for libraries to open their catalogues to non-textual materials. (Giglia 2011) The dialogues between professionals encourage development of open-access; it gives an opportunity to new perspectives. This is where authors and librarians should meet and make changes to the world ruled by publishers not only by saving money but making access to knowledge easy and free. Librarians with their skills to manage information are the ones who should be leading towards successful open-access use in practice.
Albert (2004) distinguishes 3 steps in achieving price reduce: 1. development of more OA (“gold” road) and SPARC-type (lower subscription cost) journals; 2. increased implementation of institutional repositories and self-archiving; 3. more funders mandating deposit of grant-supported manuscripts in free archives.
Another example would be contributor-run libraries who allow people to contribute. By giving contributors and readers access to tools for evaluating, ranking, and managing the collections, we are building communities of intellectual discourse. (Jones 2001) Libraries which encourage contribution and interaction open new possibilities for society.
Brophy (2007) lists challenges which arise for librarians with the open-access: 1. self-archiving authors are reluctant in doing so and libraries who undertake the work of depositing on behalf of authors are taking expensive route; 2. it is not always clear what version the repository is holding as there are pre-prints, direct copies and post-prints; 3. retrieve the relevant item will be difficult due to institutionally based repositories which contain items from a wide variety of subject domains; 4. there is some concern that repository owners could be tempted to take monopolistic actions; 5. tension between open access and long-term preservation.
Open-access is changing our understanding about information and use of it. It brings new ways of managing and sharing knowledge. It is ongoing process; the open access is being developed continuously as with every new tool made available more needs for new services arise.


References:
Antelman, K. (2004), Do Open-Access Articles Have a Greater Research Impact? College and Research Libraries, 65 (5), 372-382 (available at http://crl.acrl.org/content/65/5/372.full.pdf+html )

Albert, K.M. (2006). Open Access: implications for scholarly publishing and medical libraries, Journal of the Medical Library Association, 94(3), 253-262 (available at http://0-web.ebscohost.com.wam.city.ac.uk/ehost/pdfviewer/pdfviewer?vid=3&hid=14&sid=0ff4713b-0c58-46cb-8709-085984e6920d%40sessionmgr4 )

Brophy, P. (2007) The Library in the twenty-first century, second edition, London, Facet Publishing

Esposito, J.J. (2004). The Devil you don't know: the unexpected future of Open Access publishing, First Monday, 9(8) (available at http://www.uic.edu/htbin/cgiwrap/bin/ojs/index.php/fm/article/view/1163/1083 )

Gigla, E. (2011), Open Access, Open Data: Paradigm Shifts in the Changing Scholarly Communication Scenario, D-Lib 17(3/4) (available at http://www.dlib.org/dlib/march11/giglia/03giglia.html )


Jones, P. (2001) Open (source)ing the doors for contributor-run digital libraries. Communications of the ACM, 44 (5), 45-46 (available at http://0-cacm.acm.org.wam.city.ac.uk/magazines/2001/5/7355-open-sourceing-the-doors-for-contributor-run-digital-libraries/fulltext )

Raban, D.R. (2007), User-centred evaluation of information: a research challenge, Internet Research, 17 (3), 306-322 (available at http://0-www.emeraldinsight.com.wam.city.ac.uk/journals.htm?issn=1066-2243&volume=17&issue=3&articleid=1610804&show=html )

Suber, P. (2003) Removing the Barriers to Research: An Introduction to Open Access for Librarian, College & Research Libraries News, 64, 92-94 (available at http://www.earlham.edu/~peters/writing/acrl.htm )

http://oa.mpg.de/berlin-prozess/berliner-erklarung  (visited on 11 Dec 2001)

http://www.soros.org/openaccess/read (visited on 11 Dec 2011)


My blog http://vitalijazoglaite.blogspot.com/

Sunday, 30 October 2011

DITA Coursework Part 1 

The World Wide Web and its impact on society

We probably couldn’t imagine a single day without computers and access to the Internet. Regardless to our age or status the service, namely, the World Wide Web provided by the Internet is essential in many aspects of our life. The huge impact it has is undeniable. I will be discussing what the World Wide Web is and what impact it has.

When we use our computers to access the internet for information search or retrieval or for communication purposes we actually use the World Wide Web (WWW). It’s quite common for consumers to use the Internet and the World Wide Web terms interchangeably. The most obvious distinction between these two terms is that the later is the former’ service as mentioned above. WWW has been around from 1990 and was created by Tim Berners-Lee. The main purpose of it was to allow remote access to academic documents and it was so successful in academic field that very quickly it spread to other areas. By 1995 the Internet and WWW proved their enormous significance world wide.

The hypertext mark-up language (HTML) was invented by Tim Berners-Lee to define the structure of web pages. It tells the browser how to display text and pictures to users so they do understand tem. The design was so good that it started to be used for vast range content representation as well. HTML has a very simple design. There are a few main elements in HTML such as <html>, <head>, <body>. They are called nested elements as they contain other elements within them. Some elements like <p>, <a> have start tag and end tag, some of them like <br>, <hr> are empty content. Take for instance tag <br>, it stands for single line brake and doesn’t have end tag. Opposite to it, tag <p> defines a paragraph and has an end tag </p>, the content is typed between these two tags. Linking in HTML document is specified by anchor element <a> and its attribute href which specifies a destination of the link. The anchor tag can be inserted anywhere in document body.
Another specification which can be used in HTML is Cascading Style Sheet. The simple HTML documents are static and can be used to read the texts and move from one document to another by using the links. However, these documents can be styled as well. It’s done by a) linking to external style sheet, b) inserting style element in <head> element, c) importing style sheet and d) using it as style attribute on element.
These simple HTMLs developed with the time and are more interactive today.

The World Wide Web works on client-server base: client sends request to server, server receives it and responds by sending digital message back to client, when respond is received client interprets it and displays as a web page.

CLIENT
  NETWORK
SERVER
        
CONNECTION


This picture is the simplest way in representing information request on the Internet. There is more than just one server and there are many more than one client sending requests to different servers so in reality the system is more complicated. As client wants to receive reply relevant to its inquiry it has to send request to the server which actually holds required information.
Therefore, client sends request by Uniform Resource Locator (URL) which was developed by Tim Berners-Lee and which specifies where precisely the required information is stored.


“index.html” is a file which is placed in “~abkc925” folder where “~” indicates that it’s individual’s not a company’s page. The folder is on server www at “city” (computer owner name) and belongs to academic domain “ac” in the United Kingdom “uk”. “http” stands for hypertext transfer protocol which means that document is a World Wide Web hypertext document transferred using hypertext  transfer protocol. Hypertext means the text which contains links to other texts and is used in the World Wide Web.

In XXI century it would be hard to make use of computer if it doesn’t have connection to internet. The first computers acted as the systems of storage. Today when WWW can be employed for information retrieval they act as tools for communication, creation and learning. The computer connected to internet is essential part of our personal and professional life. We use it for research purposes, for sharing information, for social interaction, for payments. The popularity of internet is due to the fact that it’s easy to use and there is a vast amount of information which can be searched in a little time. Take for instance the libraries. They are connected to the Internet for public access and for staff use. As the Internet and the World Wide Web developed libraries became hybrid libraries providing access not only to books, journals, newspapers, etc as a hard copies but electronic variations as well. They started to offer far better delivery of information than before. The links and cross-search improved library services to users; local libraries use OpenURLs which “consists of the address of the link resolver and the bibliographic data needed to identify the requested resource”. (Brophy 2007)
The experience of librarians to organise and provide access to information as well as search, brows and index are used to organise digital information. Most of libraries’ resources today can be reached on web pages. The catalogues can be accessed on line from the comfort of your own home. On another hand, access to the internet in public libraries is used to attract more users.
One can argue that there is information overload today. However, consumers can develop skills required for efficient browsing and information evaluation with help from their local libraries. Furthermore, most of the web pages are well organised which makes it easy to navigate and find what you are looking for. A new generation starts surfing internet from their early years and develop their practical skills exceptionally fast. The older generations are still in process of learning. Nonetheless, the Internet and the World Wide Web have a huge impact on everyone’s life.




References and Resources 

Brophy, P. (2007), The Library in the Twenty-First Century, Second Edition, London: Facet Publishing
MacFarlane, A., Butterworth, R., Dykes, J. (2011), Lecture 2: The Internet and World Wide Web, INM348 Digital Information Technologies and Architecture, City University London: http://moodle.city.ac.uk/file.php/26243/SESSION02/lecture2.pdf
Morville, P. and Rosenfeld, L. (2007), Information Architecture for the World Wide Web, Third Edition, Sebastopol: O’Reilly Media
http://www.w3.org/TR/REC-CSS1/ (visited on 29.10.11)





Monday, 17 October 2011

Internet and World Wide Web



One of the Digital Information Technologies and Architecture lectures was about Internet and World Wide Web (WWW). Most of the times these two terms are used interchangeably. However, the definitions are quite different. www.thesaurus.com suggests that internet is “a vast computer network linking smaller computer networks world wide. The Internet includes commercial, educational, governmental, and other networks, all of which use the same set of communication protocols”; when WWW is “a system of extensively interlinked hypertext documents: a branch of internet.”

If we look at the history of internet and www, we see that internet was proposed by the American military in 1960 with the development of computers. It does let us to access files, data, etc from central computer and share resources. Www has been around from 1990 and was created by Tim Berners-Lee. He “developed the first web browser (…,) including an integrated editor that could create hypertext documents. He deployed the program to his and Cailliau’s computers, and they were both communicating with the world’s first web server (…)” (http://www.livinginternet.com/w/wi_lee.htm 15.10.2011). The World Wide Web allowed view hyperlinked academic documents remotely and as it proved to be a convenience system in the academic field, www was introduced to other areas.

The internet and WWW work on the client-server bases. 

CLIENT
NETWORK
SERVER

CONNECTION






Client sends request to server, server receives request and responds to it by sending digital message, client receives message, interprets it and displays as web page.

The example above is one of the simplest ways to represent information receive process. It’s far more complicated in reality. There are lots of servers and they hold lots of different information.
Client sends request by using Uniform Resource Locator (URL) which helps to find right server which contains requested information.

That’s how it looks:


“Conditionsofuse.html” is a file which is placed in folder “conditions” within “cs” folder. The latter is on the server “www” at “city” university, which is part of the academic domain “ac” in the United Kingdom “uk”. As mentioned earlier “www” indicates World Wide Web and “http” stands for hypertext transfer protocol, it means that document is a World Wide Web hypertext document and it’s transferred using hypertext transfer protocol.
Hypertext simply means the text which contains links to other texts and is used in World Wide Web. Furthermore, it means that documents can be linked between each other even if they are stored on different servers. Mark-up elements are used to achieve it and the best example is HTML (Hypertext Mark-up Language).

So, it was all theoretical part of the Internet and World Wide Web. The next thing is the practical exercises: Learning to use and publish HTML.

Before I started creating my HTML I looked at some elements as paragraph, line brake and horizontal rules in www.w3schools.com (15.10.2011).
<p> defines paragraph in HTML document. Like most of the elements it has start tag <p> and end tag <p/>. Between these two tags is element content; the text which you would like to appear in that paragraph.
Line brake <br> is an empty element and it doesn’t require end tag. You insert it before the text which you would like to appear in a new line without starting a new paragraph.
Horizontal rules <hr> is another empty element. It is used to separate text.
All three elements have attributes. <p> supports a few standard attributes such as class, style, title, id and others. The listed attributes are supported by <br> element as well as <hr>.
After familiarising myself with another few essential elements like <html>, <head> and <body> I started creating HTML document in an html editor.

My first.html includes two links to documents on different servers and two links to html pages created by me and saved on City server. The most fascinating part of creating HTML document to me personally was linking and inserting pictures. The rest was quite straight forward. As we used html editor all we had to do was to fill in a template. Therefore I will explain documents linking and inserting picture.

Links in html are specified by <a> and it’s attribute href which specifies the destination. Attribute, URL and link text in this particular order has to be between anchor tags <a> and </a>. The anchor tag itself can be inserted anywhere in text.
Something like this:
Little bits and pieces <a href=”www.student.city.ac.uk/~abkc925/index.html”> about me </a>.
The link in the text will be displayed like this: about me.

Tag for images is <img> and it doesn’t have closing tag. Attribute to display picture is src.  Width and height specifies size of the picture.
<img src=”lithuanian big.jpg” width=”444” height=”251”>

For the picture to be displayed in an html document it has to be in the same folder as html document. If it happens that picture can not be displayed there is an alt attribute which specifies an alternate text for the image. It would be placed at the end of the element content.
<img src=”lithuanian big.jpg” width=”444” hight=”251” alt=”Local Image: Trakai”>

If picture is not in a local file but on the World Wide Web URL has to be used to display it.

Overall, I learned differences between the Internet and The World Wide Web. Furthermore, I created and published my HTML document linked with another two HTML documents as well as links to documents on different servers.

Wednesday, 28 September 2011

Introduction to Computing

I have to say it's all very exciting!

I couldn't make it for the first DITA session so I did some investigating and studying at home.

I read introduction notes and found interesting and useful information there; what is computer, digital information and binary numbers. I learned that computer consists of "Central Processing Unit", input and output devices, memory and external storage (as in von Neumann architecture). Simple as it is digital information is a combination of symbols which represents things in a way it will be understood by user. Binary digit is "bit" and is either 0 or 1. ASCII - a common encoding for text and is based on 7-bit. There are more different formats to represent pictures, videos, etc. for instance file and document.

After reading notes I transformed my living room to a lab and started experimenting.
Thanks to Andy MacFarlane guide lines the exercises were quite easy. I opened Notepad and saved it as a text document "weather.txt". Then I opened the text file in MS Word document and saved it as "weather.doc" After I had closed it I reopened the file with notepad application. What I got in front of me wasn't meaningful text but rather chaotic symbols. The notes explained that "many characters do not relate to ASCII so can not be interpreted meaningfully as text".
The next step asked me to open "weather.doc" and save it as HTML document. As a result I got new document saved as "weather.htm". I opened this document with Notepad application and once again there were no meaningful text.
The notes explained that these "tasks rely upon a file-centered view of information" and the following task was to introduce me to a document-centered view.
I opened my "weather.doc" document and inserted the picture. The new document I saved as "weather2.htm" Then I compared all the documents created.

It was absolutely new experience for me and I enjoyed and learned a lot.