LAUC-B Fall Assembly
Tuesday, 16 November 2010
8:30 – 10:00 am
I. Call to order by LAUC-B Chair Susan Koskinen
II. Welcome by University Librarian Tom Leonard
Tom Leonard graciously shared his notes on his welcoming remarks, which follow:
I don’t know what Brewster Kahle will say, it will be interesting if we see things the same way, but I am here today with the feeling that an era of cooperation is breaking out in handling the books digitized in the past few years.
We are no longer waiting around and speculating on what Judge Chin will decide about the Google Books Settlement. We take as a given that litigation here will go on and on.
We are talking about making the world of digitized books better, though the other means possible – e.g. Bernie Hurley at the recent Google partners meeting. HathiTrust, with 7.2 million volumes, is gearing up for expanded access to public domain and the concept of lending digital copies. Last year Hathi was the size of an average ARL Library, today it is about the equivalent of the University of Minnesota – 1.7 volumes are now in the public domain. UC Berkeley is talking with libraries of comparable size about quietly doing this.
The Samuelson Law, Technology and Public Policy Clinic (a group of faculty and fellows that spans Berkeley Law and the Information School) would like to work to realize the potential of copyright to green light the digital lending of books, the access to orphan works, and the “ordering” of books by authors into the public domain. Michigan, Stanford, and others, including Google, are working on ways to discover if there are rights holders and to do something to change the current landscape that serves no one’s interest: Authors not being clear on what rights they gave up and publishers on what rights they received.
More cooperation is in sight in the preservation and storage needs that the downturn in funding as well as the digitized legacy collection create. WEST, the Western Regional Storage Trust, will pick off journal runs that are backed by trusted digital files. The more difficult job of determining the optimal number of paper copies of monographs – then storing that number safely across the US – is proceeding by baby steps, by CDL and IMLS for instance at an October meeting in Chicago with many other partners.
Prospectively, ARL libraries have been consulting with AAUP about investing in the
digitization of back files so that ebooks of university presses are the sorts of products we want as part of our collection, allowing for multiple users, sharing, and so forth. Print-on-demand (POD) services will be rolled out in the New Year across UC, giving us a better chance to see if ebooks and our digitized legacy collection presented in this form fall into the comfort level of our users.
III. Introduction of New LAUC-B Members
Susan Koskinen welcomes Heather Thams, Elliott Smith, Josh Schneider, Daniel Hensley, and Michele Morgan. She also bid a fond farewell to retiring librarian Hisayuki Ishimatsu.
IV. LAUC: Update & call for agenda items or concerns for the LAUC Spring
Assembly in Santa Barbara, March 10-11, 2011
- Four UC Berkeley delegates will attend the LAUC Statewide Assembly in the spring.
- Distinguished Status for librarians: ULs charging a group to review distinguished status and produce a report by January – a short time line. UC Berkeley’s Susan Wong and, perhaps, Lynn Jones will be members of the group.
V. LAUC-B Committee Reports
A. CAPA Report by Chair Lynn Jones
- Lynn Jones thanks the members of CAPA for great work over the past year.
Changes in the membership of CAPA include Rita Evans taking over for Paul
Atwood, Kathryn Neal and Jennifer Nelson joining, and Lynn Jones and Nick
Robinson completing their service.
- The full CAPA report is available at
http://lib.berkeley.edu/LAUC/minutes/new_index.html. Highlights of their
work include 14 candidate interviews for five jobs and completion of 55-60
reviews. CAPA held their fall peer review workshop on October 26th and have
plenty of updated information on peer review on the CAPA webpage on LAUCB
B. Committee on Diversity Report by Chair Debbie Jan
- Debbie Jan focused on the committee’s work during the coming year, which
includes a job shadowing program (for library staff), a conference shadowing
program (for library staff), and a library staff mentoring program. Volunteers are
needed for all three programs.
- Tom Leonard added that he hopes to announce in the next few months a new
Diversity Fellow position. The Fellow will be based in the Bancroft Library and
work on a collection they hope to acquire. A donor may provide funding.
VI. Report from the Chair, Susan Koskinen
- S. Koskinen introduced the current members of ExComm and welcomed Brian Quigley as
incoming CAPA chair.
- New LAUCB website! S. Koskinen thanked Harrison Dekker and Julie Lefevre as well as Corliss Lee and Margaret Phillips for their efforts. The website has a lot of new information and a calendar of events will soon be available too.
- S. Koskinen asked LAUC-B members to say yes to any requests from Nominations
Committee to run or be appointed to positions within LAUC-B and LAUC Statewide.
- Congratulations to our newest distinguished librarians, Barclay Ogden and Gary Handman. The Distinguished Librarians’ Reception will be held on December 1st at 4:00-6:00 p.m. in the Morrison Library.
- Fiat Flux, the 2011 LAUC-B Conference: The committee has submitted a request for proposals and LAUC money available for those who wish to present.
VII. Questions and Other Announcements
VIII. Guest Speaker, Brewster Kahle, "Digital Librarians, Digital Libraries"
The last time that Brewster Kahle was in the Morrison Library, he was launching the Internet
Archive’s Wayback Machine. He thanked UC Berkeley for taking a risk in supporting the Wayback
Machine from the start.
Rather than discuss the future of all things online, Kahle emphasized that we are living in the future
now. Kahle presented what the Internet Archive has that could be of use to librarians and asks what
librarians need out of digital libraries to do their jobs.
The Internet Archive is a non-profit located in the Richmond District of San Francisco:
http://www.archive.org/about/about.php. The Internet Archive employs about 300 people,
many of whom are scanners. Funding comes from government contracts, foundation grants, and
keeping the money they receive by being frugal.
The Internet Archive works with a variety of open access projects. They have different collection
types: free texts (public domain, “the Linux to Google’s Microsoft”), modern texts for the blind or
dyslexic, and a modern lending library. The Internet Archive is no longer scanning at NRLF or
SRLF but is still scanning items from the Bancroft Library and UCLA. In the last five years, the
Internet Archive has worked to build a public domain library.
Lending Library (http://www.archive.org/post/312815/digital-lending-library): But what
about in-print and out-of-print materials still in copyright? Libraries are worried because of the
legal risks involved. The Internet Archive has made available works still in copyright for the blind
and/or dyslexic! Google did not do this. A new project for is the Internet Archive’s Lending
Library, which digitally lends books. The process is managed using Adobe Digital Editions – users
can download a book for two weeks, one copy at a time. “Own to loan!” The Lending Library
currently holds up to 10,000 books and they want to add to this number, especially for out-of-print
Scanning (http://www.archive.org/scanning): The Internet Archive designed its own book
scanner, the Scribe. Scanning services are available. It is hard to start again when a project subsides,
so keep using it! Even when Microsoft bowed out, the Internet Archive continued scanning. They
digitize about 1000 books per day and about 200 microforms per day. It costs about $30 per book
and 10 cents per page. The Korean, Chinese, and Japanese governments are all funding digitizing
books with the Internet Archive and the Internet Archive is working with them to keep materials
Open Library (http://openlibrary.org/): Open Library is meant to be a Wikipedia of books, with
a webpage for every book ever published – an open catalog. Everything is open-source. They get
metadata from multiple sources, then weave the data together. They have about 200,000 users per
day, 100,000 edits per month, 1 million ebooks, and 23 million records. It is wiki-editable, so
please contribute. It is blind and dyslexic accessible and they are constantly trying to make it easier
to use, including on the devices where people want them: Kindles, iPads, etc. It costs about $1
million per year to keep the Open Library up-to-date. If libraries put 5% of their acquisitions
budgets into digitization and posting digital materials online, so much would be accessible to all.
Audio (http://www.archive.org/details/audio): Audio is very popular, especially music
collections, but also lectures, news, and audio books. The Internet Archive has 500,000 user
contributions and offers free hosting forever. Please contribute!
Moving images (http://www.archive.org/details/movies): Including archival films, current
news, lectures, etc. They currently have about 400,000 user contributions. The Prelinger Archives
(http://www.archive.org/details/prelinger), a collection available through the Internet Archive,
has been reconverted six times since it was acquired to keep up with technological formats.
Reprocessing takes about a month on clusters of computers. The Internet Archive is up for taking
on this roll, but they need more collections. They will cover all of the costs, so contribute lectures
and other moving image content.
Wayback Machine (http://www.archive.org/web/web.php): Every two months, a snapshot of
the web is archived. Many use it for retrieving lost sites. It is not yet searchable but Kahle would
like it to be. He is not quite sure yet how to make it a really useful tool for research around
particular subjects and is open to ideas.
Archive-it (http://archive-it.org/): Archive-it is a pay service allowing users to build custom,
curated collections. There are currently 1200 curated collections and over 2 million URLs. It is
searchable and browsable. Kahle says they are doing well at archiving and availability, but they do
not know how to make it a better service for researchers. Generally, the Internet Archive will start
with collecting and move forward and iterate from there. They are also collecting television
programs and they hope to make TV news collections searchable before the next US election cycle.
Digital Archive for Your Collections: The Internet Archive can be a backend infrastructure for
collections for organizations that do not yet have and cannot build it themselves. Collections could
be lectures, photos, books, web collections, etc. For example, a new federal court documents
collection is being built, and being populated by robots, as an open alternative to PACER.
You can help by sending books, lending ebooks, providing feedback, using the collections, and
coming to the Internet Archive open lunch on Fridays. The Internet Archive is buying property in
Richmond, CA for climate controlled space to hold physical materials. You can donate your own
books or library withdrawn books to the Internet Archive. We are all digital librarians now. We
need to work together to build this world.
- The Internet Archive is trying to get publishers to allow books in web browsers, which are
more open than apps for iPads or closed reader platforms like Kindles.
- There is no metasearch for Internet Archive materials other than use of large search
engines. The Internet Archive exposes data in all sorts of ways so that the search engines
can find it. Kahle suggest adding “open library” or “archive” to your search, which should
push Internet Archive resources higher in search results.
- What about image collections? The Internet Archive has worked with NASA on NASA
Images (http://nasaimages.org/) using Luna imaging software. They are still novices with
images, but are open to projects, so contact Kahle if you have a project in mind.
- Tom Leonard on the HathiTrust: 24% of its materials are in the public domain, it has new
search capacity, and is providing access to them. What is bad about it? Kahle’s response: If
it is all we can muster, okay, but researchers are using texts in a different ways (including
text mining) now and HathiTrust is too restricted in how materials can be used (e.g., only
one item at a time). Kahle sees it as locking up the public domain. Corporate licensing
restrictions are highly troubling. Creating a lots of copies allows us to push the law in ways
that make sense for our society from a number of different points. If there is only one
access point, it is easy to lock it up and restrict it.
- The Internet Archive’s text collections are currently hot research corpora for computer
scientists, so associated interface and infrastructural technologies are moving forward. For
now. New technology can help with problems such as OCR on old typefaces.