Web-Scale Discovery

December 22, 2010

Connecting users with the information they seek is one of the central pillars of our profession. Web-scale discovery services for libraries are those services capable of searching quickly and seamlessly across a vast range of local and remote preharvested and indexed content, providing relevancy-ranked results in an intuitive interface expected by today’s information seekers. First debuting in late 2007, these rapidly evolving tools are more important today than ever to understand.

Web-scale discovery services for the library environment are an evolution holding great potential to easily connect researchers with the library’s vast information repository, whether physical holdings, such as books and DVDs; local electronic content, such as digital image collections and institutional repository materials; or remotely hosted content purchased or licensed by the library, such as e-books and publisher or aggregator content for thousands of full-text and abstracting and indexing resources. For our purposes, web-scale discovery can be considered a service capable of searching across a vast range of preharvested and indexed content quickly and seamlessly. They provide discovery and delivery services that often have the following traits:

Content harvested from local and remotely hosted repositories to create a vastly comprehensive centralized index—to the article level—based on a normalized schema across content types, well suited for rapid search and retrieval of results ranked by relevancy. Content is enabled through the harvesting of local library resources, combined with brokered agreements with publishers and aggregators allowing access to their metadata or full-text content for indexing purposes.
Discovery provided by a single search box providing a Google-like search experience (as well as advanced searching capabilities).
Delivery of quick results ranked by relevancy in a modern interface offering functionality and design cues intuitive to and expected by today’s users, such as faceted navigation to drill down to more specific results.
Flexibility agnostic to underlying systems, whether hosted by the library or hosted remotely by content providers. These services are open compared to traditional library systems and allow a library greater latitude to customize the services and make them its own.

Why Web-Scale Discovery?

As illustrated by research from as far back as the 1990s, if not earlier, to as recent as 2010, library discovery systems within the networked online environment have evolved, yet continue to struggle to serve users. As a result, the library, or systems supported and maintained by the library, is often not the first stop for research—or worse, not a stop at all. Users have defected, and research continues to illustrate this fact.

Other factors, apart from user behavior and preferences, also give reasons for libraries to use web-scale discovery services. First, and most obvious, is that if something is not discovered, it has no chance of being used. Whether a librarian conducts a reference interview, a user browses the shelves, a friend provides word-of-mouth, a user searches in Google or a library database, or a user scans issues and article titles in an electronic journal, discovery must happen, either by focused intent or serendipitously. Libraries often spend tremendous amounts of money every year to purchase or pay for access to an ever-growing body of electronic content, and the cost for access to this content often increases on an annualized basis. But for the content to be used, it must be discoverable—and for today’s users, easily discoverable.

Jason Vaughan is the director of library technologies at the University of Nevada at Las Vegas. This is an excerpt from the January 2011 ALA TechSource on web-scale discovery.

Effective outreach services permit readers to voyage beyond their limitations

Advisory Beyond Books

Latest Library Links

5h

Michael Kan writes: “Are you still using ‘chocolate,’ ‘naruto,’ or ‘monkey’ for your passwords? You really need to stop. All three are among the most commonly used passwords, putting your accounts at risk of hacking, according to new data from NordPass. The password manager’s sixth annual list of the top 200 most commonly used passwords is pulled from a 2.5TB dataset of stolen logins taken from various sources.” ZDNet offers tips for creating stronger passwords and improved security measures.

PC Mag, Nov. 14; NordPass, Nov. 13; ZDNet, Nov. 14
10h

John Sharp writes: “Athens and Fairhope are two fast-growing cities in the fastest-growing areas of Alabama, and there is no mistaking their conservative bona fides. But according to early statistics about library card applications, their libraries appear to be bucking a trend embraced by conservative-leaning groups and the Alabama GOP: Their patrons appear to be trusting the libraries. More than 60% of parents at Fairhope Public Library and Athens-Limestone County Public Library have signed off on all-access passes for their children with no restrictions on library usage.”

AL.com, Nov. 17
1d

Katie Gaddini writes: “Book bans may have mushroomed in the Trump era of reactionary politics, but they have a well-established history in America. One woman in particular, Norma Gabler, redefined the current strategy and logic behind modern book bans. Called ‘education’s public enemy number one,’ by critics in 1980, Gabler led the crusade against the so-called secular trend in school textbooks throughout the 1960s, 1970s, and 1980s. Even though Norma and her husband Mel worked together, Norma was the public face of their efforts for decades.”

Time: Made By History, Nov. 13
1d

Many members know Bill Ladewski as executive director of the Reference and User Services Association, an ALA position he has held since 2019. In early September, Ladewski added another title to his CV: director of Member Relations and Services. In this role, he and his team are charged with providing service and information to the Association’s nearly 50,000 members. Ladewski answered our 11 Questions to reintroduce himself to the ALA community.

AL: The Scoop, Nov. 19
2d

John Warner writes: “I understand what people mean when they invoke the term ‘institutional neutrality,’ but I don’t know how it’s workable in today’s world. Higher education institutions are built upon a foundation of actual values, values that are, by definition, not neutral. The Kalven report, the Rosetta Stone of institutional neutrality produced by a faculty committee at the University of Chicago in 1967, is not a call to make all work emanating from an institution ‘neutral,’ but is instead a call to make the atmosphere for scholarly inquiry and debate as free as possible.”

Inside Higher Ed, Nov. 15
2d

Angela Hursh writes: “Ever wondered how your library’s email performance compares to others? Benchmarks help you understand how well your emails perform in key areas and identify opportunities for improvement. They also allow you to compare your email marketing performance, set goals, and stay on top of trends. However, the lack of industry benchmarks for email marketing metrics specific to libraries has been bugging me. Metrics from similar industries don’t fully capture the unique aspects of promoting a library. To help libraries accurately measure their email effectiveness, we’ve created the first-ever library email benchmark report.”

NoveList, Oct. 22
3d

Rachel Hendrick writes: “Students and library vendors are pushing artificial intelligence (AI) adoption in higher education, but there are very few resources that help librarians and scholars separate the wheat from the chaff. Luckily, we here at Choice love a good assessment rubric. (In fact, we made a PDF of our assessment rubric.) Even the least tech-savvy of us can use a very simple AI literacy framework to think critically about whether an AI application is worth your time.”

Choice 360 LibTech Insights, Nov. 13