AI and Machine Learning

The challenges of artificial intelligence in libraries

By Jason Griffey | March 1, 2019

Artificial intelligence (AI) and machine learning are everywhere, giving driving directions and identifying objects in photographs. They are so engrained in our technology that often people don’t realize what they’re experiencing is a machine learning system. Everyone with a smartphone has an AI system that uses machine learning.

For example, Google’s Android operating system records, measures, and collects information and sends that data to servers. These servers use billions of data points collected from tens of millions of users as input for their machine learning systems. When you ask an Android phone to show you pictures from the beach, a complex set of data moves back and forth between your phone and Google’s servers, comparing your photos to the billions in its data set. The search results include pictures that the AI decided were most likely to be related.

Since Google has billions of photos to assess and millions of people helping it train its AI, the decisions that the AI makes are generally good. But AI is only as effective as its training data and the weighting given to the system as it learns to make decisions. If the data is biased, contains bad examples of decision making, or is simply collected in such a way that it doesn’t represent the full problem set, the system will produce broken, unrepresentative, or bad outputs.

For data privacy and security concerns, localized machine learning has an advantage.

Apple, on the other hand, has chosen to model its AI and machine learning by analyzing and weighting your data locally on the iOS devices themselves. Your devices use the same machine learning algorithms to include your photos in Apple’s preset weights, but they aren’t pushed to Apple’s servers. Because each data set is analyzed locally, there is no shared decision making as there is with Google. Each device must do heavy lifting itself, rather than rely on remote servers for the bulk of the work.

For data privacy and security concerns, localized machine learning has an advantage. If you don’t need to send photos and data back and forth from server to client, and if providers don’t need to store and host data, the data’s vulnerability to attack is greatly reduced.

The examples above focus on object and image recognition in photos by a machine learning system. This is only one of dozens of uses for AI and machine learning systems.

It’s also easy to see how an AI system is useful for libraries and archives in creating metadata from digitization projects. AI systems can be trained to recognize locations from a single photograph—including where the photographer was standing—based on angle, geography, and other factors. These systems can be enormously useful in making the processing and cataloging of archives and collections more discoverable.

As more libraries and library vendors move into developing AI and machine learning systems, we should be sensitive to the privacy implications of collecting and storing the data that’s needed to train and update those systems. As with existing systems where we outsource data collection and retention to vendors, libraries need to be aware of the mechanisms by which that data is protected and how it may be shared with others through training sets. Where libraries can provide local analysis in the style of Apple and iOS, they should.

The opportunities associated with new machine learning systems to reform large portions of library activities will be rich and varied. While it will be some time before AI will conduct full conversations or reference interviews with students and patrons, the use of AI as an increasingly powerful lever inside other systems will progress quickly over the next three to five years. Libraries can watch these systems as they develop, work with vendors, and create their own services and systems so that our values and ethics are baked into the technology at the outset.

JASON GRIFFEY is a librarian and technologist and the founder and principal at Evenly Distributed. Adapted from “Artificial Intelligence and Machine Learning in Libraries,” Library Technology Reports vol. 55, no. 1 (Jan. 2019).

Tagged Under

artificial intelligence

Penn State University student Luz Sanchez Tejada uses the school's microcredentialing platform in Pattee Library to earn badges as part of her peer research consultant training. Photo: Steve Tressler

The Making of a Microcredential

Penn State University Libraries evaluates badge steps with help from artificial intelligence

Bohyun Kim, chief technology officer at the University of Rhode Island Libraries in Kingston

An AI Lab in a Library

Why artificial intelligence matters

Latest Library Links

14h

Jackie Jennings writes: “It feels like the debate over whether #BookTok is bad has been raging since the moment the term was first coined. I’m starting off with a strong stance: BookTok is indeed bad. However, the problem with BookTok is not crappy books or bogus influencers. The problem with BookTok is TikTok itself. BookTok isn’t actually a community driven by fans, writers, influencers, or even publishers: it’s part of a social media corporation, controlled by the most mysterious, fickle god of all, the algorithm.” Not surprisingly, librarian recommendations can overcome some of BookTok’s limitations.

Jezebel, Apr. 18; Book Riot, Apr. 22
20h

ALA announced the launch of its state Intellectual Freedom Helpline grant program April 22. Over the next two years, 10 pilot program sites will operate a confidential reporting system that will help connect those experiencing censorship attempts with professional support, in-state peers, or referral to ALA’s Office for Intellectual Freedom, as appropriate. State or school library associations or agencies wishing to either establish an Intellectual Freedom Helpline in their state or expand existing efforts may apply for $10,000 grants through July 14.

ALA Office for Intellectual Freedom, Apr. 22
21h

In celebration of the release of his latest nonfiction title, The Secret Lives of Booksellers & Librarians, bestselling author James Patterson is honoring select American Bookseller Association and American Library Association members with bonuses. He announced plans April 11 to give $200 each to 250 library workers across the country. The deadline for ALA members may nominate members to receive bonuses through April 30. Winners will be announced at ALA’s 2024 Annual Conference in San Diego.
1d

Catherine Hollerbach writes: “In early 2020, when the world shut down for COVID, many people got interested in houseplants. Anne Arundel County (Md.) Public Library’s Crofton Library embraced this trend and then some!” While preparing to reopen after the COVID shutdown, the library installed plants at the information desk to discourage patrons from sticking their heads through gaps in newly installed acrylic shields. They were well received and cared for, and the library gradually added more plants and built educational tools, programming, and partnerships around the plants.

Public Libraries Online, Apr. 18
2d

Rodney Freeman writes: “I am proud to be a librarian—and rare. Less than 7% of librarians in the US are Black. Libraries symbolize the literacy that was denied to so many of our ancestors. For our enslaved forebears, something as fundamental as learning to read was illegal and dangerous, but they did it anyway. Separate but ‘equal’ schools and ‘colored’ libraries filled with cast-offs from white libraries were key features of the Jim Crow era. Today we are seeing the same impulse to distort access to information into a tool to suppress and control, and to make some people ‘other.’”

Newsweek, Apr. 16
2d

Lori Birrell writes: “Staff want to feel valued, and they want their work to have an impact. A reorganization process can help leaders to surface such areas of impact and give staff a feeling of empowerment and value. Practitioners considering any sized reorganization are strongly encouraged to consider what models and resources will best support them as they plan and lead this work. Regardless of the model or resources, any reorganization process should be more than just moving boxes and reporting lines around on an organizational chart.”

Library Leadership and Management, Apr. 15
3d

Megan Bennett writes: “Two decades ago, while Daily Show Senior Correspondent Dulcé Sloan was doing summer shows at a community theater in Quakertown, Pennsylvania, the library was her main hangout spot. In the small town of 9,000 people, it was a place to gather with other young actors—and the only place with internet access. American Libraries spoke with Sloan before her closing session at the Public Library Association 2024 Conference in Columbus, Ohio, about her new book, her journey in stand-up comedy, and her memories of libraries.”

American Libraries Trend, Apr. 17