Skip to main content
The Daily Seattle

All of Seattle, every day

News

How Seattle's Digital Archives Ended Up Full of the Same Photos — And Why It Took Years to Notice

The city's push to digitize decades of public records created a sprawling duplicate-image problem that now dogs planners, archivists, and neighborhood groups across Seattle.

Share

By Seattle News Desk · Published 4 July 2026, 11:51 am

4 min read

Updated 4 h ago· 4 July 2026, 7:53 pm

How we reported this

This article was generated by AI from the linked public sources. The Daily Seattle is independently owned and covers Seattle news free from advertiser or sponsor influence. Read our editorial standards →

How Seattle's Digital Archives Ended Up Full of the Same Photos — And Why It Took Years to Notice
Photo: United States. Army. Corps of Engineers. Seattle District / Public domain (Wikimedia Commons)

Seattle's public image archives contain hundreds of thousands of duplicate photographs — some files copied three, four, or five times across different city databases — and municipal staff have spent the better part of three years trying to untangle the mess. The problem traces back to a digitization sprint that began in earnest around 2019, when the Seattle Municipal Archives on Third Avenue launched an aggressive push to move paper records, photographic prints, and analog slides into shared digital systems before older storage formats degraded beyond recovery.

The stakes are real. City planners at the Seattle Department of Construction and Inspections routinely pull historical images when evaluating permit applications in dense neighborhoods like Capitol Hill and the Central District. When the same photograph appears under four different file names and catalog numbers, it inflates apparent coverage, buries gaps in the actual visual record, and wastes staff time that could go toward substantive review work.

How the Duplication Grew

The root cause was not carelessness. Three separate digitization contracts ran simultaneously between 2019 and 2022, covering materials held at the Municipal Archives, the Seattle Public Library's Special Collections on Fourth Avenue, and the Office of Housing's own project documentation files. Each contractor used different metadata schemas and file-naming conventions. When the resulting batches were merged into the city's shared content management system, there was no automated deduplication layer in place to catch overlapping images.

A 2023 internal audit — details of which were reported at the time by the Seattle Times — found that roughly 18 percent of images in one city database had at least one exact or near-exact duplicate elsewhere in the same system. For photographs related to Yesler Terrace redevelopment, a project that generated voluminous documentation between 2013 and 2021, the duplication rate in some folders ran even higher. The audit did not produce an immediate fix. Budget negotiations for the 2024-2025 biennium delayed procurement of the deduplication software tools that city technology staff had recommended.

Neighborhood preservation groups felt the downstream effects first. The Historic Seattle nonprofit, which works on projects from Pioneer Square to Rainier Valley, found that image searches for properties in the Chinatown-International District frequently returned the same archival photograph multiple times under different accession numbers, making it harder to determine whether genuinely distinct visual documentation existed for a given building or block face.

The Slow Path to a Fix

Seattle's Office of the City Clerk issued updated digital asset management guidelines in March 2025, requiring that any new batch of images uploaded to shared city systems pass through a perceptual-hash comparison tool before ingestion. That requirement applies to new uploads — it does not retroactively clean the existing backlog, which city archivists estimate contains somewhere between 40,000 and 60,000 duplicate image files, though that figure has not been independently verified and staff acknowledge the true count could be higher.

The Seattle Public Library's Special Collections team began its own deduplication project in January 2026, working through the Washington Photographs Collection in order of research demand. That collection includes images dating to the 1880s. Librarians have prioritized photographs of the waterfront and downtown core, given how frequently those images appear in academic requests and development review processes tied to the ongoing Elliott Bay waterfront redevelopment work.

For residents and researchers dealing with the archives today, the practical advice from city staff is straightforward: cross-reference any image pulled from the Seattle Municipal Archives portal against the library's Digital Collections site before treating a single result as definitive. The two systems have not yet been reconciled, meaning a photograph may appear to be absent from one while existing under a different identifier in the other. City technology staff have said a unified search interface is a medium-term goal, though no contract has been awarded and no launch date has been set as of this week. Until that integration happens, the duplicate problem remains a daily friction point for anyone who depends on Seattle's visual public record.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily Seattle

Covering news in Seattle. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to Seattle news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Seattle and accept our Privacy Policy. Unsubscribe anytime.