Seattle's Office of the City Clerk announced this week that a duplicate-image problem inside the city's digital archive system has required the manual review of more than 14,000 photograph files, a bottleneck that has delayed public access to records dating back to the 1970s. Staff confirmed the issue on Wednesday, July 1, after a routine audit flagged repeated image entries across multiple collections.
The timing matters. The city launched its expanded digitization initiative in January 2026, promising residents and researchers searchable access to municipal photo archives by late summer. That deadline is now in jeopardy. The Clerk's office has been migrating records from aging physical storage at the Seattle Municipal Archives, located in the basement of City Hall at 600 Fourth Avenue, into a cloud-based content management platform. Duplicate entries — some images appearing three or four times under different file names — crept in when batch uploads from multiple departments were merged without a deduplication step.
The problem is not unique to Seattle, but it carries particular weight here given the volume of material at stake. The Seattle Municipal Archives holds roughly 2,000 linear feet of photographic records, spanning neighborhood construction projects in Capitol Hill and the Central District, old Alaskan Way Viaduct documentation, and decades of Seattle City Light infrastructure photographs. Researchers at the University of Washington's Special Collections division, which has collaborated informally with the Clerk's office on several digitization grants, have been among those waiting on expanded access.
What Went Wrong and Who's Fixing It
The root cause, according to internal documentation reviewed this week, was a misconfigured import script used during the February and March upload cycles. When the Clerk's office ingested records from Seattle Public Utilities and the Department of Neighborhoods simultaneously, the script did not check for existing file hashes before writing new entries. The result was a bloated database with redundant entries consuming an estimated 340 gigabytes of unnecessary storage.
The Clerk's office has brought in a vendor — Preservica, a digital preservation software company that already holds a contract with the city — to run an automated deduplication pass. That process began June 30 and is expected to take roughly three weeks. Staff are also conducting spot-check manual reviews on images flagged by the algorithm as potential false positives, meaning images that look nearly identical but are technically distinct, such as sequential frames from the same shoot.
The Seattle Public Library's Digital Collections team, which manages a parallel but separate archive of neighborhood photographs at its Central Library branch on Fourth Avenue and Madison Street, said this week it was not affected by the same issue. Its collections use a different content management system and a stricter upload protocol that requires manual tagging before any batch import clears.
What Researchers and the Public Should Expect
For anyone who has submitted a public records request involving photographs from the 2026 digitization batch, the Clerk's office says responses may be delayed by up to six weeks beyond the standard 10-business-day window. That extension applies specifically to requests filed after January 15, 2026 — the date the new system went live. Requests tied to older, already-catalogued records should not be affected.
The broader digitization project carries a budget allocation of $1.2 million through the end of fiscal year 2026, approved by the Seattle City Council as part of its records modernization package last fall. City officials have said the deduplication issue will not require additional appropriations, since the Preservica contract already covers remediation work of this scope.
Researchers with active projects should contact the Seattle Municipal Archives directly at its City Hall location to ask which specific collections have cleared deduplication and are available for immediate review. Walk-in access to the physical archive, open Tuesdays through Thursdays, remains unaffected. The Clerk's office has also said it will post weekly status updates on the city's open-data portal beginning July 7, giving the public a running account of how many records have passed quality review and entered the searchable database.