Seattle's public institutions are sitting on a growing problem that has quietly compounded for more than a decade: thousands of duplicate digital images scattered across city servers, archival databases, and departmental hard drives, with no unified policy to resolve them. The question now isn't whether to act — storage costs and contract timelines are making that decision for administrators — but how, and who decides what gets kept.
The pressure point is real. The City of Seattle's contract with its primary cloud storage vendor is up for renewal in early 2027, and technology staff across multiple departments have flagged the duplication issue as a budget liability heading into that renegotiation. Redundant image files, particularly high-resolution scans from planning documents, park surveys, and public event photography, are consuming storage that the city is paying for at commercial rates.
What's at Stake in the Archive
The Seattle Municipal Archives, based at 600 Fourth Avenue in the Seattle Municipal Tower, holds decades of photographic records that were digitized in waves — first in the early 2000s, again after 2012, and again as departments transitioned to new content management systems around 2019. Each migration left behind copies. According to city technology staff who presented the issue to the Seattle City Council's Finance and Housing Committee earlier this year, the duplication rate across some departmental image libraries runs as high as 40 percent in older collections.
The Seattle Public Library's digital collections program faces a parallel challenge. The Library's digitization partnership with the University of Washington — through the Pacific Northwest Regional Digitization Initiative — has produced overlapping scans of historical photographs donated by community members over the years, particularly images from the Central District and the International District. Some of those images now exist in three or four separate repositories, each catalogued slightly differently, creating confusion for researchers and driving up long-term storage costs.
Beyond cost, the duplication problem carries a real cultural risk. When institutions move to deduplicate files without a clear review protocol, original metadata — provenance, copyright status, donor information — can be stripped or lost. That's the scenario archivists at the Washington State Historical Society in Tacoma spent two years recovering from after a 2021 server migration went wrong. Seattle's institutions are watching that case closely as a cautionary example.
The Decisions That Can't Wait
Three choices are converging at once. First, city technology staff must decide by the end of 2026 whether to invest in automated deduplication software — tools that can identify identical or near-identical image files and flag them for human review — or continue manual audits, which are slower and more expensive in staff hours. Automated tools from vendors currently under evaluation can process archival image libraries at scale, but they require upfront licensing costs estimated in the low six figures for a deployment the size of Seattle's municipal system.
Second, the Seattle Office of Arts and Culture, which administers the city's public art registry and its associated image database on Western Avenue, must align its own deduplication timeline with the broader city process or risk being left out of any centralized solution. The office manages thousands of images of public artworks, including murals in Beacon Hill and sculptures along the waterfront, and has its own metadata standards that don't always match the Municipal Archives.
Third, and most politically complicated, is the question of governance. Who has final authority to delete a file from the public record? The city attorney's office has been asked to clarify whether certain image records carry retention obligations under Washington State's public records law — a question without a clean answer when the same image exists in six places simultaneously.
The Finance and Housing Committee is expected to take up a formal proposal for a deduplication policy framework this fall, with a target adoption date of January 2027. Community archivists and digital preservation advocates have already submitted public comment urging the city to build in mandatory human review before any deletion, and to publish a public-facing inventory of what's removed. That last demand — transparency about what disappears from the public record — may prove to be the hardest negotiation of all.