Seattle's Office of Arts & Culture and the city's Department of Neighborhoods confirmed this week that a systematic duplicate-image problem inside the city's public digital archive has grown large enough to trigger a formal remediation project, with staff beginning hands-on database corrections on July 1. The effort targets redundant, mislabeled, and low-resolution placeholder images that have accumulated across multiple city repositories over at least five years of inconsistent data entry practices.
The timing matters because Seattle is midway through digitizing roughly 80,000 physical records held at the Seattle Municipal Archives on Third Avenue, a project that runs through the end of fiscal year 2027. Injecting new scans into a database already cluttered with duplicates risks compounding the problem rather than solving it, which is why archivists pushed to front-load the cleanup work now, before the second half of this year's scanning batch is uploaded.
What the Audit Found
A preliminary internal review completed in late June identified more than 6,400 image records flagged as probable duplicates across two platforms — the city's legacy Content Management System and the newer OpenGov-linked public portal that launched in 2023. Many of the flagged records are tied to historic landmark files in neighborhoods including Capitol Hill, Pioneer Square, and the Chinatown-International District, where building documentation goes back decades and staff turnover has meant inconsistent file-naming conventions across different administrations.
The Department of Neighborhoods' Historic Preservation Program, which maintains the Seattle Landmarks Preservation Board's project files, has been particularly affected. Staff this week began working through a queue of duplicate image sets attached to landmark nomination documents — records that are publicly searchable and regularly accessed by property owners, developers, and researchers filing applications with the city. A single landmark case file had as many as three near-identical photographs of the same facade uploaded under different file names and dates, creating confusion about which version was the authoritative record.
The Seattle Public Library's digital collections unit, which shares some metadata standards with the city archive through a cooperative agreement signed in 2021, is also involved in the review. Librarians at the Central Library on Fourth Avenue have been cross-referencing image hashes — a technical fingerprinting method — against the municipal archive to identify records that were duplicated during a 2022 data migration that moved files off aging servers.
What Gets Fixed, and How Fast
City staff are not wiping duplicate records outright. Instead, archivists are merging duplicate entries, designating a primary image, and demoting redundant copies to a suppressed status that keeps them recoverable but removes them from public-facing search results. The distinction matters for legal and evidentiary reasons — some image records are attached to permits or code-enforcement cases where the file history must remain intact even if a photo appears twice.
The remediation is being done with a combination of automated detection scripts and manual review. Archivists estimate that roughly 40 percent of the flagged 6,400 records can be resolved through automated merging, while the remaining 60 percent require human judgment to determine which version of an image is correct or most complete. At current staffing levels — three full-time archivists assigned to the project — the manual review phase is expected to run through September.
For residents and researchers who regularly use the city's public archive portal, the practical effect this month is that some searches for historic building photographs or neighborhood survey images may return fewer results than usual as records are consolidated. Anyone who downloads city archive images for use in permit applications or preservation documents should confirm with the Department of Neighborhoods that the file they have is the current primary record, particularly for properties in landmark review zones like First Hill or Belltown.
The city has not announced additional funding for the project, which is being absorbed within existing archive operational budgets. A public progress report is expected to be posted to the Seattle Municipal Archives website by August 15, with a final remediation summary scheduled for the fourth quarter of 2026.