Seattle's Office of City Clerk is sitting on a problem that has quietly grown for years: thousands of duplicate digital images embedded in the city's public records systems, the result of multiple scanning campaigns, overlapping vendor contracts, and inconsistent file-naming conventions that date back to the early 2000s. The question now is not whether to fix it, but how — and before the next budget cycle locks in priorities for 2027.
The timing matters because the city is midway through a broader digital modernisation push tied to its Open Data Program, administered through the Department of Information Technology on Fourth Avenue. That program set a 2026 deadline for departments to audit their own archival holdings. With that deadline now here, duplicates that were once a nuisance are becoming an administrative liability — slowing search times, inflating cloud storage costs, and creating legal uncertainty about which version of a document constitutes the official record.
Where the Backlog Came From
The problem is largely structural. When the city migrated from paper to digital records across departments between roughly 2004 and 2015, different offices used different scanning vendors with different metadata standards. The Seattle Municipal Archives, housed in the Seattle Municipal Tower on Fifth Avenue, ended up with files ingested through at least three separate systems that did not communicate with each other. A permit scan completed by the Department of Construction and Inspections might exist simultaneously in the Clerk's Laserfiche database, a SharePoint folder maintained by a project manager, and an older shared drive that was never fully decommissioned.
The Seattle Municipal Archives holds records spanning more than a century, covering everything from early-20th-century annexation maps to contemporary City Council legislation. Managing that volume digitally requires consistent deduplication protocols — something the city has not had in a standardised form. Community advocates working with the Capitol Hill Housing organisation, which regularly files public records requests related to affordable housing permits in the First Hill and Capitol Hill corridors, have noted in public comment sessions that search results sometimes return the same document multiple times, making it harder to verify whether a complete record set has been produced.
The Decisions That Will Define the Outcome
Three choices will shape what comes next. First, the city must decide whether deduplication work is handled in-house by existing IT staff or contracted out. A fully in-house approach would require redeploying personnel from the Department of Information Technology's enterprise applications team — staff who are currently committed to the city's permitting system upgrade, which is scheduled to go live in the first quarter of 2027. An outside contract carries its own risks, including the possibility that a vendor's automated tools flag legitimate document versions as duplicates and delete records that carry legal standing.
Second, city officials must agree on a threshold definition of what constitutes a duplicate. Two scans of the same physical page are obvious candidates. But what about a document that was corrected after its first scan? Or a file that was intentionally retained in two departments for independent audit purposes? The city's records retention schedule, last comprehensively updated in 2019, does not yet address this scenario clearly.
Third, and most consequentially for residents, is cost. Cloud storage rates for government-scale systems have climbed across the industry over the past three years. Eliminating even 20 percent of redundant files across the city's estimated holdings — a conservative projection based on comparable municipal audits in cities like Portland and Denver — could reduce annual storage expenditures meaningfully, though any specific savings figure would need to come from a formal audit the city has not yet published.
The City Clerk's office is expected to present options to the City Council's governance committee before the August recess. Community members who regularly use the Seattle Municipal Archives reading room at the Municipal Tower, or who file records requests through the online portal, can submit written comment to the Clerk's office ahead of that presentation. Whatever path the council endorses will set the template not just for image deduplication but for how Seattle manages the inevitable redundancies that come with any large-scale digital archive — a question that is not going away.