The Daily Perth

Perth news, every day

News

The Numbers Behind Perth's Duplicate Image Problem: What the Data Reveals

A surge in digital asset mismanagement across Western Australia's public and private sectors is exposing just how costly untracked duplicate images have become.

By Perth News Desk · Published 5 July 2026, 4:47 am

3 min read

UpdatedUpdated 5 July 2026, 12:17 pm

#News

Advertisement

Tens of thousands of duplicate image files are quietly draining storage budgets and slowing workflows across Perth's government agencies, real estate platforms, and media organisations — and the numbers are only now coming into clear focus. An audit culture prompted by the WA Labor government's scrutiny of digital infrastructure spending has pushed the issue from IT back-offices into the broader conversation about how the state spends its surplus.

The timing matters. Western Australia recorded a state budget surplus that has directed fresh capital toward digital transformation projects, including Metronet's passenger information systems and the Department of Communities' housing database, both of which rely on large image libraries. When those libraries bloat with duplicates, the downstream costs compound fast — redundant cloud storage, slower retrieval times, and staff hours burned on manual deduplication.

What the Data Actually Shows

Industry benchmarks from digital asset management providers suggest that in large organisations, duplicate files routinely account for between 20 and 40 per cent of total image storage volume. For a government department running a 50-terabyte image repository — not an unusual figure for a body managing social housing stock across Perth's northern corridor — that translates to between 10 and 20 terabytes of redundant data sitting on servers and accruing cloud costs month after month.

Advertisement

Perth's real estate sector offers a sharper local illustration. On Scarborough Beach Road and across the Inner North corridor, property management companies listing rental stock on platforms like REIWA.com frequently upload the same photographs multiple times across different listing cycles. Each duplicate entry adds to indexing load and search latency. REIWA, whose database covers listings from Fremantle to Joondalup, has not publicly disclosed the scale of its deduplication overhead, but the problem is endemic across the industry nationally.

The Metronet project provides another data point. As the rail expansion pushes into Ellenbrook and Morley, communications teams managing construction progress imagery for public reporting face the same structural problem: multiple photographers, multiple upload portals, no centralised hash-matching to catch identical files before they enter the system. The result is archive bloat that compounds with every new station milestone.

The Cost of Doing Nothing

Cloud storage pricing provides a concrete way to measure the neglect. Standard AWS S3 storage in the Asia Pacific (Sydney) region — the closest commercial node to Perth infrastructure — sits at approximately USD $0.025 per gigabyte per month as of mid-2026. A 15-terabyte duplicate overhang, modest by large-agency standards, costs roughly AUD $600 a month at current exchange rates, or more than $7,000 a year, for data that delivers zero operational value.

The City of Perth, which manages image assets across its cultural venues on James Street and its planning portal on Barrack Street, is among the local government bodies that have flagged digital asset governance as a priority in their technology roadmaps — though specific deduplication spend figures have not been publicly released.

Automated deduplication tools using perceptual hashing — a technique that identifies visually identical images even when file names and metadata differ — have dropped sharply in price. Enterprise-grade solutions from vendors including ImageKit and Cloudinary now offer deduplication processing at a fraction of what comparable tools cost five years ago, making the business case straightforward for any organisation running more than a few hundred gigabytes of image data.

For Perth organisations still managing duplicates manually or not at all, the practical path forward starts with a storage audit. IT teams should establish a baseline count of image assets, run a perceptual hash check across the full library, and identify the top five directories by duplicate density before committing to any vendor solution. Given WA's budget position and the volume of digital infrastructure investment now flowing through agencies tied to AUKUS contracts at HMAS Stirling and the Indian Ocean Strategy's logistics build-up, getting image data governance right is no longer a back-of-house problem — it is a line item that auditors are increasingly equipped to scrutinise.

Advertisement

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Perth

This article was produced by the The Daily Perth editorial desk and covers news in Perth. See our editorial standards for how we use AI.

Stay in the loop

Enjoyed this story? Get tomorrow's briefing free.

Daily brief

Enjoyed this? Wake up to Perth news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Perth and accept our Privacy Policy. Unsubscribe anytime.

The Daily Network — local news across Australia

More local news across Australia