The Daily Perth

Perth news, every day

News

Perth's Duplicate Image Problem: The Numbers Hiding in Plain Sight

Western Australian government agencies, real estate platforms and media organisations are sitting on databases riddled with duplicate images — and the scale of the problem is finally being measured.

By Perth News Desk · Published 5 July 2026, 5:41 am

3 min read

UpdatedUpdated 5 July 2026, 1:36 pm

#News
Perth's Duplicate Image Problem: The Numbers Hiding in Plain Sight
Photo: Photo by David on Pexels

Advertisement

Perth's digital infrastructure has a clutter problem. Across WA state government servers, real estate listing platforms and local media archives, duplicate image files are consuming measurable storage capacity — and a wave of audits conducted through the first half of 2026 is beginning to put hard figures on what had previously been an invisible cost.

The timing matters. With the WA government running back-to-back budget surpluses and funnelling capital into Metronet expansion and the Indian Ocean Strategy, agencies have been under pressure to demonstrate digital efficiency before requesting fresh IT allocations. Duplicate image bloat has emerged as a surprisingly concrete target — one that auditors can quantify without political sensitivity.

What the Data Actually Shows

Industry benchmarks from global digital asset management research suggest that between 20 and 30 per cent of images stored in large institutional repositories are functional duplicates — files that are identical or near-identical but stored under different filenames, in different folders, or uploaded by different staff members at different times. Apply that range to any organisation maintaining thousands of records and the redundancy adds up fast.

Advertisement

In Perth's property sector, the problem is especially visible. Realestate.com.au and Domain listings for suburbs including Cottesloe, Subiaco and the rapidly densifying corridor along the Forrestfield–Airport Link route routinely feature the same exterior shot of a dwelling uploaded two or three times within a single listing — sometimes at slightly different resolutions. Property data firms tracking image metadata in WA listings have flagged this as a quality issue that affects search indexing and load times, particularly on mobile connections.

The City of Perth, which maintains digital archives for planning applications, heritage documentation and event photography, launched an internal review of its image asset library in March 2026. The review — confirmed on the council's public project register — is part of a broader data governance push that also covers document duplication across the council's customer service portal on Murray Street.

Landgate, the WA government's land information authority based in Midland, manages one of the state's most image-intensive public databases, holding aerial photography, cadastral maps and satellite imagery going back decades. Duplication in geospatial image archives is a known cost driver: storing two versions of a high-resolution aerial tile at 50 megabytes each across tens of thousands of tiles produces measurable redundancy measured in terabytes, not gigabytes.

Why Deduplication Is Now a Budget Line, Not Just an IT Footnote

Cloud storage is no longer cheap enough to ignore. AWS S3 standard storage — the infrastructure underpinning many WA government cloud migrations — is priced in Australian dollars at rates that, when multiplied across terabyte-scale duplicates, translate to recurring annual costs that show up in departmental IT budgets. The WA Department of Finance's whole-of-government cloud framework, updated in late 2025, explicitly identifies redundant data storage as a target for rationalisation in the 2026–27 financial year.

Universities are also in the frame. Curtin University's sprawling Bentley campus and the University of Western Australia on Crawley Bay both operate large research data repositories. Image duplication in research datasets — where the same microscopy image or field photograph gets uploaded by multiple team members across collaborative projects — can compromise data integrity as well as storage efficiency. Both institutions have active research data management policies, though neither has publicly released deduplication audit findings.

For businesses and agencies looking to act, the practical path is well-established. Perceptual hashing tools — software that generates a compact fingerprint for each image and flags near-matches, not just exact copies — are the standard first step. Running a deduplification pass before migrating to a new content management system, rather than after, consistently produces better outcomes: migration projects that skip the step typically discover the problem only when storage invoices arrive.

For Perth organisations expecting to scale digital content through the second half of 2026 — whether driven by AUKUS contract documentation requirements at Henderson or the sheer volume of property imagery generated by WA's still-heated housing market — the message from the numbers is straightforward. Measure what you have before you pay to move it somewhere new.

Advertisement

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Perth

This article was produced by the The Daily Perth editorial desk and covers news in Perth. See our editorial standards for how we use AI.

Stay in the loop

Enjoyed this story? Get tomorrow's briefing free.

Daily brief

Enjoyed this? Wake up to Perth news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Perth and accept our Privacy Policy. Unsubscribe anytime.

The Daily Network — local news across Australia

More local news across Australia