Best Way to Scan Receipts

How to scan receipts so they hold up at audit time. Phone camera technique, the thermal paper problem, OCR accuracy, and what changes when volume grows.

Inbox Ledger TeamInbox Ledger Team· 2026-04-24
Phone scanning a paper receipt on a desk with structured data appearing on a laptop screen beside it

Most people scan receipts wrong, and the error is not technical. The scan looks fine on the phone screen. The problem shows up six months later when the OCR software cannot read the total, or two years later when an auditor asks for the original and the file is a blurry JPEG with the bottom edge cropped.

Getting a scan right takes thirty seconds of attention in the moment. Getting it wrong costs hours of reconstruction later. This guide covers what actually matters: why scan quality has legal consequences, the specific camera technique that produces readable files, the thermal paper problem and the only real fix, and what changes about filing and retention once volume grows past what manual effort handles.

Why scan quality has legal consequences

A scanned receipt is only as good as its readability. That sounds obvious until you consider what tax authorities and auditors are actually checking.

The IRS has accepted digital and scanned records as primary documentation since Revenue Procedure 97-22, with a clear standard: the copy must be a complete and accurate reproduction of the original, legible, and stored in a system that does not allow undetected alteration. HMRC follows similar principles under its Making Tax Digital guidance on digital records, accepting digital images provided they are accurate and accessible. IRS Publication 583 covers the retention requirements: three years default, six years for material underreporting, indefinite for fraud.

What this means practically: a scan that clips the vendor name, loses the VAT line, or makes the total unreadable is not a legally acceptable substitute for the original. It is just a photo that looks like a receipt. The most common audit rejection for digital records is not the format. It is the completeness of the image.

There is also the downstream problem. A scan that a human can barely read is a scan that OCR will fail on. When your accounting software cannot extract the amount or the vendor from a receipt, the record falls into a manual-review queue. At low volume that is annoying. At high volume it is a bottleneck that defeats the point of having a digital system.

The goal is not just "store a scan." It is "store a scan that any system can read correctly two years from now."

Phone camera best practices

A smartphone with a decent scanning app produces better results than a flatbed scanner for most receipts, because the software handles perspective correction, contrast normalization, and edge detection that a raw photo does not. The technique matters more than the hardware.

Lighting

Indirect natural light is the ideal. Direct sunlight creates harsh shadows across creases and produces hot spots on thermal paper that wash out the print. Overhead office fluorescents create glare on glossy thermal surfaces. The practical fix for any indoor situation: hold the receipt at a slight angle away from the primary light source until the glare disappears from your phone screen before you capture. A few degrees of tilt eliminates most glare without affecting readability.

Avoid using the phone's flash on thermal receipts. The flash creates a direct specular reflection from the thermal coating that renders a clean horizontal band on the scan where the text is invisible. If the environment is too dark to scan without flash, move closer to a lamp rather than enabling flash.

Angle and distance

Hold the phone directly above the receipt, parallel to the surface. Angled shots create perspective distortion that scanning apps can partially correct, but the correction degrades OCR accuracy on text near the edges. Directly overhead at the minimum distance that captures the whole receipt with a small margin on each side gives the software the best input to work with.

Most scanning apps show a bounding-box preview before capture. Use it. If the box is not touching all four edges of the receipt, reposition rather than capturing and hoping the crop comes out right.

Cropping and margins

Capture the whole receipt with roughly half a centimeter of table or surface visible on each side. This gives the edge-detection algorithm a clear contrast boundary to work from. Do not try to crop to the exact edge of the paper in the viewfinder. Let the app crop after capture. Manual pre-crop in the viewfinder is a reliable way to clip the top or bottom line of the receipt.

After capture, check the preview. Specifically check that the first line (usually vendor name and address) and the last line (usually total and sometimes a legal disclosure) are fully visible. Those two lines carry the information that matters most for audit purposes. If either is cut off, rescan.

Multi-page and long receipts

Some point-of-sale receipts print across 30 to 40 centimeters of paper. Do not try to capture these in one shot by pulling back until the full length fits. The distant capture reduces resolution below what OCR needs for the small print that most thermal printers use.

Instead, capture overlapping sections: top third, middle third, bottom third, with each overlap containing two or three lines of shared text. The scanning app can either stitch them (Adobe Scan does this well) or you save them as separate pages of a single PDF. Either result is more readable than a single low-resolution full-length shot.

Dedicated scanning apps vs. native camera

The plain camera app on every phone takes photos, not scans. The difference is significant.

A photo is a faithful image of what you pointed the camera at, including whatever perspective distortion, shadows, and color cast were present. A scan adds several processing steps: perspective correction to make the receipt appear as if photographed directly overhead, contrast enhancement to make faded print more legible, edge detection to identify the document boundary, and in most cases OCR to embed searchable text in the output file.

Three apps that handle all of this well:

Adobe Scan (iOS and Android, free) is the standard choice for receipt scanning. It produces perspective-corrected PDFs with embedded OCR text, handles multi-page documents, and syncs to Adobe cloud or can export directly to Google Drive or a local folder. The OCR runs on-device for common text and in the cloud for complex layouts.

Google Drive's built-in scanner (Android, free) runs server-side OCR automatically. Every scanned PDF becomes searchable inside Drive without any extra step. The main limitation is Android-only for the direct scan feature, and it requires a Google account.

Apple Notes (iOS, free, built-in) has a document scanner accessible from the camera icon in any note. It produces multi-page PDFs with perspective correction and syncs to iCloud automatically. It does not run OCR natively, but files stored in iCloud become searchable through the iOS Files app because Apple runs server-side text recognition on stored documents.

The native camera wins exactly one comparison over dedicated scanning apps: it is always open. For a receipt you will digitize properly in 30 seconds anyway, native camera plus a scanning app is no better than just opening the scanning app. For receipts you need to capture in a hurry and process later, a native photo is fine as long as you come back and process it the same day.

The thermal receipt problem

Thermal paper is the source of more audit problems than any other receipt type, and the reason is chemistry. Thermal printers work by heating coated paper to produce the image. The same coating that responds to heat also responds to light, ambient temperature, and contact with oils and plasticizers. Wallets, car dashboards, plastic bags, and sunlit windowsills all degrade thermal receipts.

Most thermal receipts fade to near-blank within two to five years. Some fade within months under bad storage conditions. A receipt that looks fine today may be a pale blank by the time a three-year IRS audit window expires.

There is no way to slow the fading that is practical for everyday business operations. The only reliable response is same-day scanning. Treat thermal paper as a temporary artifact that must become a digital file before it returns to your wallet. Not "scan it this week." The same day.

A few thermal-specific scanning tips that differ from regular paper:

Skip the flash entirely. The thermal coating reflects light differently than regular paper. Flash creates a horizontal glare band across the center of the image that is invisible on the physical receipt but renders as white-out on the scan.

High contrast mode helps. Adobe Scan and most dedicated apps have a "grayscale" or "document" mode that increases contrast during processing. Use it for thermal receipts. The lighter gray of faded thermal text on slightly yellowed paper needs more contrast pull than fresh black ink on white paper.

Check the faint parts before filing. After scanning a thermal receipt, zoom in on the total line and the vendor header in the preview. These are the two areas most likely to drop below readable contrast first. If the total is hard to read on screen, it will fail OCR. Rescan from a better angle before the receipt leaves your possession.

The physical receipt can be discarded once the scan is confirmed complete and legible. Most jurisdictions allow this. Keeping a box of fading thermal paper as backup to a digital archive is not a useful hedge. If the digital copy is incomplete, the thermal paper is not going to save you two years from now.

After scanning: OCR, filing, and retention

Scanning is capture. What you do with the file in the next five minutes determines whether the scan is useful six months from now.

OCR and searchability

A PDF that is just an image is a dead file. You can open it and look at it, but you cannot search inside it, and accounting software cannot extract the fields it needs to create a transaction. Embedded OCR text is what makes a scanned receipt useful downstream.

If your scanning app runs OCR automatically (Adobe Scan, Google Drive, the iOS Files indexing), verify the output before filing. Open the PDF and try selecting text. If the text is selectable, OCR ran successfully. If you can only select the whole page as an image, OCR either did not run or failed.

For receipts where automatic OCR fails, often because of poor scan quality, faded thermal print, or non-standard layouts, manual correction takes thirty seconds: open the PDF in any editor that allows annotation, add a text note with the vendor, date, and amount. That minimum metadata is enough for search and enough for most accounting tools to work with.

Naming convention

File naming is the difference between an archive you can search and a folder you have to scroll through. The convention that survives long-term is YYYY-MM-DD_VendorName_Amount.pdf. Examples: 2026-04-15_OfficeDepot_47.99.pdf, 2026-03-22_Uber_24.50.pdf.

Date-first sorts correctly in any file system without additional tooling. Vendor name in the middle makes the file recognizable at a glance. Amount at the end is useful for quick matching against bank statements without opening the file.

Pick a convention and apply it consistently. The exact format matters less than the consistency. Changing formats halfway through a year creates a sorting problem that defeats the organization.

Filing structure

For low to medium volume, two structures work:

By category: Receipts/Software, Receipts/Travel, Receipts/Meals, Receipts/OfficeSupplies. Use categories that match your tax return line items. At filing time, the folder total is already organized by the axis you need.

By date then category: Receipts/2026/Q1/Travel, Receipts/2026/Q1/Meals. Useful when you need to reconstruct a specific period quickly, such as for a client audit covering six months.

For more on building an archive that survives long-term, the best way to organize receipts guide covers eight organizational methods with the volume thresholds where each one breaks.

Retention and immutability

Cloud storage with version history (Google Drive, OneDrive, Dropbox) satisfies the IRS requirement for a system that does not allow undetected alteration, because file versions are logged automatically. A modified file leaves a version trail.

Local storage without backup does not satisfy this requirement. A receipt saved to a laptop hard drive can be deleted silently, with no audit trail. This matters if you are ever audited and your records are challenged.

Keep a second copy in a separate location. One copy in Drive and one copy in a dated folder synced to OneDrive is redundant enough for most business needs. The cost is negligible. The cost of reconstructing three years of expense records from bank statements because one storage location was lost is significant.

Scanning for different use cases

The right setup differs by how many receipts you process and what you do with them afterward.

Personal expenses and side income

Volume is low, usually under 30 receipts a month. A phone scanning app and a dated folder in cloud storage is the appropriate level of infrastructure. Spend five minutes scanning at the end of each week. The effort of setting up automation exceeds the time saved for this volume.

One exception: if you have a side business that generates Schedule C or equivalent income, treat it as a small business from the first receipt. The IRS does not grade on a curve for small side income; it grades on completeness of records.

Freelance and independent contractors

Volume is usually 30 to 100 receipts a month across a mix of meal receipts, transport, subscriptions, and equipment. This is where manual scanning starts costing real time. Most of the volume arrives as email invoices or digital receipts, not physical paper. Connecting your email inbox to an AI-powered extractor handles the digital majority automatically, and physical receipts via phone scan fill the gap. Our AI processing page covers what the extraction pipeline pulls from each receipt: vendor, date, total, tax breakdown, currency, and line items.

Small businesses with multiple employees

Receipts arrive from multiple people across multiple channels. Paper receipts from employees on the road, corporate card statements, vendor invoices from email, platform receipts from tools like Stripe and Amazon Business. At this scale, a single scanner app and a shared Drive folder is not a system. It is a pile.

The pattern that works: email-based extraction as the primary channel for vendor invoices and digital receipts, a mobile capture path (phone scan or chat bot submission) for physical receipts employees generate, and a routing rule that puts all receipts into a shared archive with org-level access, not scattered across personal phone photos. The tools page covers the capture and extraction options that fit this setup.

Accountants and bookkeepers handling client records

The problem is not your own receipts. It is a client who hands over a shoebox, a phone camera roll, or an inbox they have not organized in 18 months. First step is always the same: run a historical sweep of the email inbox to capture every digital invoice, then handle physical paper manually.

For the email sweep, the setup is a read-only OAuth connection to the client's inbox and a historical extraction back to the start of the audit period. This surfaces every vendor invoice, subscription charge, and platform receipt that arrived via email in one pass, without touching any emails or requiring the client to do anything after the initial connection. Physical thermal receipts from that same period are often already too faded to read, which is the conversation to have with the client upfront.

Common mistakes that blow up at audit time

Seven errors that accountants and auditors see repeatedly, all fixable with small adjustments to the capture process.

Cropping off the VAT or tax line

The bottom of a receipt often carries the tax breakdown: subtotal, tax rate, tax amount, and total. This is the most important section for any jurisdiction where VAT or GST reclaim is relevant. If you crop this section off because the receipt was long and you captured only the vendor name and total, the scan fails as a VAT invoice. The crop looks fine until someone checks the structural completeness of the document.

Fix: always include the full receipt with a small margin. Check the bottom of the preview before filing.

One blurry JPEG instead of a PDF

JPEG compresses images using lossy compression. Each save cycle degrades quality slightly. A JPEG that looks fine on the phone becomes harder to read after it is emailed to an accountant, recompressed by the email attachment handler, downloaded, opened, and screenshotted for the audit binder. PDF does not recompress on save and is universally accepted as an audit-grade format.

Fix: use a scanning app that outputs PDF. Scan-to-JPEG is a camera setting, not a scanning setting.

Storing scans in a personal photos library

Phone camera rolls are not auditable storage. Files can be deleted without a version history, they are not accessible to anyone else without manual sharing, and they are lost if the phone is replaced without a backup. "All my receipts are in my photos" is a statement that will not survive a multi-year audit window.

Fix: scan directly into a cloud storage app with version history. Never rely on the camera roll as the primary store.

Filing the notification email instead of the invoice

For vendors that send a "your receipt is ready" notification with a link to download the actual invoice from a portal, filing the notification email is a common error. The notification is proof the email was sent. The invoice PDF is the actual document. These are not the same thing, and the notification alone does not qualify for VAT reclaim or expense deduction in most jurisdictions.

This is especially common for Amazon Business orders, where the order confirmation is not a tax invoice and the actual invoice has to be downloaded from the business account separately.

Fix: for any receipt that arrives as a notification with a portal link, follow the link and download the actual invoice PDF before filing. Then file the PDF, not the email.

No consistent naming after scanning

A folder of files named scan001.pdf, IMG_20260315.jpg, receipt_final_v2.pdf, and unnamed.pdf is not an archive. It is unsearchable, unsortable, and impossible to audit without opening every file. An auditor asking for "all receipts from Q1 2026 with Amazon" should be able to get an answer without opening 200 files.

Fix: apply a consistent naming convention at the point of scanning, not after. Rename before filing, not during tax prep.

Not confirming OCR ran successfully

Filing a scan and assuming OCR worked is a reliable source of broken records. If the app failed to extract text, the file is an image, not a searchable document. Three months later, searching your archive for a vendor finds nothing because the text was never indexed.

Fix: after each scan, verify the PDF has selectable text before filing. Thirty seconds of checking saves thirty minutes of hunting later.

Mixing personal and business receipts in the same archive

In a tax audit, the auditor will want to see only business receipts. If personal and business are in the same folder, you either produce the whole archive (including personal purchases you did not want to share) or spend hours sorting before you can respond. Either outcome is worse than maintaining separation from the start.

Fix: two folders, two categories, never mixed. If receipts arrive in a shared inbox, tag at capture time which entity they belong to.

Start for free and extract your first 10 invoices without a credit card.

When scanning alone is not enough

Phone scanning handles physical receipts well. It handles the 15 to 20 percent of your receipt volume that arrives as thermal paper or paper invoices.

The other 80 percent arrives as email attachments, portal downloads, and digital invoices that never existed as paper. For that volume, the right answer is not scanning. It is connecting the inbox directly to an extraction pipeline that processes every incoming invoice automatically, with zero manual capture per receipt.

The combination that covers everything: email-based extraction for digital receipts, phone scan for physical paper, and both flowing into the same archive. Once that is set up, the per-receipt manual effort drops to near zero, the archive is searchable, and the OCR question becomes "does the extractor handle this vendor's layout" rather than "did I scan this correctly."

If you want to see what the email extraction side looks like against your actual invoices, connecting an inbox with read-only access takes about five minutes, and the first 90-day historical sweep surfaces every invoice that arrived by email in that window without any manual capture.

The physical scanning practices in this guide are the right answer for the receipts that the automated system cannot reach. Together, they cover everything.