Receipt Processing Service

A service that extracts structured information from receipt images using OCR and language models.

Overview

This service provides a REST API that processes receipt images (provided as base64-encoded strings) and extracts key information such as sender, receiver, amount, and transaction description. It runs Tesseract OCR over the image and passes the recognized text to a Hugging Face language model, which extracts these fields.

Example image

Features

Base64 Image Processing: Convert base64-encoded receipt images to structured data
Information Extraction: Extract sender, receiver, amount, and transaction description
REST API: Simple API endpoints for integration with other services
Docker Support: Easy deployment using Docker and Docker Compose
Memory Optimization: Smart text truncation and GPU/CPU processing options

Installation

Prerequisites

Docker and Docker Compose
Python 3.11+ (for local development)
Hugging Face account and API token

Using Docker (Recommended)

Clone the repository:

```
git clone <repository-url>
cd base64receipt-to-entity
```

Create a .env file from the template:
```
cp .env.template .env
```
Edit the .env file and add your Hugging Face token:
```
HF_TOKEN=your_huggingface_token_here
```
Build and start the service:
```
make build
make start
```

Local Development Setup

Install dependencies:
```
pip install -r requirements.txt
```

Set up environment variables:

```
cp .env.template .env
# Edit .env with your configuration
```

Run the service:
```
uvicorn src.api:app --reload
```

API Usage

Health Check

```
curl -X GET http://localhost:8000/api/ping
```

Process Receipt Image

```
curl -X POST http://localhost:8000/api/base64-to-receipt \
  -H "Content-Type: application/json" \
  -d '{"text": "base64_encoded_image_string"}'
```
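The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the service is reachable at `http://localhost:8000` as in the curl example; the helper names `build_payload` and `process_receipt` are illustrative and not part of this project.

```python
import base64
import json
import urllib.request

# Endpoint taken from the curl example; adjust host and port to your deployment.
API_URL = "http://localhost:8000/api/base64-to-receipt"


def build_payload(image_bytes: bytes) -> bytes:
    """Encode raw image bytes as base64 and wrap them in the JSON request body."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return json.dumps({"text": encoded}).encode("utf-8")


def process_receipt(image_path: str, url: str = API_URL, timeout: float = 60.0) -> dict:
    """POST a receipt image to the service and return the parsed JSON response."""
    with open(image_path, "rb") as f:
        body = build_payload(f.read())
    request = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling `process_receipt("receipt.png")` (where `receipt.png` is a placeholder path) should return a dictionary shaped like the sample response, provided the service is running.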

Sample Response

```
{
  "description": "Payment for services",
  "sender": "John Doe",
  "receiver": "ACME Corp",
  "value": "R$ 150,00"
}
```
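Note that `value` comes back as a formatted string rather than a number. If a numeric amount is needed downstream, a client could convert it. The sketch below assumes the Brazilian format shown in the sample (`R$` prefix, `.` as thousands separator, `,` as decimal separator); `parse_brl` is a hypothetical helper, and receipts in other currencies or formats would need different handling.

```python
from decimal import Decimal, InvalidOperation
from typing import Optional


def parse_brl(value: str) -> Optional[Decimal]:
    """Convert a string such as 'R$ 1.234,56' to Decimal('1234.56').

    Returns None when the cleaned string is not a valid decimal number.
    """
    cleaned = value.replace("R$", "").strip()
    # Drop thousands separators, then switch the decimal comma to a point.
    cleaned = cleaned.replace(".", "").replace(",", ".")
    try:
        return Decimal(cleaned)
    except InvalidOperation:
        return None
```

`Decimal` is used instead of `float` so that monetary amounts are not subject to binary rounding error.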

Architecture

The service is built with the following components:

FastAPI: Web framework for building the API
Tesseract OCR: Optical character recognition for extracting text from images
Hugging Face Transformers: Language models for text processing and information extraction
PyTorch: Deep learning framework that powers the language models
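
One way these components might fit together is a three-stage pipeline: decode the base64 payload, run OCR over the image, then pass the (truncated) text to the language model. The sketch below is an assumption about the flow, not the project's actual code. `decode_image`, `run_pipeline`, and `MAX_CHARS` are hypothetical names, and the injected `ocr` and `extract` callables stand in for the Tesseract and Transformers calls.

```python
import base64
import binascii
from typing import Callable, Dict

MAX_CHARS = 2000  # hypothetical truncation limit to bound prompt size


def decode_image(b64_text: str) -> bytes:
    """Decode the base64 payload, raising ValueError on malformed input."""
    try:
        return base64.b64decode(b64_text, validate=True)
    except binascii.Error as exc:
        raise ValueError("payload is not valid base64") from exc


def run_pipeline(
    b64_text: str,
    ocr: Callable[[bytes], str],
    extract: Callable[[str], Dict[str, str]],
    max_chars: int = MAX_CHARS,
) -> Dict[str, str]:
    """Decode, run OCR, truncate the text, then extract fields."""
    image_bytes = decode_image(b64_text)
    text = ocr(image_bytes)    # stand-in for a Tesseract wrapper
    text = text[:max_chars]    # keep the model input within a fixed budget
    return extract(text)       # stand-in for a Transformers model wrapper
```

Passing the OCR and extraction steps in as callables keeps the flow testable without loading Tesseract or a model.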

Configuration

Configuration is managed through environment variables:

| Variable | Description | Default |
|----------|-------------|---------|
| `MODEL_NAME` | Hugging Face model to use | google/gemma-3-1b-it |
| `DEVICE` | Device to run the model on (auto, cuda, cpu) | auto |
| `API_HOST` | Host for the API server | 0.0.0.0 |
| `API_PORT` | Port for the API server | 8000 |
| `HF_TOKEN` | Hugging Face API token | - |

See .env.template for more configuration options.
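
For illustration, the variables listed above could be read into a typed settings object as follows. This is a sketch, not the service's own loader: the `Settings` class and `load_settings` function are hypothetical, and the defaults simply mirror the documented ones.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class Settings:
    model_name: str
    device: str
    api_host: str
    api_port: int
    hf_token: Optional[str]


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Build Settings from environment variables, using the documented defaults."""
    device = env.get("DEVICE", "auto")
    if device not in {"auto", "cuda", "cpu"}:
        raise ValueError(f"DEVICE must be auto, cuda, or cpu, got {device!r}")
    return Settings(
        model_name=env.get("MODEL_NAME", "google/gemma-3-1b-it"),
        device=device,
        api_host=env.get("API_HOST", "0.0.0.0"),
        api_port=int(env.get("API_PORT", "8000")),
        hf_token=env.get("HF_TOKEN"),  # no documented default, so None if unset
    )
```

Validating `DEVICE` and converting `API_PORT` to an integer at load time surfaces a misconfigured `.env` at startup rather than on the first request.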

Development

For more detailed development information, see the Developer Guide.

Available Make Commands

The project includes a Makefile with useful commands:

```
make help     # Show available commands
make start    # Start the service
make stop     # Stop the service
make logs     # View logs
make build    # Build the Docker image
make clean    # Clean up Docker resources
make install  # Install Python dependencies
make test     # Run tests
```

License

This project is licensed under the MIT License.
