FILE VAULT

Enterprise-grade document management system that automatically detects and redacts PII from PDF files using Google Cloud AI, ensuring zero persistence of sensitive data across storage and database layers. Built with defense-in-depth security architecture and comprehensive audit logging.

Project Overview

Security Architecture: Built with defense-in-depth security featuring 5-layer authorization, 7-layer data protection, and a 5-zone trust model ensuring comprehensive PII protection at every system boundary.

AI Integration: Leverages Google Cloud Document AI for OCR with bounding box coordinates, Cloud DLP for PII detection across 15+ types, and Vertex AI (Gemini 2.5 Pro) for intelligent field extraction from redacted documents.

Production Ready: Serverless architecture on Cloud Run with auto-scaling, sub-500ms API response times, comprehensive audit logging via Cloud Logging, and pay-per-use pricing optimized for cost efficiency.

Tech Stack

frontend:

Next.js 16.1React 19TypeScript 5Tailwind CSS 4NextAuth.js 5.0react-pdfRadix UI

backend:

FastAPI 0.115+Python 3.11Pydantic 2.10+SQLAlchemy 2.0+PyMuPDFpython-jose

google cloud:

Document AICloud DLP APIVertex AI (Gemini 2.5)Cloud StorageCloud SQL (PostgreSQL)Secret ManagerCloud LoggingCloud Run

infrastructure:

Upstash RedisVercelService Accounts (IAM)

Key Features

AI-Powered PII Detection

Multi-stage pipeline using Google Cloud Document AI and DLP to detect 15+ PII types with coordinate-based redaction and self-validating verification to ensure zero PII leakage.

Zero-Persistence Architecture

5-layer authorization with 7-layer encryption including TLS 1.3, JWT signatures, AES-256-GCM encryption, and automatic original file deletion within 122 seconds.

Intelligent Field Extraction

Vertex AI (Gemini 2.5 Pro) extracts numerical tax fields from redacted PDFs with structured data storage ensuring zero PII in database schemas.

Business Value

🔒

Enterprise Security

Production-ready security architecture with zero PII persistence

🤖

AI-Powered

Automated PII detection and intelligent field extraction

âš¡

Serverless Scale

Auto-scaling Cloud Run with sub-500ms response times

📊

Complete Audit Trail

Comprehensive logging with structured Cloud Logging

Gallery

File Vault - Upload Interface
File Vault - Document Processing
File Vault - PII Detection Dashboard
File Vault - Redacted Document View
File Vault - Field Extraction Results
File Vault - Security Settings
File Vault - Analytics Dashboard