FILE VAULT
Enterprise-grade document management system that automatically detects and redacts PII from PDF files using Google Cloud AI, ensuring zero persistence of sensitive data across storage and database layers. Built with defense-in-depth security architecture and comprehensive audit logging.
Project Overview
Security Architecture: Built with defense-in-depth security featuring 5-layer authorization, 7-layer data protection, and a 5-zone trust model ensuring comprehensive PII protection at every system boundary.
AI Integration: Leverages Google Cloud Document AI for OCR with bounding box coordinates, Cloud DLP for PII detection across 15+ types, and Vertex AI (Gemini 2.5 Pro) for intelligent field extraction from redacted documents.
Production Ready: Serverless architecture on Cloud Run with auto-scaling, sub-500ms API response times, comprehensive audit logging via Cloud Logging, and pay-per-use pricing optimized for cost efficiency.
Tech Stack
frontend:
backend:
google cloud:
infrastructure:
Key Features
AI-Powered PII Detection
Multi-stage pipeline using Google Cloud Document AI and DLP to detect 15+ PII types with coordinate-based redaction and self-validating verification to ensure zero PII leakage.
Zero-Persistence Architecture
5-layer authorization with 7-layer encryption including TLS 1.3, JWT signatures, AES-256-GCM encryption, and automatic original file deletion within 122 seconds.
Intelligent Field Extraction
Vertex AI (Gemini 2.5 Pro) extracts numerical tax fields from redacted PDFs with structured data storage ensuring zero PII in database schemas.
Business Value
Enterprise Security
Production-ready security architecture with zero PII persistence
AI-Powered
Automated PII detection and intelligent field extraction
Serverless Scale
Auto-scaling Cloud Run with sub-500ms response times
Complete Audit Trail
Comprehensive logging with structured Cloud Logging
Gallery






