IndicStack
Platform
ConsultingModelsAgentsComplianceBlog
Request Early Access
IndicStack

Product

  • Platform
  • Models
  • Agents
  • Consulting

Solutions

  • IT Services & Agencies
  • SaaS Startups
  • Regulated Industries
  • WhatsApp Automation

Company

  • About
  • Compliance
  • Blog

Legal

  • Privacy Policy
  • Terms of Service

IndicStack Consultancy Services LLP — Built for Indian AI builders.

30 May 2026compliancearchitecturebfsihealthcare

Building DPDP-Ready AI Architectures for BFSI and Healthcare

A technical guide for CTOs on designing India-hosted AI systems that satisfy DPDP Act 2023, RBI data localization, IRDAI outsourcing norms, and ABDM health data frameworks.

Why This Matters Now

The DPDP Act 2023 enforcement reaches full effect in May 2027. But for BFSI and healthcare, the timeline is effectively now - RBI, IRDAI, and ABDM already impose data governance expectations that shape how you can deploy AI. If you're building LLM-powered features for banking, insurance, or health applications, your architecture decisions today determine whether you pass compliance review tomorrow.

This guide maps DPDP and sectoral regulations into concrete architecture patterns for AI inference, RAG, and agents.


The Regulatory Stack

BFSI and healthcare AI systems sit under multiple overlapping frameworks:

FrameworkScopeKey Requirement for AI
DPDP Act 2023All digital personal dataConsent, purpose limitation, retention controls, breach notification (72hr), security safeguards
RBI Data Localization (2018)Payment system dataAll payment data stored exclusively in India; foreign copies deleted within 24 hours
RBI Outsourcing GuidelinesAll outsourced IT servicesRE remains accountable; vendor audit rights; data ownership retained
RBI FREE-AI FrameworkAI in financial servicesBoard-approved AI policies, model governance, fairness, explainability
IRDAI Outsourcing Regs (2017)Insurer IT outsourcingDue diligence, data security, audit access, India residency expectations
ABDM Health Data FrameworkHealth records under ABDMConsent artefacts, purpose-bound sharing, structured retention
DPDP SDF ObligationsHigh-volume/sensitive processorsDPO appointment, DPIAs, independent audits, algorithmic fairness

Key Penalties

  • Failure to implement reasonable security safeguards: up to 250 crore
  • Failure to report a breach: up to 250 crore
  • General Data Fiduciary obligation breaches: up to 200 crore

These are statutory maxima. The Data Protection Board considers severity, duration, and mitigation efforts.


What "DPDP-Ready" Means for AI Systems

DPDP is technology-neutral - it doesn't define "AI" or "LLMs" separately. But its obligations apply directly to AI workloads:

If a prompt contains personal data, DPDP applies to that processing. This includes customer names, account numbers, health details, or any text linkable to an individual.

Core requirements for AI systems:

  1. Lawful basis - Consent or legitimate purpose covering AI processing specifically
  2. Data minimization - Only necessary fields in prompts; redact where possible
  3. Purpose limitation - No reuse of data for model training without separate consent
  4. Storage limitation - Inference logs subject to retention policies and auto-deletion
  5. Security safeguards - Encryption, access control, breach detection for model endpoints
  6. Data Principal rights - Ability to locate and delete personal data in AI logs on request
  7. Accountability - DPAs with AI vendors; vendor treated as Data Processor

Reference Architecture

A DPDP-ready AI architecture for regulated verticals has four layers:

User Applications
Banking app, health portal, claims system
API calls containing prompts/documents
AI Governance & Gateway Layer
  • PII detection and classification
  • Consent & purpose validation
  • Redaction / pseudonymization
  • Policy-based routing
  • Request-level structured logging
Cleaned requests with metadata
India-Hosted AI Infrastructure
  • LLM inference (India DCs only)
  • Vector DB for RAG (India DCs only)
  • No-training-by-default guarantee
  • Configurable retention per route
Responses
Post-Processing & Output Controls
  • Output PII scanning
  • Human-in-the-loop for high-risk decisions
  • Evidence packet generation
Audit, Monitoring & DPDP Ops
  • Centralized logs (India-resident)
  • Data Principal rights workflows
  • DPIA support and SDF reporting
  • Automated retention enforcement

Every AI call passes through a governance layer that enforces DPDP and sector policies at the request level. The AI infrastructure itself is India-resident. The logging layer supports audit and rights requests.


Handling PII in Prompts

Prompts are the primary vehicle for personal data entering LLMs. Two strategies:

Strategy 1: Redact Before Inference

Replace direct identifiers with tokens before sending to the LLM:

  • Customer name → [CUSTOMER_001]
  • Account number → [ACCOUNT_001]
  • PAN → [REDACTED_PAN]

Keep a mapping table under strict access control. Works well for summarization, classification, and analysis tasks where the AI doesn't need to return specific identifiers.

Limitation: Pseudonymized data remains "personal data" under DPDP if re-linkable. Obligations still apply - but exposure is reduced.

Strategy 2: Restrict Inference Location

When redaction breaks the task (credit decisions, clinical support tied to a specific patient), process full prompts but only on India-hosted infrastructure with:

  • Contractual no-training guarantee (DPA)
  • Configurable retention (as short as zero)
  • Structured logging for audit trail
  • Access controls matching your internal data classification

For BFSI prompts containing payment data, RBI's localization circular makes this the only compliant option regardless of DPDP.


Embeddings and RAG: Yes, Residency Applies

Vector embeddings derived from personal data are personal data under DPDP. If the embedding can be linked back to an individual (even with auxiliary data), it's in scope.

This means:

  • Vector databases containing customer embeddings should be India-hosted
  • Cross-border transfer of embeddings engages DPDP's cross-border rules
  • Purpose limitation applies - embeddings created for search shouldn't be repurposed for profiling without fresh consent
  • Retention policies must extend to vector stores, not just primary databases

Architecture pattern: co-locate your vector DB with your LLM infrastructure in Indian data centers. Use the same DPA and retention controls.


Retention Policies by Sector

DPDP mandates storage limitation but doesn't hard-code periods. Sector norms provide guidance:

BFSI

Data TypeSuggested RetentionRationale
Raw inference logs (full prompts)30-180 daysDebugging, dispute resolution
Decision records (credit/risk)7 yearsFinancial record norms, audit
Aggregated metricsIndefiniteNo personal data
Embeddings from customer docsPurpose-dependent; delete when product relationship endsDPDP storage limitation

Healthcare

Data TypeSuggested RetentionRationale
Conversational logs (symptom checker)30-90 daysShort purpose; auto-delete
AI-assisted clinical decisionsLinked to medical record retentionMedico-legal requirements
Research embeddings (anonymized)Per research protocolMust be genuinely anonymized

Implement automated deletion with audit trails proving deletion occurred.


What Enterprise Procurement Asks

When BFSI and healthcare companies evaluate AI vendors, their compliance teams typically require:

  1. Data Processing Agreement (DPA) - Covering processor obligations, no-training clause, retention, breach notification
  2. Data flow diagram - Showing exactly where prompts travel, where they're stored, and for how long
  3. Security questionnaire - Encryption standards, access controls, incident response, SOC2 or equivalent
  4. Data residency confirmation - Written guarantee of India-only processing and storage
  5. Audit rights - Contractual right to inspect vendor's AI infrastructure
  6. Subprocessor list - Who else touches the data (GPU cloud provider, monitoring tools)
  7. Retention configuration - Proof that logs can be purged on schedule

If your AI vendor can't provide these on request, you have a procurement problem - not a vendor.


RBI's FREE-AI Framework

RBI's Framework for Responsible and Ethical Enablement of AI sets expectations for banks adopting AI:

  • Board-approved AI policies with clear governance
  • Model lifecycle management - versioning, monitoring, retirement
  • Fairness and bias assessment for credit and risk models
  • Explainability - ability to explain AI decisions to customers and regulators
  • Red-teaming - adversarial testing of AI systems
  • Incident reporting - AI-specific breach and failure reporting

While not yet a binding Master Direction, FREE-AI signals RBI's expectations and will likely shape future supervisory reviews. Design for it now.


Practical Checklist

For CTOs evaluating or building AI infrastructure for regulated workloads:

  • All inference processing runs in Indian data centers
  • AI vendor has signed DPA with no-training guarantee
  • Retention is configurable per use case and auto-enforced
  • Prompts containing PII are either redacted or processed India-only
  • Vector databases are India-hosted with same controls as LLMs
  • Structured audit logs capture: who, what data, which model, when, for what purpose
  • Data Principal rights workflows can locate and delete AI-related personal data
  • Human-in-the-loop exists for high-stakes decisions (credit, clinical)
  • Board/management has approved AI policy covering these systems
  • DPIA conducted for AI systems processing sensitive data at scale

Summary

DPDP-ready AI architecture for BFSI and healthcare isn't about checking a compliance box - it's about designing systems where data residency, retention, consent, and auditability are built into the infrastructure layer, not bolted on after the fact.

The regulatory direction is clear: Indian regulators expect financial and health data to stay in India, AI decisions to be explainable and auditable, and vendors to be contractually bound. Building on India-hosted AI infrastructure with proper controls isn't just the compliant choice - it's the one that lets you ship faster, because you won't be blocked by procurement.