Is It Even Legal to Put Customer Data Into AI? India's DPDP Act for SMBs in 2026
India's DPDP Rules were notified in November 2025, with obligations biting by May 2027. There is no small-business exemption. Here is what the law actually requires when an AI agent touches customer data — and how to build automation that stays on the right side of it.
Short answer: Yes — an Indian SMB can legally put customer data into AI tools. India’s Digital Personal Data Protection (DPDP) Act, with its Rules notified on 13 November 2025, allows it, but on conditions: clear consent, a stated purpose, retention limits, “reasonable” security, and a plan to report a breach within 72 hours. The substantive obligations bite roughly 18 months after notification — around 13 May 2027 — so you have a runway. The catch: the decisions that make compliance cheap or painfully expensive are the architectural ones you make now, while you’re building the automation.
This is the data-privacy conversation we now have on most discovery calls. Here’s the plain-language version. (This is operational guidance, not legal advice — for your specific situation, talk to a lawyer.)
Is it legal for an Indian SMB to put customer data into AI tools?
Yes, with guardrails. The DPDP Act doesn’t ban sending personal data to an AI system; it governs how. If you collect a customer’s phone number, address, or order history and an AI agent processes it, you’re a “Data Fiduciary” and that processing needs a lawful basis — in practice, consent — plus the safeguards below. The law is principle-based, not a list of banned tools. An AI agent answering WhatsApp messages is fine; an AI agent quietly retaining every chat forever, with customer PII flowing to a foreign model under no agreement, is where you get exposed.
What is the DPDP Act, in plain language for a 20-person business?
It’s India’s first comprehensive law for digital personal data. Strip the legalese and it asks five things of anyone who handles customer data:
- Tell people what you’re collecting and why — an itemized, specific notice, not a buried paragraph.
- Get consent for that purpose — and use the data only for it (purpose limitation).
- Don’t keep it forever — define retention timelines and delete when the purpose is done.
- Keep it reasonably secure — access controls, encryption, the basics.
- Have a breach plan — notify the Data Protection Board and affected people, within 72 hours.
Children’s data (under 18) needs verifiable parental consent — relevant if you serve students, parents, or families.
Does it apply to me if I’m small?
Yes. This is the most common misconception we hear. There is no blanket small-business or startup exemption. Obligations scale with the data you process, not your headcount or turnover. A 10-person D2C brand with 50,000 customer records carries more real obligation than a 200-person firm that barely touches personal data. The government can notify certain classes as exempt or as “Significant Data Fiduciaries” with extra duties, but the baseline duties apply to SMBs from day one of enforcement.
What does “consent” actually mean when an AI agent handles a WhatsApp chat?
Consent under DPDP has to be free, specific, informed, and revocable — tied to a clear purpose. In an AI-automation context that translates to concrete design choices:
- When a customer first messages your WhatsApp agent, the notice (“we use your messages to answer queries and process orders; reply STOP to opt out”) should be surfaced, not hidden in a privacy policy nobody opens.
- If you later want to use those chat logs to train or fine-tune a model, that’s a different purpose — it needs its own basis, not a free ride on the original consent.
- “Revocable” means a customer can withdraw, and your systems must actually honor it — which means you need to know where their data lives.
Where do SMBs accidentally break this with AI?
Almost always one of four places — and all four are architecture problems, not paperwork problems:
- Raw PII straight to a foreign LLM with no agreement. Piping a customer’s full name, Aadhaar, or bank details to a model API with no data-processing terms and no idea where it’s stored.
- Infinite retention. Chat transcripts, call recordings, and extracted documents piling up forever “just in case.” Every record you keep past its purpose is liability with no upside.
- No breach plan. A 72-hour notification clock you can’t meet because nobody owns it and you can’t even tell what was exposed.
- Cross-border by default. Customer data leaving India through whatever region your tools default to, with no decision ever made about it.
How do you build AI automation that’s DPDP-ready?
The good news: privacy-by-design makes the automation cheaper to run, not just safer. The patterns we use:
- Redact before the model sees it. Mask sensitive fields (Aadhaar, PAN, account numbers) before they reach a cloud LLM, restore after. We use this same redaction pattern in document processing; it maps directly to DPDP’s data-minimisation principle.
- Self-host the model for sensitive workloads. For finance, healthcare, or anything with data-residency needs, run open-source models on infrastructure you control so personal data never leaves your environment.
- Build retention in, not on. Set a time-to-live on chat logs, recordings, and extracted data from day one. Auto-delete on schedule. “We’ll clean it up later” never happens.
- Capture consent at the entry point and store it with the record, so you can prove it and honor withdrawal.
- Keep an audit trail. Who accessed what, what was sent where. You can’t report a breach you can’t reconstruct.
What’s the minimum an SMB should do before May 2027?
You don’t need a compliance department. You need five things done once and maintained:
- Name a privacy owner. One person accountable — usually the founder or ops lead in an SMB.
- Map your data flows. What personal data you collect, where it sits, which tools (including AI vendors) touch it.
- List your sub-processors. Every LLM provider, every SaaS that sees customer data — and check each has acceptable data terms.
- Set retention limits and turn on deletion.
- Write a one-page breach runbook so the 72-hour clock is survivable.
Do these and you’re most of the way to compliant and running a cleaner operation. The runway to 2027 is for getting it right, not for ignoring it.
What’s the next step?
If you’re deploying AI that touches customer data — a WhatsApp agent, a support bot, document processing — the cheapest time to bake in DPDP-readiness is during the build, not as a retrofit after enforcement. On a 15-minute discovery call we’ll map which parts of your automation touch personal data and how to architect them so compliance is a property of the system, not a scramble. Browse our solutions for the privacy-aware patterns we ship. (Again: operational guidance, not legal advice — loop in a lawyer for your specifics.)
About Shera
Co-Founder & Operations at ClosedChats AI. Owns commercial conversations and ROI modeling. Translates "we want this automated" into a project plan that pencils out.