Privacy Code Scanner

HIPAA Compliance Code Scanning

Trace every PHI flow directly in your application's source code to satisfy HIPAA 164.308(a)(1)(ii)(B) risk analysis, 164.312(b) audit controls, and 164.314(a)(2) business associate oversight, before code is ever pushed to production.

Start Free on GitHub Book a Live Demo

HoundDog.ai dataflow trace of a Medical History PHI element propagating through string transformations into a LangChain SystemMessage and then sent to OpenAI, with line-level code evidence and a left-to-right visualization showing Medical History flowing through Patient_Management.py into OpenAI

The Problem

Why HIPAA Compliance Breaks at the Code Level

PHI exposures to risky data sinks like logs, third-party services, and AI integrations are rarely intentional. They happen as codebases grow. A developer prints a full user object, a tainted variable carries PHI through a chain of transformations, and by the time anyone notices, the data has already been logged or sent to a third party. HoundDog.ai traces every flow into every SDK, API, and AI integration directly in code, so Business Associate Agreement violations are caught at scan time, before any data leaves your application.

164.308(a)(1)(ii)(B)

Manual Risk Analysis Cannot Keep Up

HIPAA requires periodic risk analysis on how PHI is accessed, shared, and stored, but interview-driven assessments lag behind every release
Unpushed code changes routinely add new PHI sinks that the last risk analysis never accounted for
By the time the assessment is signed, the application has already moved on

A snapshot of a system that no longer exists.

164.312(b)

Audit Controls Without Code Visibility

Current manual workflows and reactive tools infer data flows only after the data has already been flowing, so audit controls cannot tell you where PHI actually lives
PHI ends up in logs, temp files, and ingestion systems because of routine developer oversight, not malice
Every unnecessary storage location grows the blast radius of a single breach

No code-level view, no real audit trail.

164.314(a)(2)

BAAs Cannot Police Themselves

A signed Business Associate Agreement does not stop a developer from shipping a new PHI field into a third-party SDK
Vendors are added directly in code, often months before privacy or legal teams find out
Oversharing past the BAA's permitted scope is what regulators and auditors actually catch

Contract in place, controls missing.

The result

Stale Risk Documentation

Risk analyses drift the moment a new release ships.

PHI Sprawl in Logs and AI

Medical history and identifiers reach OpenAI, Sentry, and Datadog without anyone signing off.

Penalties Per Violation

HIPAA fines are calculated per record and per category, multiplying fast.

How It Works

How Code-Driven HIPAA Compliance Works

HoundDog.ai operates inside the development pipeline, tracing how PHI actually flows as code is written and changed. Scans run locally. Your code never leaves your machine, and HoundDog.ai never touches production PHI.

Scan Code as It Is Written

Integrates with IDE plugins for VS Code, IntelliJ, and Cursor, and with CI pipelines. Analyzes source code to map PHI flows across logs, storage, APIs, third-party integrations, and AI services, including hidden or "shadow" integrations.

The taint-flow static analysis detects PHI elements by variable, method, function, and field name, traces them through intermediate transformations across files, functions, and procedures regardless of nesting depth, and flags them when they reach a sink, whether it is a controlled sink like a database or a high-risk one like an LLM prompt.

164.308(a)(1)(ii)(B) This continuous code-level scan replaces point-in-time risk analyses with one that updates on every commit.

HoundDog.ai finding for Auth Token data sent to Standard Output, with severity Critical, the GitHub code segment for console.log of apiKey and apiSecret, compliance framework tags including GDPR-A32 and NIST800-53, and a left-to-right visualization of the data flow into Standard Output

Trace PHI Flows by Data Sink

Automated data flow mapping shows exactly which PHI and PII elements reach each data sink per repository, from logs and AI services like OpenAI to third-party integrations like Salesforce and Slack, with every flow rated safe or risky.

More than 100 sensitive data types supported, including PHI per HIPAA's definition (Medical History, Medical Record Numbers, diagnosis codes, prescription data, biometric and genetic identifiers), traditional PII, CHD per PCI's definition, and auth tokens and secrets.
More than 1,000 integrations supported, including direct and indirect AI SDKs such as OpenAI, Anthropic, LangChain, and LlamaIndex, and third-party integrations spanning monitoring, SIEM, CRM, payment, and many other categories.

164.312(b) Every PHI sink becomes a known audit point with line-level code evidence, not a guess.

HoundDog.ai Datamap view grouped by data sink, showing Medical Record Number tagged PHI and Risky reaching Local Storage, Medical History tagged PHI and Risky reaching OpenAI, Auth Token in JSON Web Token risky, and PII data flowing into AWS SES and Logs across multiple repositories

Suggest RoPA and BAA Updates from Real Code

New PHI flows and subprocessors become suggested edits in your Org RoPA, each traceable to the code that generated it, with the privacy team reviewing and approving every change. The same evidence feeds your BAA inventory so you always know which vendors actually receive PHI, not just which ones were named in last quarter's review.

164.314(a)(2) Subprocessor changes surface as soon as a developer wires up a new SDK, weeks before they reach production.

HoundDog.ai suggested edit to the Names of Subprocessors and DPA Status column in the Org RoPA, where a green Suggestion adds Amplitude with DPA established and LangChain with DPA status unknown to the existing list of Stripe, Adyen, Sentry, Datadog, and OpenAI subprocessors, with approve, reject, and edit controls

Enforce Before Deployment

Bake your HIPAA and BAA policies into the pipeline by customizing the types of data allowed per data sink, then block unsafe PHI flows when they are introduced in pull requests as part of your CI pipeline. Default allowlists ship out of the box, incorporating the standard data types each vendor typically covers under a BAA.

Unapproved PHI sharing is addressed while context is fresh and remediation costs are low. Preventive enforcement turns compliance from a paperwork exercise into an operational control.

HoundDog.ai data sink rule for Stripe with trust mode set to risky and a customizable safe data elements allowlist showing Auth Token, Full Name, Email, Bank Card Number, Address, First Name, Last Name, Bank Account Number, and Phone Number selected, with the dropdown open showing additional searchable data elements

HIPAA Rules in Code

How HoundDog.ai Maps to the Three Rules That Trip Up Engineering

Risk analysis, audit controls, and business associate oversight. Three Security Rule provisions where the textbook answer breaks once real code ships. Here is what changes when evidence comes from the codebase.

164.308(a)(1)(ii)(B)

Proactive Data Flow Mapping & PHI Inventory

Accurate risk assessment of every ePHI flow across the system.

Today's Challenges

Production telemetry and network monitoring only infer where PHI lives, after the fact
Unpushed code changes alter flows the last risk analysis never saw
Compliance reacts to changes long after engineers ship them

With HoundDog.ai

Source code is the single source of truth for every PHI flow
The Datamap inventories every storage, log, and third-party sink that receives PHI
Risk analysis updates on every commit, not every quarter

164.312(b)

Proactive Data Minimization & Audit Trail

Mechanisms to record and examine activity in systems that handle ePHI.

Today's Challenges

PHI lands in logs and temp files with no business justification
Mapping PHI flows manually does not scale to modern release cadences
Audit trails fragment across every new integration and storage location

With HoundDog.ai

PHI in risky sinks is flagged before code is pushed to production
PHI storage footprint shrinks to only the locations policy approves
Taint-flow analysis traces every PHI element across every code path and transformation, no matter how deep the nesting

164.314(a)(2)

Prevention of Third-Party Oversharing

Business associate contracts and satisfactory assurances PHI is safeguarded.

Today's Challenges

BAAs cover the contract, not the actual fields a service receives
SDKs and AI vendors get added in code without privacy or legal review
Oversharing past the BAA's scope is what regulators catch

With HoundDog.ai

Every PHI element flowing to each business associate is detected in code
Per-vendor allowlists fail the pull request when an unapproved PHI element is added
New SDKs and AI services surface as suggested edits before they ship

Customer Trust

Build Customer Trust Through Transparent PHI Handling

164.308(a)(1)(ii)(B) Generate evidence-based data maps that show where PHI is collected, processed, and shared, including through AI and third-party integrations.
164.312(b) Maintain a continuous, code-level audit trail of every PHI element that enters logs, storage, or downstream services.
164.314(a)(2) Keep your Org RoPA and BAA inventory current with new PHI flows and subprocessors surfaced as suggested edits, with the privacy team reviewing and approving every change.
Auto-generate audit-ready Privacy Impact Assessments pre-populated with detected PHI flows and privacy risks, aligned with HIPAA, GDPR, and other frameworks.
Give privacy teams continuous visibility into PHI processing activities without surveys or manual discovery.
No production monitoring required. No retroactive cleanup. No guessing.

HoundDog.ai Privacy Code Scanner showing a Dataflows view that flags a Critical Medical History exposure to OpenAI from llm.invoke and a Medium Phone Number exposure to Sentry, with a table mapping detected PHI elements like Medical Record Number, Medical Condition, Sexual Orientation, and Medication to their data sinks and PHI or PII tags. Works in IDE and CI. No production access required.

Key Differentiators

What Makes HoundDog.ai Different

Purpose built for healthcare and AI-driven engineering teams that need HIPAA evidence grounded in real PHI flows detected directly from source code, not surveys or assumptions.

HoundDog.ai Datamap by data sink for Acme Corp, showing Auth Token tagged Secret and Risky in Logs, Bank Card Number tagged PIFI and Risky in Logs, Passport Number in Logs, Email and First Name flowing to Slack tagged Risky, Auth Token tagged Safe in OpenAI, and PHI and PII flows mapped across acmecorp repositories

Code-Level PHI Flow Intelligence

Detect and map PHI flows directly from source code across APIs, services, and third-party integrations without relying on surveys, spreadsheets, or production tools that miss hidden integrations and SDKs.

HoundDog.ai tracing Medical History PHI through patient_context into a LangChain SystemMessage and an llm.invoke call sent to OpenAI, with each step linked to the exact GitHub source line

Built for AI & LLM Workloads

Discover AI SDKs embedded in code and detect PHI flowing into LLM prompts and external AI APIs before your apps go live, even when the data passes through helper functions and string templates.

HoundDog.ai privacy analysis of a Medical Record Number flowing alongside patient first and last name to Salesforce via the jsforce library, flagged as a critical risk with HIPAA and GDPR rationale, severity reasoning, and explicit remediation guidance

Prevent BAA Violations Before Deployment

Catch PHI oversharing during development and code review, not after an MRN has already been written to a non-healthcare CRM. Each finding ships with HIPAA rationale and remediation guidance.

HoundDog.ai Org RoPA review awaiting approval with suggested edits to categories of personal data and subprocessor lists, including DPA establishment status for Stripe, Adyen, Sentry, Datadog, OpenAI, Amplitude, and LangChain

Audit-Ready Evidence Trail

Every PHI flow links to a specific line of source code, with HIPAA rule mapping and severity rationale. Your Org RoPA and PIA documentation stay current with suggested edits the privacy team can approve in one click.

FAQ

Frequently Asked Questions

How does HoundDog.ai help with HIPAA 164.308(a)(1)(ii)(B) risk analysis?

HoundDog.ai performs static analysis on source code to map every PHI data flow across storage, logs, third-party integrations, and AI services. This produces a continuously updated risk surface that satisfies the periodic risk analysis requirement of HIPAA 164.308(a)(1)(ii)(B) using actual processing behavior, not interview-based estimates.

How does this support HIPAA 164.312(b) audit controls?

HIPAA 164.312(b) requires audit controls over PHI. HoundDog.ai builds a code-level inventory of every place PHI is read, written, and forwarded, and flags PHI reaching unintended sinks like logs or files before code ships. The result is a continuous, evidence-backed trail that maps to audit control requirements and shrinks the storage footprint of PHI.

How does this help with HIPAA 164.314(a)(2) business associate agreements?

HIPAA 164.314(a)(2) requires that PHI sharing with business associates is limited to what is permitted by the BAA. HoundDog.ai detects every third-party SDK and API call in code, traces which PHI elements flow into each one, and lets you define a per-vendor allowlist that fails the pull request when an unapproved PHI element is added.

Does HoundDog.ai need access to production or to actual PHI?

No. HoundDog.ai runs entirely on source code in your development environment or CI pipeline. It never needs production access, never sees actual PHI, and never sends source code off your machine.

Does HoundDog.ai detect PHI flowing into LLMs like OpenAI or Anthropic?

Yes. HoundDog.ai recognizes direct and indirect AI SDKs including OpenAI, Anthropic, LangChain, and LlamaIndex, and traces which PHI fields end up inside prompts or model inputs before the call ever executes in production.

Make HIPAA Compliance a Property of Your Code

Trace every PHI flow before it ships, shrink the storage footprint of patient data, and keep your BAA inventory honest with evidence pulled directly from source.

Start Free on GitHub Book a Live Demo