prompt-injection-detector by Vishesh Agarwal
Detects hidden prompt injection instructions that might manipulate AI models like Copilot and Claude.
1 user
About this extension
AI assistants like GitHub Copilot, ChatGPT, and others read web page content when you ask them to help. Attackers can hide malicious instructions in that content — invisible to you, but visible to the AI — to hijack its behaviour, steal your data, or bypass safety filters.
PromptGuard detects:
- Hidden elements (display:none, visibility:hidden, zero opacity, sub-pixel fonts, same-colour text)
- HTML comments, which are invisible to humans but read by AI tools ingesting page source
- LLM-specific formats: [INST], system:, assistant: prompt injection patterns
Three sensitivity levels:
- 🟢 Normal — high-confidence imperative overrides only (low false positives)
- 🟠 High — adds jailbreak, DAN, developer-mode, bypass patterns
- 🔴 Ultra — adds roleplay, persona, exfiltration, and LLM prompt-format patterns
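One way to picture the three levels is as cumulative rule tiers, where each level scans with its own patterns plus those of every level below it. The sketch below is an assumption about how such tiers could be organised; the rule names, the regular expressions, and the `scanText` function are hypothetical and are not taken from the extension's source.

```typescript
// Hypothetical sketch: the tier names mirror the listing, but every regex
// here is an illustrative assumption, not the extension's real rule set.
type Level = "normal" | "high" | "ultra";

interface Rule {
  name: string;
  pattern: RegExp;
}

const RULES: Record<Level, Rule[]> = {
  normal: [
    { name: "imperative-override", pattern: /ignore\s+(all\s+)?(previous|prior)\s+instructions/i },
  ],
  high: [
    { name: "jailbreak", pattern: /\b(jailbreak|developer\s+mode)\b/i },
    { name: "dan", pattern: /\bDAN\b/ }, // case-sensitive to avoid matching the name "Dan"
    { name: "bypass", pattern: /bypass\s+(the\s+)?(safety|content)\s+filters?/i },
  ],
  ultra: [
    { name: "persona", pattern: /\b(pretend\s+to\s+be|act\s+as)\b/i },
    { name: "exfiltration", pattern: /send\s+.*\bto\s+https?:\/\//i },
    { name: "prompt-format", pattern: /\[INST\]|^\s*(system|assistant)\s*:/im },
  ],
};

// Levels ordered from least to most sensitive.
const ORDER: Level[] = ["normal", "high", "ultra"];

// Collect the rules for the chosen level and every level below it.
function rulesFor(level: Level): Rule[] {
  const upTo = ORDER.indexOf(level);
  let rules: Rule[] = [];
  for (let i = 0; i <= upTo; i++) {
    rules = rules.concat(RULES[ORDER[i]]);
  }
  return rules;
}

// Return the names of all rules that match the given text at this level.
function scanText(text: string, level: Level): string[] {
  return rulesFor(level)
    .filter((rule) => rule.pattern.test(text))
    .map((rule) => rule.name);
}
```

With these assumed rules, a phrase such as "Enable developer mode now" produces no finding at Normal but is flagged at High and Ultra, which matches the trade-off the levels describe: broader patterns catch more at the cost of more false positives.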
Click any finding to flash and scroll to the exact element on the page.
All scanning runs locally in your browser. Nothing is sent anywhere.
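The hidden-element and HTML-comment checks listed above can be sketched as pure functions that need no network access, which is consistent with scanning that stays in the browser. This is a minimal sketch under stated assumptions: `StyleSnapshot`, `hiddenReason`, and `extractComments` are hypothetical names, and the thresholds are guesses rather than the extension's actual logic. In a content script the style values would presumably come from `window.getComputedStyle(element)`.

```typescript
// Hypothetical shape: a plain subset of computed-style values, so the
// check can run without a DOM.
interface StyleSnapshot {
  display: string;
  visibility: string;
  opacity: string;         // e.g. "0", "0.5", "1"
  fontSize: string;        // computed value in px, e.g. "0.5px", "16px"
  color: string;           // e.g. "rgb(255, 255, 255)"
  backgroundColor: string; // e.g. "rgb(255, 255, 255)"
}

// Return a short reason if the styles make text invisible to a human
// reader, or null if the element looks visible.
function hiddenReason(s: StyleSnapshot): string | null {
  if (s.display === "none") return "display:none";
  if (s.visibility === "hidden") return "visibility:hidden";
  if (parseFloat(s.opacity) === 0) return "zero opacity";
  const px = parseFloat(s.fontSize);
  if (!isNaN(px) && px < 1) return "sub-pixel font"; // assumed threshold: below 1px
  if (s.color === s.backgroundColor) return "same-colour text";
  return null;
}

// Pull the text of every HTML comment out of a page-source string.
function extractComments(html: string): string[] {
  const out: string[] = [];
  const re = /<!--([\s\S]*?)-->/g;
  let m: RegExpExecArray | null;
  while ((m = re.exec(html)) !== null) {
    out.push(m[1].trim());
  }
  return out;
}
```

The same-colour comparison here is deliberately naive: an element's own computed background is often transparent, so a real check would likely need to walk up the ancestors to find the effective background colour.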
Rated 0 by 0 reviewers
More information
- Version: 1.0.0
- Size: 20.48 KB
- Last updated: 2 months ago (April 4, 2026)
- License: MIT License