kingfisher/data/rules/huggingface.yml
Mick Grove 0f953f59a5 pattern_requirements for rules — Post-regex character-class gating to cut false positives without lookarounds. Authors can now require minimum counts of digits, uppercase, lowercase, and special characters, with an optional custom special-char set.
Why: Hyperscan doesn’t support lookaheads/behinds, so many “must contain X and Y” checks had to be baked into the regex (hurting readability) or were impossible. pattern_requirements applies lightweight, in-memory checks after a match is found, keeping patterns fast and clean.
2025-11-04 13:55:31 -05:00

40 lines
No EOL
1,021 B
YAML

rules:
- name: HuggingFace User Access Token
id: kingfisher.huggingface.1
pattern: |
(?xi)
(?:
(
(?:api_org|hf)_
(?:[0-9A-Z]{17}){2}
)
)
\b
pattern_requirements:
min_digits: 2
references:
- https://huggingface.co/docs/hub/security-tokens
min_entropy: 3.3
confidence: medium
examples:
- 'HF_TOKEN:"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"'
- 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
validation:
type: Http
content:
request:
headers:
Authorization: Bearer {{ TOKEN }}
Content-Type: application/json
method: GET
response_matcher:
- report_response: true
- status:
- 200
type: StatusMatch
- match_all_words: true
type: WordMatch
words:
- '"name":'
- '"id":'
url: https://huggingface.co/api/whoami-v2