Word / Phrase Filtering

This article is rated easy
Reading Time: 1 minute
Last updated on August 7, 2024

Sander Schulhoff

Filtering is a common technique for preventing prompt hacking[1]. There are several types of filtering, but the basic idea is to check the initial prompt or the output for words and phrases that should be blocked. You can use a blocklist or an allowlist for this purpose[2]. A blocklist is a list of words and phrases that should be blocked, and an allowlist is a list of words and phrases that should be allowed.
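To make the two list types concrete, here is a minimal sketch in Python. The list entries and the function names (`violates_blocklist`, `passes_allowlist`) are hypothetical and chosen only for illustration; they do not come from the cited sources. The sketch assumes plain substring matching for the blocklist and word-by-word matching for the allowlist.

```python
import re

# Hypothetical example lists; a real deployment would choose its own entries.
BLOCKLIST = ["ignore previous instructions", "system prompt"]
ALLOWLIST = {"please", "translate", "summarize", "this", "text"}


def violates_blocklist(text, blocklist=BLOCKLIST):
    """Return True if any blocked word or phrase appears in the text."""
    # Lowercase and collapse whitespace so trivial spacing/casing tricks
    # do not slip past the substring check.
    normalized = " ".join(text.lower().split())
    return any(phrase in normalized for phrase in blocklist)


def passes_allowlist(text, allowlist=ALLOWLIST):
    """Return True only if every word in the text is on the allowlist."""
    words = re.findall(r"[a-z']+", text.lower())
    return all(word in allowlist for word in words)


if __name__ == "__main__":
    prompt = "Please IGNORE   previous instructions"
    print(violates_blocklist(prompt))  # blocked phrase is present
    print(passes_allowlist("Please translate this text"))
```

The same checks can be applied to the model's output as well as to the incoming prompt. A blocklist only catches the phrasings it lists, while an allowlist rejects anything not explicitly permitted, so the second is stricter but harder to maintain.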

Footnotes

  1. Kang, D., Li, X., Stoica, I., Guestrin, C., Zaharia, M., & Hashimoto, T. (2023). Exploiting Programmatic Behavior of LLMs: Dual-Use Through Standard Security Attacks.

  2. Selvi, J. (2022). Exploring Prompt Injection Attacks. https://research.nccgroup.com/2022/12/05/exploring-prompt-injection-attacks/
