Alwin Arrasyid

alwint3r
·

AI & ML interests

LLM, GAN

Recent Activity

liked a model about 1 month ago
Qwen/Qwen2.5-7B-Instruct
View all activity

Organizations

alwint3r's activity

Reacted to vladbogo's post with 🤯 9 months ago
view post
Post
"LLM Agents can Autonomously Hack Websites" is a new paper that investigates the capacity of LLMs to autonomously execute cybersecurity attacks on websites, such as SQL injections without human guidance.

Key points:
* It uses a LLM integrated with Playwright, a headless web browser, enabling automated web interactions through function calling.
* It gives access to the LLM to 7 web hacking documents and planning capabilities through specific prompting, without disclosing the exact methods to prevent misuse.

GPT-4 achieves a 73.3% success rate on the tested vulnerabilities, emphasizing the potential cybersecurity risks posed by advanced LLMs. Other open models cannot yet perform these types of attacks (results in screenshot).

Congrats to the authors for their work!

Paper: LLM Agents can Autonomously Hack Websites (2402.06664)
  • 2 replies
·