Gabriel C's picture

Gabriel C

gabrielchua

·

https://gabrielchua.me

AI & ML interests

Large Language Models, AI Safety, Causal Inference

Recent Activity

updated a Space 21 days ago

govtech/rai-bench

updated a Space 22 days ago

govtech/lionguard-demo

updated a Space 22 days ago

govtech/lionguard-demo

View all activity

Organizations

authored 5 papers 5 months ago

Running in CIRCLE? A Simple Benchmark for LLM Code Interpreter Security

Paper • 2507.19399 • Published Jul 25 • 1

LionGuard 2: Building Lightweight, Data-Efficient & Localised Multilingual Content Moderators

Paper • 2507.15339 • Published Jul 21

Toxicity-Aware Few-Shot Prompting for Low-Resource Singlish Translation

Paper • 2507.11966 • Published Jul 16

Measuring What Matters: A Framework for Evaluating Safety Risks in Real-World LLM Applications

Paper • 2507.09820 • Published Jul 13

RabakBench: Scaling Human Annotations to Construct Localized Multilingual Safety Benchmarks for Low-Resource Languages

Paper • 2507.05980 • Published Jul 8 • 1

authored a paper 9 months ago

MinorBench: A hand-built benchmark for content-based risks for children

Paper • 2503.10242 • Published Mar 13 • 5

authored a paper about 1 year ago

A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection

Paper • 2411.12946 • Published Nov 20, 2024 • 22