EvalOps Unfiltered

Does the AI Act Ensure Trustworthiness?

Wed September 17, 2025 17:45-20:45
Merantix AI Campus, Max-Urich-Straße 3
🇩🇪 Berlin (Germany)

Description

LLMs behave differently with the slightest prompt tweak, context change, or input variation. If you're building anything real with GenAI, you already know the outputs can surprise you, and not in a good way. That's why testing isn't optional; it's essential.

EvalOps Unfiltered is a practical event series for GenAI teams tackling the real-world challenges of evaluating LLM applications. Focused on the emerging field of EvalOps, it goes beyond benchmarks to address unpredictable model behavior, adversarial risks, and production readiness.

Each session features live experiments, tool deep-dives, breakout discussions, and honest conversations about what truly works when deploying LLMs.

What to expect (on September 17):

🔧 Lightning talks from three teams presenting their real testing challenges — the kind that don't show up in research papers

🧠 Breakout sessions where you'll dig deep into one challenge, discuss solutions, share experiences, and test ideas with fellow builders

🍺 Drinks while the conversations continue

No panels, no pitches, just builders sharing what's actually broken and collaborating on what might work. This isn't about theory. It's about the unglamorous, critical work of making GenAI systems reliable enough for the real world.

Target Audience:
- GenAI engineers wrestling with pre-release evaluation
- Technical leads managing LLM-powered products
- Data scientists designing and fine-tuning LLM-based applications
- Product owners responsible for delivering reliable AI-driven features

Categories

Distribution: in-person
Talk language: English
Ticket cost: Free access

Location

Address: Merantix AI Campus, Max-Urich-Straße 3
City: Berlin (Berlin)
Country: 🇩🇪 Germany (Europe)

