Discussion about this post

User's avatar
Pawel Jozefiak's avatar

The BDD + Playwright + Claude Code combo is solid. I've been using Playwright for browser automation in my agent workflows and the reusability of steps is a game-changer.

One thing I'd add: when your agent generates and runs tests autonomously, you need a way to see what passed and what broke without digging through terminal output. That visibility problem pushed me to build a proper dashboard for my agent's work: https://thoughts.jock.pl/p/wiz-1-5-ai-agent-dashboard-native-app-2026

The limitation you mention - Claude creating functionally equivalent but different steps - is real. My workaround: strict skill files that enforce naming conventions. Helps with deduplication across sessions.

1 more comment...

No posts

Ready for more?