Every time you put your input you are failing.
AI will tell you to do it yourself. Force AI to automate everything by giving every possible tool: APIs, UI automation, and even Human API.
Use easy feedback loops like curl to check your webapp, more complex like Playwright. Try to find the perfect spot between token costs/speed to create the feedback loop. There's always a feedback loop you can insert to let the AI self-check and understand.