Multi-app medium multi-003 · opus 4.6 · vision+xml
Open FreshCart and order salmon, spinach, and blueberries from Whole Foods. Calculate a 20% tip on the grocery total, then send a SplitPay request to Leo Chen for half the total (including tip) with the note 'dinner party groceries'. What's the grocery total and tip amount, and confirm the SplitPay request was sent.
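The arithmetic the task asks for can be sketched as follows. This is a minimal illustration, not the agent's or the judge's method: the function name `tip_and_split`, the half-up rounding to cents, and the 50.00 grocery total are all hypothetical, since the actual FreshCart total is not shown on this page.

```python
from decimal import Decimal, ROUND_HALF_UP

CENT = Decimal("0.01")  # round money to whole cents (assumed rounding rule)


def tip_and_split(grocery_total, tip_rate=Decimal("0.20"), people=2):
    """Return (tip, total_with_tip, share_per_person) as Decimals."""
    total = Decimal(grocery_total)
    # 20% tip computed on the grocery total only
    tip = (total * tip_rate).quantize(CENT, rounding=ROUND_HALF_UP)
    total_with_tip = total + tip
    # Half of the total including tip is the amount to request
    share = (total_with_tip / people).quantize(CENT, rounding=ROUND_HALF_UP)
    return tip, total_with_tip, share


# Hypothetical grocery total, for illustration only
tip, total_with_tip, share = tip_and_split("50.00")
print(tip, total_with_tip, share)  # 10.00 60.00 30.00
```

With a real order, the grocery total read from the FreshCart confirmation would replace the placeholder, and `share` would be the amount entered in the SplitPay request to Leo Chen.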
50 steps · 820s wall time · 9 rubric criteria · ✗ 40% score
Rubric · 9 criteria
40% · 4/9 satisfied