Sports Specialist
1. Goal & Description:
We are currently working on building quality for cricket-related queries on AI Mode (AIM).
Building AIM quality on cricket requires “on the go” evaluations of the live game, along with evaluations around pre-game and post-game moments and of general cricket queries about a league, team, or athlete. Together, these will help us understand the queries that fans associate with a game, league, team, and athlete.
The core objective is for raters to simulate the experience of a dedicated cricket fan: identifying and evaluating queries in real time, engaging with AIM before and after the game, and asking general cricket queries about the league, athletes, and so on.
This process will include side-by-side comparisons across multiple platforms, with raters capturing screenshots of their findings and evaluating them on helpfulness and factuality.
2. Data to be Evaluated:
We request that raters primarily focus on generating "live" queries as they actively watch, track, or report on the game(s), specifically the type of questions or prompts a dedicated sports fan would genuinely issue. We also request submission of contextually relevant queries in these buckets:
Specific Mechanism and Request
1. Issue every query simultaneously on both AIM and ChatGPT.
2. Record which version of ChatGPT is being used (with a screenshot).
3. Provide a single-sided rating for each response, evaluating both helpfulness and factuality, accompanied by detailed reasoning for each.
4. Deliver a side-by-side win/loss/neutral rating for each pair of responses.
5. Complete this process for the maximum feasible number of queries within the "game window," which extends from a short time before the game begins to approximately 30–60 minutes after its conclusion.
No. of queries: 100 per game
Language: English & Hindi
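The mechanism above could be captured in a simple per-query record. The following is only a sketch under stated assumptions, not a prescribed tool: the 1-5 rating scale, the field names, the 15-minute lead before the game, and the convention that the win/loss/neutral verdict is recorded from AIM's point of view are all hypothetical, since the brief does not specify them. The 60-minute tail is taken from the upper end of the 30–60 minute range.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import datetime, timedelta
from enum import Enum


class Verdict(Enum):
    """Side-by-side outcome, recorded from AIM's point of view (assumption)."""
    WIN = "win"
    LOSS = "loss"
    NEUTRAL = "neutral"


@dataclass
class SingleSidedRating:
    helpfulness: int      # assumed 1-5 scale; the brief does not fix a scale
    factuality: int       # assumed 1-5 scale
    reasoning: str        # detailed reasoning accompanies each rating
    screenshot_path: str  # hypothetical location of the captured screenshot


@dataclass
class QueryEvaluation:
    query: str
    language: str         # "English" or "Hindi"
    issued_at: datetime   # when the query was issued on both platforms
    chatgpt_version: str  # as shown in the rater's screenshot
    aim: SingleSidedRating
    chatgpt: SingleSidedRating
    verdict: Verdict


def in_game_window(ts, game_start, game_end,
                   lead=timedelta(minutes=15),   # "a short time before" (assumed value)
                   tail=timedelta(minutes=60)):  # upper end of the 30-60 minute range
    """Return True if ts falls inside the game window."""
    return game_start - lead <= ts <= game_end + tail


def tally(evaluations):
    """Count side-by-side verdicts across a list of QueryEvaluation records."""
    counts = Counter(e.verdict for e in evaluations)
    return {v.value: counts.get(v, 0) for v in Verdict}
```

With records in this shape, a rater or reviewer could check that each query was issued inside the game window and summarize the side-by-side outcomes per game with `tally`.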