Gmelli
Created page with "An LLM Application Evaluation Task is an AI application evaluation for LLM-based applications.
* <B>Context:</B>
** It can (typically) involve evaluating an LLM's ability to perform specific tasks, such as text classification or sequence generation, against predefined datasets.
** It can (typically) use different types of evaluators, such as correctness or summary evaluators, to score the LLM's output against expected results.
** It can (often) rely on a pre-..."
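
The evaluation loop described in the context items above (run the application over a predefined dataset, then score each output against the expected result) can be sketched as follows. This is a minimal illustration, not the API of any particular evaluation framework: the dataset contents and the names `stub_llm`, `correctness_evaluator`, and `run_evaluation` are all hypothetical, and `stub_llm` is a keyword-based stand-in for a real LLM call.

```python
from typing import Callable, Dict, List

# Hypothetical predefined dataset: each example pairs an input with an expected label.
DATASET: List[Dict[str, str]] = [
    {"input": "I loved this film", "expected": "positive"},
    {"input": "Terrible service", "expected": "negative"},
    {"input": "It was fine", "expected": "neutral"},
]


def stub_llm(text: str) -> str:
    """Stand-in for a real LLM text classifier (assumption: simple keyword rules)."""
    lowered = text.lower()
    if "loved" in lowered:
        return "Positive"
    if "terrible" in lowered:
        return "negative"
    # This stub never predicts "neutral", so the third example is misclassified.
    return "positive"


def correctness_evaluator(output: str, expected: str) -> float:
    """Score 1.0 on a case-insensitive exact match with the expected result, else 0.0."""
    return 1.0 if output.strip().lower() == expected.strip().lower() else 0.0


def run_evaluation(
    app: Callable[[str], str],
    dataset: List[Dict[str, str]],
    evaluator: Callable[[str, str], float],
) -> Dict[str, object]:
    """Run the application on every example and collect per-example and mean scores."""
    scores = [evaluator(app(example["input"]), example["expected"]) for example in dataset]
    mean_score = sum(scores) / len(scores) if scores else 0.0
    return {"scores": scores, "mean_score": mean_score}


report = run_evaluation(stub_llm, DATASET, correctness_evaluator)
print(report)
```

In practice `stub_llm` would be replaced by a call to the application under test. The exact-match evaluator suits classification labels; free-form sequence generation would usually need a different scoring function.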