Improve readability of the quick tour. #501

vxw3t8fhjsdkghvbdifuk · 2025-01-14T20:15:33Z

No description provided.

vxw3t8fhjsdkghvbdifuk · 2025-01-14T20:27:49Z

Actually, I'm not sure running 2 models at the same time is true. When I run

lighteval accelerate \
"pretrained=HuggingFaceTB/SmolLM2-135M-Instruct,pretrained=HuggingFaceTB/SmolLM2-360M-Instruct" \
"leaderboard|truthfulqa:mc|0|0"

it only evaluates the second model. It seems the code doc in

lighteval/src/lighteval/main_accelerate.py

Line 47 in a7aa6ed

    
           help="Model arguments in the form key1=value1,key2=value2,... or path to yaml config file (see examples/model_configs/transformers_model.yaml)"

is not correct?

clefourrier

In the doc you're referring to, you can see that the keys are not the same - you cannot evaluate 2 models at the same time, but you can specify a range of parameters to use for one model however

clefourrier · 2025-01-15T07:01:25Z

docs/source/quicktour.mdx


-Tasks details can be found in the
+All supported tasks can be found at the [tasks_list](available-tasks). For more details, you can have a look at the


We also support the tasks that are community provided in the extended folder

clefourrier · 2025-01-15T07:02:07Z

docs/source/quicktour.mdx

-the [tasks_list](available-tasks) in the format:
+Here, the first argument specifies which model(s) to run, and the second argument specifies how to evaluate them.
+
+Multiple models can be evaluated at the same time by using a comma-separated list. For example:


Nope, we can only evaluate one model at a time - however we can specifiy precision, peft weights, ...

clefourrier · 2025-01-15T07:03:03Z

docs/source/quicktour.mdx

+The task specification might be a bit hard to grasp as first. The format is as follows:
+
+```bash
+{suite}|{task}|{num_few_shot}|{0 or 1 to automatically reduce `num_few_shot` if prompt is too long}


automatically adapt the number of few shot examples presented to the model if the prompt is too long for the context size of the task or the model

(I would add this explanation on antoher line)

clefourrier · 2025-01-22T10:05:10Z

docs/source/quicktour.mdx

+The syntax for the task specification might be a bit hard to grasp at first. The format is as follows:
+
+```txt
+{suite}|{task}|{num_few_shot}|{0 for strict `num_few_shots`, or 1 to allow a reduction}


Suggested change

{suite}|{task}|{num_few_shot}|{0 for strict `num_few_shots`, or 1 to allow a reduction}

{suite}|{task}|{num_few_shot}|{0 for strict `num_few_shots`, or 1 to allow a truncation if context size is too small}

clefourrier · 2025-01-22T10:05:57Z

docs/source/quicktour.mdx

-Tasks details can be found in the
+All officially supported tasks can be found at the [tasks_list](available-tasks).
+Moreover, community-provided tasks can be found in the
+[extended folder](https://github.com/huggingface/lighteval/tree/main/src/lighteval/tasks/extended) and the


Extended are not community provided but maintainer provided, however they are tasks which require added logic (such as an LLM as judge, or redefine new metrics like IFEval)

clefourrier

One last nit then we're good!

Improve readability of the quick tour.

d5feb23

clefourrier reviewed Jan 15, 2025

View reviewed changes

vxw3t8fhjsdkghvbdifuk added 5 commits January 21, 2025 11:44

update based on feedback

b3d878c

delete superfluous edit of float16

6df1aad

deleted , for no reason

dfd2a40

Merge branch 'main' into patch-2

6045d9d

reorganize headers

02b3e30

clefourrier reviewed Jan 22, 2025

View reviewed changes

clefourrier approved these changes Jan 22, 2025

View reviewed changes

vxw3t8fhjsdkghvbdifuk added 2 commits January 23, 2025 09:27

fix nit

6590cdb

closing bracket

381beb2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve readability of the quick tour. #501

Improve readability of the quick tour. #501

vxw3t8fhjsdkghvbdifuk commented Jan 14, 2025

vxw3t8fhjsdkghvbdifuk commented Jan 14, 2025 •

edited

Loading

clefourrier left a comment

clefourrier Jan 15, 2025

clefourrier Jan 15, 2025

clefourrier Jan 15, 2025

clefourrier Jan 15, 2025

clefourrier Jan 22, 2025

clefourrier Jan 22, 2025

clefourrier left a comment


		Tasks details can be found in the
		All supported tasks can be found at the [tasks_list](available-tasks). For more details, you can have a look at the

	{suite}\|{task}\|{num_few_shot}\|{0 for strict `num_few_shots`, or 1 to allow a reduction}
	{suite}\|{task}\|{num_few_shot}\|{0 for strict `num_few_shots`, or 1 to allow a truncation if context size is too small}

Improve readability of the quick tour. #501

Are you sure you want to change the base?

Improve readability of the quick tour. #501

Conversation

vxw3t8fhjsdkghvbdifuk commented Jan 14, 2025

vxw3t8fhjsdkghvbdifuk commented Jan 14, 2025 • edited Loading

clefourrier left a comment

Choose a reason for hiding this comment

clefourrier Jan 15, 2025

Choose a reason for hiding this comment

clefourrier Jan 15, 2025

Choose a reason for hiding this comment

clefourrier Jan 15, 2025

Choose a reason for hiding this comment

clefourrier Jan 15, 2025

Choose a reason for hiding this comment

clefourrier Jan 22, 2025

Choose a reason for hiding this comment

clefourrier Jan 22, 2025

Choose a reason for hiding this comment

clefourrier left a comment

Choose a reason for hiding this comment

vxw3t8fhjsdkghvbdifuk commented Jan 14, 2025 •

edited

Loading