Stars
Visualization of cache-optimized matrix multiplication
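For context, the cache optimization such visualizations typically depict is blocked (tiled) multiplication: the matrices are processed in small tiles that fit in cache so the inner loops can reuse data before it is evicted. A minimal illustrative sketch, not taken from that repo (block size and function name are arbitrary):

```python
# Illustrative sketch of blocked ("tiled") matrix multiplication.
# Working on BLOCK x BLOCK tiles keeps the active data small enough to
# stay in cache while the inner loops reuse it.
import numpy as np

def blocked_matmul(A: np.ndarray, B: np.ndarray, block: int = 64) -> np.ndarray:
    n, k = A.shape
    k2, m = B.shape
    assert k == k2, "inner dimensions must match"
    C = np.zeros((n, m), dtype=A.dtype)
    for i0 in range(0, n, block):
        for j0 in range(0, m, block):
            for k0 in range(0, k, block):
                # multiply one pair of tiles and accumulate into the output tile
                C[i0:i0 + block, j0:j0 + block] += (
                    A[i0:i0 + block, k0:k0 + block] @ B[k0:k0 + block, j0:j0 + block]
                )
    return C
```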
A comprehensive set of LLM benchmark scores and provider prices.
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
Benchmarking Benchmark Leakage in Large Language Models
aider is AI pair programming in your terminal
The central repo for Creole-based NLU and NLG work
Make awesome display tables using Python.
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
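To give a sense of what "all in Python" means in practice, a minimal Gradio app wraps a plain Python function in a web UI; the function and labels below are placeholders:

```python
# Minimal Gradio app: wrap a Python function in a web UI and launch it locally.
import gradio as gr

def greet(name: str) -> str:
    return f"Hello, {name}!"

demo = gr.Interface(fn=greet, inputs="text", outputs="text")

if __name__ == "__main__":
    demo.launch()  # serves the app at http://127.0.0.1:7860 by default
```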
Hallucinations (Confabulations) Document-Based Benchmark for RAG
An extremely fast Python package and project manager, written in Rust.
Machine Learning Engineering Open Book
BigCodeBench: Benchmarking Code Generation Towards AGI
Website for hosting the Open Foundation Models Cheat Sheet.
Datasets from the paper "Towards Understanding Sycophancy in Language Models"
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM
A benchmark to evaluate language models on questions I've previously asked them to solve.
Doing simple retrieval from LLMs at various context lengths to measure accuracy
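The underlying "needle in a haystack" setup can be sketched as follows; this is an illustrative reconstruction rather than the repo's actual code, and the filler text, needle, and function names are made up:

```python
# Sketch of a needle-in-a-haystack retrieval test: bury a known fact ("needle")
# at a chosen depth inside long filler text, then ask the model to retrieve it.
NEEDLE = "The secret passphrase is 'blue-harvest-42'."
QUESTION = "What is the secret passphrase?"

def build_prompt(filler_sentence: str, total_sentences: int, depth: float) -> str:
    """Insert the needle at `depth` (0.0 = start, 1.0 = end) of the haystack."""
    haystack = [filler_sentence] * total_sentences
    haystack.insert(int(depth * total_sentences), NEEDLE)
    context = " ".join(haystack)
    return f"{context}\n\nQuestion: {QUESTION}\nAnswer:"

prompt = build_prompt("The sky was a pale shade of grey that morning.", 2000, depth=0.5)
# The prompt is then sent to the model under test and the reply is checked for
# the passphrase, giving a retrieval-accuracy score at this length and depth.
```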
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
A natural language interface for computers
Scrape and export data from the Open LLM Leaderboard.
List of papers on hallucination detection in LLMs.
This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.