Actions: huggingface/text-generation-inference

Server Tests

2,464 workflow runs

Basic flashinfer 0.2 support
Server Tests #3600: Pull request #2862 synchronize by danieldk
January 6, 2025 16:08 7m 7s flashinfer-0.2
Update to marlin-kernels 0.3.7
Server Tests #3599: Pull request #2882 opened by danieldk
January 6, 2025 16:03 8m 26s marlin-kernels-0.3.7
Enable qwen2vl video
Server Tests #3598: Pull request #2756 synchronize by drbh
January 3, 2025 16:01 8m 35s enable-qwen2vl-video
Add fp8 kv cache for ROCm
Server Tests #3597: Pull request #2856 synchronize by mht-sharma
January 3, 2025 11:58 8m 30s fp8_kvcache_rocm
Enable qwen2vl video
Server Tests #3595: Pull request #2756 synchronize by drbh
December 23, 2024 18:47 9m 2s enable-qwen2vl-video
Improve vlm support (add idefics3 support)
Server Tests #3594: Pull request #2437 synchronize by drbh
December 23, 2024 14:40 8m 46s improve-vlm-support
Basic flashinfer 0.2 support
Server Tests #3593: Pull request #2862 synchronize by danieldk
December 22, 2024 13:04 6m 51s flashinfer-0.2
Basic flashinfer 0.2 support
Server Tests #3592: Pull request #2862 opened by danieldk
December 22, 2024 12:24 9m 2s flashinfer-0.2
Improve vlm support (add idefics3 support)
Server Tests #3591: Pull request #2437 synchronize by drbh
December 21, 2024 00:27 7m 42s improve-vlm-support
Flash decoding kernel adding and prefill-chunking and prefix caching enabling in intel cpu/xpu
Server Tests #3590: Pull request #2815 synchronize by sywangyi
December 20, 2024 04:53 Action required sywangyi:flash_decoding
fix: include add_special_tokens in kserve request
Server Tests #3589: Pull request #2859 opened by drbh
December 19, 2024 21:54 8m 27s kserve-request-patch
Efficient Transformers backend support
Server Tests #3588: Pull request #2858 synchronize by Cyrilvallez
December 19, 2024 17:49 4m 16s Cyrilvallez:transformers-backend
Efficient Transformers backend support
Server Tests #3587: Pull request #2858 opened by Cyrilvallez
December 19, 2024 17:47 1m 56s Cyrilvallez:transformers-backend
Improve vlm support (add idefics3 support)
Server Tests #3585: Pull request #2437 synchronize by drbh
December 19, 2024 01:54 9m 9s improve-vlm-support
Improve vlm support (add idefics3 support)
Server Tests #3584: Pull request #2437 synchronize by drbh
December 18, 2024 14:58 8m 52s improve-vlm-support
Add fp8 kv cache for ROCm
Server Tests #3583: Pull request #2856 opened by mht-sharma
December 18, 2024 14:56 8m 44s fp8_kvcache_rocm
Add Flash decoding kernel ROCm
Server Tests #3582: Pull request #2855 opened by mht-sharma
December 18, 2024 12:50 8m 14s flash_decoding_rocm
Enable FP8 Per-Tensor Scales and Integrate Marlin/MoE Kernels Repo for ROCm
Server Tests #3581: Pull request #2825 synchronize by mht-sharma
December 18, 2024 12:15 7m 0s rocm-fp8-tensorwise
Enable FP8 Per-Tensor Scales and Integrate Marlin/MoE Kernels Repo for ROCm
Server Tests #3580: Pull request #2825 synchronize by mht-sharma
December 18, 2024 12:05 7m 36s rocm-fp8-tensorwise
Enable FP8 Per-Tensor Scales and Integrate Marlin/MoE Kernels Repo for ROCm
Server Tests #3579: Pull request #2825 synchronize by mht-sharma
December 18, 2024 10:50 8m 39s rocm-fp8-tensorwise
Improve vlm support (add idefics3 support)
Server Tests #3577: Pull request #2437 synchronize by drbh
December 18, 2024 05:06 8m 39s improve-vlm-support
Improve vlm support (add idefics3 support)
Server Tests #3576: Pull request #2437 synchronize by drbh
December 18, 2024 03:25 9m 10s improve-vlm-support
Enable qwen2vl video
Server Tests #3575: Pull request #2756 synchronize by drbh
December 18, 2024 01:41 7m 7s enable-qwen2vl-video
Improve vlm support (add idefics3 support)
Server Tests #3574: Pull request #2437 synchronize by drbh
December 18, 2024 01:36 13m 53s improve-vlm-support