Skip to content

Actions: huggingface/text-generation-inference

Server Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
2,463 workflow runs
2,463 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

feat: improve qwen2-vl startup
Server Tests #3626: Pull request #2802 synchronize by drbh
January 14, 2025 22:15 4m 24s improve-qwen2-vl-warmup
January 14, 2025 22:15 4m 24s
CI for: Baichuan2-13B does not have max_position_embeddings in config
Server Tests #3625: Pull request #2905 opened by danieldk
January 14, 2025 13:21 4m 11s baichuan2-13b
January 14, 2025 13:21 4m 11s
Baichuan2-13B does not have max_position_embeddings in config
Server Tests #3624: Pull request #2903 synchronize by sywangyi
January 14, 2025 01:24 Action required sywangyi:baichuan2-13b
January 14, 2025 01:24 Action required
Baichuan2-13B does not have max_position_embeddings in config
Server Tests #3623: Pull request #2903 synchronize by sywangyi
January 14, 2025 00:58 Action required sywangyi:baichuan2-13b
January 14, 2025 00:58 Action required
feat: improve star coder to support multi lora layers
Server Tests #3622: Pull request #2883 synchronize by drbh
January 13, 2025 21:53 4m 12s startcoder-support-multi-lora
January 13, 2025 21:53 4m 12s
feat: improve qwen2-vl startup
Server Tests #3621: Pull request #2802 synchronize by drbh
January 13, 2025 18:50 4m 13s improve-qwen2-vl-warmup
January 13, 2025 18:50 4m 13s
flashinfer: switch to plan API
Server Tests #3620: Pull request #2904 opened by danieldk
January 13, 2025 10:06 4m 19s maintenance/flashinfer-plan
January 13, 2025 10:06 4m 19s
Baichuan2-13B does not have max_position_embeddings in config
Server Tests #3619: Pull request #2903 opened by sywangyi
January 13, 2025 06:51 Action required sywangyi:baichuan2-13b
January 13, 2025 06:51 Action required
CI for: fix crash in torch2.6 if TP=1
Server Tests #3618: Pull request #2898 opened by danieldk
January 10, 2025 15:01 7m 31s distributed-fix
January 10, 2025 15:01 7m 31s
CI for: chore: Update jsonschema to 0.28.0
Server Tests #3617: Pull request #2893 synchronize by danieldk
January 10, 2025 11:47 9m 0s update-jsonschema
January 10, 2025 11:47 9m 0s
Update to marlin-kernels 0.3.7
Server Tests #3616: Pull request #2882 synchronize by danieldk
January 9, 2025 15:29 8m 22s marlin-kernels-0.3.7
January 9, 2025 15:29 8m 22s
Flash decoding kernel adding and prefill-chunking and prefix caching enabling in intel cpu/xpu
Server Tests #3615: Pull request #2815 synchronize by sywangyi
January 9, 2025 13:27 Action required sywangyi:flash_decoding
January 9, 2025 13:27 Action required
CI for: chore: Update jsonschema to 0.28.0
Server Tests #3613: Pull request #2893 opened by danieldk
January 9, 2025 09:16 4m 42s update-jsonschema
January 9, 2025 09:16 4m 42s
Basic flashinfer 0.2 support
Server Tests #3612: Pull request #2862 synchronize by danieldk
January 9, 2025 08:28 8m 9s flashinfer-0.2
January 9, 2025 08:28 8m 9s
Basic flashinfer 0.2 support
Server Tests #3611: Pull request #2862 synchronize by danieldk
January 8, 2025 14:50 8m 18s flashinfer-0.2
January 8, 2025 14:50 8m 18s
Improve vlm support (add idefics3 support)
Server Tests #3610: Pull request #2437 synchronize by drbh
January 8, 2025 13:49 6m 27s improve-vlm-support
January 8, 2025 13:49 6m 27s
Basic flashinfer 0.2 support
Server Tests #3609: Pull request #2862 synchronize by danieldk
January 8, 2025 10:06 8m 17s flashinfer-0.2
January 8, 2025 10:06 8m 17s
feat: improve qwen2-vl startup
Server Tests #3608: Pull request #2802 synchronize by drbh
January 7, 2025 22:36 6m 33s improve-qwen2-vl-warmup
January 7, 2025 22:36 6m 33s
Improve vlm support (add idefics3 support)
Server Tests #3607: Pull request #2437 synchronize by drbh
January 7, 2025 22:25 6m 28s improve-vlm-support
January 7, 2025 22:25 6m 28s
Improve vlm support (add idefics3 support)
Server Tests #3606: Pull request #2437 synchronize by drbh
January 7, 2025 22:19 5m 41s improve-vlm-support
January 7, 2025 22:19 5m 41s
Improve vlm support (add idefics3 support)
Server Tests #3605: Pull request #2437 synchronize by drbh
January 7, 2025 22:07 6m 21s improve-vlm-support
January 7, 2025 22:07 6m 21s
Improve vlm support (add idefics3 support)
Server Tests #3604: Pull request #2437 synchronize by drbh
January 7, 2025 22:05 1m 10s improve-vlm-support
January 7, 2025 22:05 1m 10s
Add fp8 kv cache for ROCm
Server Tests #3603: Pull request #2856 synchronize by mht-sharma
January 7, 2025 07:20 8m 17s fp8_kvcache_rocm
January 7, 2025 07:20 8m 17s
feat: improve star coder to support multi lora layers
Server Tests #3601: Pull request #2883 opened by drbh
January 7, 2025 00:23 8m 27s startcoder-support-multi-lora
January 7, 2025 00:23 8m 27s
Basic flashinfer 0.2 support
Server Tests #3600: Pull request #2862 synchronize by danieldk
January 6, 2025 16:08 7m 7s flashinfer-0.2
January 6, 2025 16:08 7m 7s