Skip to content
View mst272's full-sized avatar
🙃
🙃

Block or report mst272

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • LLM-Dojo Public

    欢迎来到 LLM-Dojo,这里是一个开源大模型学习场所,使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩‍🎓👨‍🎓

    Python 462 40 Updated Jan 13, 2025
  • trl Public

    Forked from huggingface/trl

    Train transformer language models with reinforcement learning.

    Python Apache License 2.0 Updated Oct 11, 2024
  • mst272 Public

    Config files for my GitHub profile.

    Updated Jun 9, 2024
  • A pytorch Implementation of the Transformer: Attention Is All You Need

    Python 9 2 Updated Jun 7, 2024
  • 从头训练一个小llama

    Updated Mar 24, 2024
  • llama Public

    Forked from meta-llama/llama

    Inference code for Llama models

    Python Other Updated Mar 22, 2024
  • A simple implementation of LoRA+: Efficient Low Rank Adaptation of Large Models

    Python 6 1 Updated Mar 20, 2024
  • About Code release for "Anomaly Transformer: Time Series Anomaly Detection with Association Discrepancy" (ICLR 2022 Spotlight), https://openreview.net/forum?id=LzQQ89U1qm_

    Python MIT License Updated Nov 3, 2023
  • 六爻游戏 + GPT(图一乐)

    Python Updated Oct 16, 2023
  • 🎈塞纳河畔,左岸的咖啡。告白气球,说出心里的小九九。https://ajlovechina.github.io/loveBalloon/.

    CSS Updated Oct 19, 2022