Skip to content
View ykim362's full-sized avatar

Organizations

@microsoft @marian-nmt

Block or report ykim362

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 6,449 773 Updated Feb 1, 2025

GRadient-INformed MoE

261 18 Updated Sep 25, 2024

State-of-the-art LLM-based translation models.

Ruby 480 38 Updated Jan 24, 2025
Ruby 84 8 Updated Jun 12, 2023
Python 28 3 Updated May 20, 2022

FastFormers - highly efficient transformer models for NLU

Python 703 53 Updated Jan 14, 2024

Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.

Python 253 30 Updated Nov 2, 2022

A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models

Python 675 52 Updated Sep 13, 2023

Transformer based on a variant of attention that is linear complexity in respect to sequence length

Python 731 68 Updated May 5, 2024

My take on a practical implementation of Linformer for Pytorch.

Python 411 37 Updated Jul 27, 2022

Repository for the paper "Fast and Accurate Deep Bidirectional Language Representations for Unsupervised Learning"

Python 109 20 Updated Nov 9, 2020

🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.

Python 6 3 Updated Jun 22, 2020

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 15,455 3,021 Updated Feb 3, 2025

int8_t and int16_t matrix multiply based on https://arxiv.org/abs/1705.01991

C++ 67 22 Updated Dec 30, 2023

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 2 4 Updated Jun 6, 2023

Fast Neural Machine Translation in C++ - development repository

C++ 258 127 Updated Oct 18, 2024

Fast Neural Machine Translation in C++

C++ 1,275 234 Updated Aug 25, 2023

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 1,249 529 Updated Feb 2, 2025

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,872 6,448 Updated Jan 9, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 86,431 23,267 Updated Feb 3, 2025

Create and modify Word documents with Python

Python 4,771 1,149 Updated Aug 20, 2024

Models and examples built with TensorFlow

Python 2,903 1,161 Updated Dec 14, 2019

oneAPI Deep Neural Network Library (oneDNN)

C++ 3,703 1,022 Updated Feb 1, 2025

Oxford Deep NLP 2017 course

15,698 3,572 Updated Jul 2, 2023

Sequence-to-sequence framework with a focus on Neural Machine Translation based on PyTorch

Python 1,213 323 Updated Oct 24, 2024

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

C++ 20,788 6,777 Updated Oct 25, 2023

Implementing Recurrent Neural Network from Scratch

Python 483 151 Updated May 28, 2018
Showing results