-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Submit streaming code and examples #43
Conversation
# Conflicts: # sys/build.rs
@thewh1teagle Didn't see your comment |
/// | ||
/// * `samples` - 样本数据 | ||
/// * `sample_rate` - 采样率 | ||
fn read_wave(filename: &str) -> (Vec<f32>, i32) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Use internal function for that
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Use internal function for that
This is only for testing, and the code logic is not suitable for the framework
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It's useful to keep the example short and simple as possible with only the relevant info to dive in quickly. That's why it's useful to use the internal function like in other examples and even i don't use clap in most of the examples when not necessarily
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This place is just to do safe packaging, not to do higher level packaging, and higher level to use those that were made before. The structs you made before could be structs that were wrapped with this safe.
//! | ||
//! [sherpa-onnx]: https://github.com/k2-fsa/sherpa-onnx | ||
//! [onnxruntime]: https://github.com/microsoft/onnxruntime | ||
//! [Next-gen Kaldi]: https://github.com/k2-fsa/ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove all comments
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ok
/// | ||
/// please refer to | ||
/// https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-paraformer/index.html | ||
/// to download pre-trained models |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove most of the comments in the files. you can add link to specific sherpa docs on top of the file
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ok
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove most of the comments in the files. you can add link to specific sherpa docs on top of the file
Consider moving it to the entire crate document
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I want to avoid maintain documentation as possible. We'll handle that in another pr/issue
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ok
/// Configuration for online/streaming models | ||
/// | ||
/// Please refer to | ||
/// https://k2-fsa.github.io/sherpa/onnx/pretrained_models/online-transducer/index.html |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ok
pub(crate) _marker: PhantomData<T>, | ||
} | ||
|
||
pub trait State {} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why use trait?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why use trait?
Check out the latest code.
Simplify the online_stream parameters in online_recognizer.
@ghjdegithub |
I saw it. |
let (samples, sample_rate) = read_wave(&args.wave_file); | ||
|
||
println!("Initializing recognizer (may take several seconds)"); | ||
let config = OnlineRecognizerConfig { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Simplify configs with impl default like in other examples
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
rust's default means that there can be a default case, but this configuration obviously doesn't. The best practice should be to provide a series of new methods, such as new_ transducer, new_paraformer, etc. default, there really isn't a particular default standard.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I agree but even using new methods for the structus will be too much expressive for irrelevant fields. so maybe it's worth using default and the examples will show how to use it correctly + sherpa-onnx will throw error with info
/// Configuration for online/streaming transducer models | ||
/// | ||
/// Please refer to | ||
/// https://k2-fsa.github.io/sherpa/onnx/pretrained_models/online-transducer/index.html |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Link to models should placed in examples. Keep the source code clean
/// Configuration for online/streaming paraformer models | ||
/// | ||
/// Please refer to | ||
/// https://k2-fsa.github.io/sherpa/onnx/pretrained_models/online-paraformer/index.html |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove most of the comments keep tiny reference maybe just a link
|
||
impl OfflineRecognizer { | ||
/// Frees the internal pointer of the recognition to avoid memory leak. | ||
fn delete(&mut self) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove. You can free it directly in impl drop
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is a private method, and it is currently only used for drop.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You can remove the private delete methods and do the logic directly in drop
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@thewh1teagle I'd recommend keeping it as you might use this private method when writing unit tests.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If we'll need the logic for unit tests we simply can call drop(...) that will call the drop implementation.
} | ||
|
||
/// Free the internal pointer inside the recognizer to avoid memory leak. | ||
fn delete(&mut self) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove but use in impl drop
|
||
impl OfflineStream { | ||
/// Frees the internal pointer of the stream to avoid memory leak. | ||
fn delete(&mut self) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove and keep in impl drop
} | ||
|
||
/// Get the recognition result of the offline stream. | ||
pub fn get_result(&self) -> Option<OfflineRecognizerResult> { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Return result instead of option with bail! Macro to emit errors
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Return result instead of option with bail! Macro to emit errors
Is there an err type for the appropriate result at the moment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Currently we use eyre::{Result, bail}
We can improve it in future in another PR though, I don't see real benefit of using custom errors in the crates at this stage.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Currently we use eyre::{Result, bail} We can improve it in future in another PR though, I don't see real benefit of using custom errors in the crates at this stage.
In rust practice, if you're a library that returns result, it's a good idea to define your own err and result types. This way, sub-documents can see their own error type and related descriptions. The essence is to facilitate the use of the caller.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We'll handle it later. see #46
@thewh1teagle Can you just modify these PRs |
@ghjdegithub |
Next time I used the feature/XXX branch to get it done, so I'll be lazy this time |
Yes, and smaller even tiny PRs :) |
This PR involves too many things, which may be really important, but it’s too difficult for me to review all of them. It’s possible to take parts of it and create smaller, focused PRs to see if they’re a good fit. I’ll close this for now. Thanks! |
No description provided.