Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Staging #48

Open
wants to merge 59 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
59 commits
Select commit Hold shift + click to select a range
9e31425
add: tokenizer and data gen
daniEL2371 Aug 6, 2021
990def5
fix: label length and input length
daniEL2371 Aug 6, 2021
e665090
fix: label length and input length
daniEL2371 Aug 6, 2021
5eefd44
fix: update type of audio input to floar
daniEL2371 Aug 6, 2021
a9b09be
saved the trained model
rasyosef Aug 7, 2021
be181de
saved the character encoder
rasyosef Aug 7, 2021
fa85a9b
Merge branch 'dev-yosef' of https://github.com/Amharic-STT/STT-engine…
rasyosef Aug 8, 2021
4bd688d
data loading and preprocessing modules
rasyosef Aug 8, 2021
6661467
encoding module and saved encoder
rasyosef Aug 8, 2021
14b08a6
data loading and preprocessing modules
rasyosef Aug 8, 2021
ca5fd46
modified feature extraction functions
rasyosef Aug 8, 2021
682aff2
moved model creation to models,py
rasyosef Aug 8, 2021
97e6b63
modular version of Amharic_STT
rasyosef Aug 8, 2021
085c304
mlflow parameter logging
rasyosef Aug 8, 2021
0e2d8f6
simple rnn model trained on sample data
daniEL2371 Aug 8, 2021
1b80b44
mlflow parameter logging
rasyosef Aug 8, 2021
9683378
dev-daniel
daniEL2371 Aug 8, 2021
8291523
simple rnn model trained
daniEL2371 Aug 8, 2021
c3a438c
refactor model building in a notebook
daniEL2371 Aug 8, 2021
fbedd55
extracted functions into separate modules
daniEL2371 Aug 9, 2021
174d174
modified preprocessing functions
rasyosef Aug 9, 2021
9cf2cf8
modified training notebook
rasyosef Aug 9, 2021
31ab1b6
modified requirements.txt
rasyosef Aug 9, 2021
34bc7ed
modified readme
rasyosef Aug 9, 2021
fbc29cb
.
rasyosef Aug 9, 2021
e3e82c1
added batch training
rasyosef Aug 9, 2021
ede46fd
added mel spectrogram saving function
rasyosef Aug 9, 2021
cd221db
added model training python file
rasyosef Aug 10, 2021
5e800d7
model building using bi rnn
daniEL2371 Aug 10, 2021
e097212
Merge pull request #35 from Amharic-STT/dev-yosef
Azariagmt Aug 10, 2021
87e7ccd
better model
rasyosef Aug 10, 2021
6b639a6
model building using LSTM and bi rnnn
daniEL2371 Aug 11, 2021
3a43f85
model building using CNN and bi rnnn
daniEL2371 Aug 11, 2021
d387b04
scripts updated
daniEL2371 Aug 11, 2021
cc63e53
updated ctc loss module to a class
daniEL2371 Aug 11, 2021
ce7f699
used a better model
daniEL2371 Aug 11, 2021
fc1ccb3
used a better model
daniEL2371 Aug 11, 2021
082f8ee
add preporcess model in model2 module
daniEL2371 Aug 11, 2021
0392cd1
refactor predict, train and build function to model_helper
daniEL2371 Aug 11, 2021
0bdb4c2
add: chat to int mapping as pkl objects
daniEL2371 Aug 12, 2021
ab46f4f
Merge branch 'staging' into dev-daniel
Azariagmt Aug 12, 2021
7d926b8
model inference script file added
daniEL2371 Aug 12, 2021
32f5e01
model inference script file added
daniEL2371 Aug 12, 2021
f4bd194
Merge branch 'dev-daniel' of github.com:Amharic-STT/STT-engine into d…
daniEL2371 Aug 12, 2021
b2e4841
Merge pull request #41 from Amharic-STT/dev-daniel
Azariagmt Aug 12, 2021
227a477
add web_app.py
Bethelsis Aug 12, 2021
c32c37f
Merge branch 'main' into HEAD
Bethelsis Aug 12, 2021
807806d
Merge branch 'dev-yosef' of https://github.com/Amharic-STT/STT-engine…
Bethelsis Aug 12, 2021
6303f57
add new_model
Bethelsis Aug 12, 2021
5b8f05e
Add pages
Bethelsis Aug 12, 2021
bd5f5ee
Add new_model.py
Bethelsis Aug 12, 2021
5084345
Add test_model.py
Bethelsis Aug 12, 2021
0c42fa3
model_summary.py
Bethelsis Aug 12, 2021
0b2ba0b
update web_app.py
Bethelsis Aug 12, 2021
a7c0aeb
remove files
Bethelsis Aug 12, 2021
4eb7b38
Add app.py
Bethelsis Aug 12, 2021
bed1227
remove file
Bethelsis Aug 12, 2021
5313cb1
Add new model
Bethelsis Aug 12, 2021
4f70426
Merge pull request #43 from Amharic-STT/dev-Bethelhem
Bethelsis Aug 12, 2021
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Binary file modified .DS_Store
Binary file not shown.
2 changes: 2 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
@@ -1 +1,3 @@
/data
/notebooks/mlruns/
/models/*
26 changes: 5 additions & 21 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,25 +6,9 @@

<p>Our responsibility was to build a deep learning model that is capable of transcribing a speech to text in the Amharic language. The model we produce will be accurate and is robust against background noise.</p>

## Code
The code of our analysis can be found in the **notebooks** folder. The data preprocessing and visualization, and model training parts can be found in the **Amharic_STT_preprocessing.ipynb** jupyter notebook. This notebook can be run in google colab. The **Amharic_Speech_To_Text.ipynb** contains a modularized version of the first notebook. The **scripts** folder contains the data loading and preprocessing functions. The trained models will be stored in the **models** folder.

Structure
├── logs
├── modules
├── notebooks
├── tests
└── Dockerfile

# Contributors

* [Azaria Tamrat](https://github.com/Azariagmt)
* [Bethelhem Sisay](https://github.com/Bethelsis)
* [Daniel Zelalem](https://github.com/daniEL2371)
* [Dorothy Cheruiyot](https://github.com/Doro97)
* [Eliphaz Niyodusenga]()
* [Elizabeth Nanjala]()
* [Natneal Teshome](https://github.com/Natty-star)
* [UWASE Rachel](https://github.com/ntabanarachel)
* [Yosef Alemneh](https://github.com/mozartofmath)



## Dependencies
To install the necessary dependencies, execute the command
```$ pip install -r requirements.txt"```
35 changes: 35 additions & 0 deletions app.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,35 @@
import streamlit as st
import awesome_streamlit as ast
import scripts.pages.recorded_audio
import scripts.pages.record_audio
import scripts.pages.home
import scripts.pages.model_summary

# create the pages
PAGES = {
"Home" : scripts.pages.home,
"Model Summary" : scripts.pages.model_summary,
"Choose Audio": scripts.pages.recorded_audio,
"Record your own voice": scripts.pages.record_audio,

}


# render the pages
def main():

st.sidebar.title("Navigation")
selection = st.sidebar.radio("Go to", list(PAGES.keys()))

page = PAGES[selection]
with st.spinner(f"Loading {selection} ..."):
ast.shared.components.write_page(page)
st.sidebar.title("About")
st.sidebar.info(
"""
This app is an end-to-end solution that is capable of transcribing a speech to text in the Amharic language.
"""
)

if __name__ == "__main__":
main()
Binary file added char_to_int.pkl
Binary file not shown.
Binary file added int_to_char.pkl
Binary file not shown.
Binary file added models/amharic_stt_mfcc.h5
Binary file not shown.
Binary file added models/encoder.pkl
Binary file not shown.
Binary file added models/new_model_v1_2000.h5
Binary file not shown.
Binary file added models/new_model_v1_6000.h5
Binary file not shown.
Binary file modified notebooks/.DS_Store
Binary file not shown.
1,229 changes: 1,229 additions & 0 deletions notebooks/.ipynb_checkpoints/Amharic_Speech_To_Text-checkpoint.ipynb

Large diffs are not rendered by default.

Loading