The basis idea of this project is not set a deep leaning, this is cool... But we want to look at the next step by using non common way to create a new methode to "computer thinking".

We need to start dy a simple way, then we will start with audio IO then here we will base the system of audio recognition (not speech only) and some basic control.

Ideas of architeture:

The basis data source is caming from some sources:

The objective is to use the system remotely with a personal server (communication with REST API):

Global organisation (WIP)

Basic Deep learning:

Final library objective:

AI with some capabilities:

learn new word
write simple program in interpreted language
translate sentence
play music, video on other application

Generation step:

Normalyze the origin corpus:

./normalizer_corpus/audio-reco-corpus/preprocessCorpus.py --input ../audio-reco-corpus-french/ --output ../normalyse-corpus/audio-reco

Some usefull library:

Voice Activity Detection (VAD): https://github.com/jtkim-kaist/VAD

Audio Denoise: https://github.com/jtkim-kaist/Speech-enhancement

Natural Language Processing (NLP): https://github.com/zalandoresearch/flair

The DBLP Computer Science Bibliography: https://datahub.io/collections/yago

License (MPL v2.0)

Licensed under the Mozilla Public License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

https://www.mozilla.org/MPL/2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.