No description

Python 65.7%
Nix 30.4%
Dockerfile 3.9%

Find a file

SonOfLope 3907273045 configures device type for model usage		2024-07-18 20:31:11 -04:00
.github/workflows	Adds pipeline for build/push docker image and Dockerfile	2024-07-17 15:59:31 -04:00
.gitignore	adds import	2024-07-18 18:05:44 -04:00
Dockerfile	Adds pipeline for build/push docker image and Dockerfile	2024-07-17 15:59:31 -04:00
flake.lock	feat: make a Flask app for audio transcription	2024-06-20 20:23:30 -04:00
flake.nix	feat: make a Flask app for audio transcription	2024-06-20 20:23:30 -04:00
LICENSE	Initial commit	2024-05-30 19:44:58 -04:00
point.py	configures device type for model usage	2024-07-18 20:31:11 -04:00
README.md	feat: make a Flask app for audio transcription	2024-06-20 20:23:30 -04:00
requirements.txt	Adds pipeline for build/push docker image and Dockerfile	2024-07-17 15:59:31 -04:00

README.md

point

Transcription d'audio en texte grâce à Whisper.

Utilisation

point.py est un service qui répond aux requêtes POST au chemin /transcript avec le texte correspondant à la transcription du fichier audio donné.

À la première requête, si le modèle Whisper n'est pas présent, il sera téléchargé dans le dossier whisper-models du dossier courant.

Par exemple,

$ python3 point.py --port 5000

Et dans un autre terminal:

$ curl -X POST -T '12 Still Alive.mp3' -H 'Content-Type: audio/mp3' http://127.0.0.1:5000/transcript
2024-06-20 19:54:40, transcript with base

 This was a triumph.
 I'm making a note here, huge success.
 It's hard to always date my sad inspection.
...

Ce n'est pas parfait, mais si le modèle utilisé est plus gros que base (comme distil-large-v3), ça devrait mieux fonctionner. Le langage est détecté automatiquement.