#DailyBloggingChallenge (331/365)
The idea that
> people might enjoy listening to a #podcast like approach of evaluating various #books
has been suggested to me.
To keep everything in the #Fediverse, powered by #ActivityPub, the goal is to publish the content on #FunkWhale.
#DailyBloggingChallenge (332/365)
The main way that I evaluate #books, specifically #AudioBooks, is by taking a #VoiceRecording after each chapter, section, or idea.
I have noticed that with #NonFiction books, I can easily listen to them at twice the speed. On the other hand, #fiction books need to be listened to at normal speed.
#DailyBloggingChallenge (333/365)
These recordings would then be transcribed to text using #SpeechToText. That way, private information can be removed without spending too much time editing the audio files directly.
Further, additional information can easily be added. The edited text would then be converted back to audio using #TextToSpeech before finally being published on #FunkWhale.
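The text editing step could look roughly like the sketch below. It assumes the transcript is already available as plain text. The names `PRIVATE_TERMS`, `redact`, and `add_note` are made up for illustration and are not part of any #SpeechToText or #TextToSpeech tool.

```python
import re

# Hypothetical list of private terms; in practice this would be maintained by hand.
PRIVATE_TERMS = ["Alice Example", "Example Street 12"]


def redact(transcript: str, private_terms: list[str], placeholder: str = "[removed]") -> str:
    """Replace every private term (case-insensitive) with a placeholder."""
    # Ignore empty entries and try longer terms first, so a long term
    # wins over a shorter term that it happens to contain.
    ordered = sorted((t for t in private_terms if t), key=len, reverse=True)
    if not ordered:
        return transcript
    pattern = re.compile("|".join(re.escape(t) for t in ordered), re.IGNORECASE)
    return pattern.sub(placeholder, transcript)


def add_note(transcript: str, note: str) -> str:
    """Append an additional remark as its own paragraph."""
    return transcript.rstrip() + "\n\n" + note.strip()


raw = "Chapter 3 reminded me of a talk with Alice Example last week."
clean = redact(raw, PRIVATE_TERMS)
final = add_note(clean, "The chapter also ties back to the introduction.")
print(final)
```

A plain term list only catches what is written down in it, so a manual read-through before the #TextToSpeech step is still needed.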
#DailyBloggingChallenge (362/365)
Originally I wanted to use #VOSK for the #SpeechToText step. I first tried it out through #KdenLive and its 'Speech Recognition' tool.
This took quite a while to set up, since it is not clear which file format, if any, the VOSK model should have. Additionally, the recommended virtual #Python environment didn't work as expected, so I went with the global approach.
In the end I scrapped the whole approach once I realized that transcribing a 26 min audio clip was taking longer than 10 min.
#DailyBloggingChallenge (363/365)
Instead, I opted for #Whisper, which also works with #KdenLive.
Although Whisper was originally written in #Python, there is a #CPP project that makes transcribing very fast. It took less than 2 min to transcribe the 26 min audio clip.
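To put the two timings side by side, here is a small back-of-the-envelope calculation. It only uses the numbers from these posts, which are bounds (more than 10 min, less than 2 min) and not exact measurements. Hardware and model size are not part of the comparison, and the helper `real_time_factor` is a made-up name.

```python
AUDIO_MIN = 26.0               # length of the audio clip in minutes
VOSK_MIN_AT_LEAST = 10.0       # VOSK needed longer than this
WHISPER_CPP_MIN_AT_MOST = 2.0  # the C++ Whisper project needed less than this


def real_time_factor(processing_min: float, audio_min: float) -> float:
    """Processing time divided by audio length; below 1.0 is faster than real time."""
    if audio_min <= 0:
        raise ValueError("audio length must be positive")
    return processing_min / audio_min


# VOSK's real factor lies above this value, Whisper's lies below its value.
vosk_rtf_lower_bound = real_time_factor(VOSK_MIN_AT_LEAST, AUDIO_MIN)
whisper_rtf_upper_bound = real_time_factor(WHISPER_CPP_MIN_AT_MOST, AUDIO_MIN)

# Because both inputs are bounds, this is the smallest possible speedup.
min_speedup = VOSK_MIN_AT_LEAST / WHISPER_CPP_MIN_AT_MOST

print(f"VOSK real-time factor: more than {vosk_rtf_lower_bound:.2f}")
print(f"Whisper real-time factor: less than {whisper_rtf_upper_bound:.2f}")
print(f"Speedup: at least {min_speedup:.0f}x")
```

So on this one clip the switch was worth at least a factor of five, which is only a rough impression and not a benchmark.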