Wave Field LLM — O(n log n) attention via wave equation dynamics

I've been working on an alternative attention mechanism that treats language as a physical field system instead of using standard O(n²) self-attention.

How it works:

  • Tokens are mapped onto a continuous 1D field

  • Information propagates via damped wave equations: k(t) = exp(-α·t)·cos(ω·t + φ)

  • Each attention head has just 3 learnable physics parameters (frequency, damping, phase)

  • Convolution is computed via FFT in O(n log n) (a minimal sketch follows this list)

  • Heads self-organize into different roles (local grammar, medium context, long-range)
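
For concreteness, here is a minimal NumPy sketch of the mechanism described above. It is my own illustration, not code from the repo; the function names (wave_kernel, causal_fft_conv) and the parameter values are placeholder assumptions. It builds the damped wave kernel k(t) = exp(-α·t)·cos(ω·t + φ) and applies it as a causal convolution via FFT in O(n log n).

```python
import numpy as np

def wave_kernel(length, alpha, omega, phi):
    """Damped wave kernel k(t) = exp(-alpha*t) * cos(omega*t + phi) for t = 0..length-1."""
    t = np.arange(length)
    return np.exp(-alpha * t) * np.cos(omega * t + phi)

def causal_fft_conv(x, kernel):
    """Causal convolution of a (seq_len, d_model) signal with a 1D kernel via FFT.

    Zero-padding to 2n turns the circular FFT convolution into a linear one,
    so position i only mixes information from positions <= i.
    """
    n, d = x.shape
    L = 2 * n
    K = np.fft.rfft(kernel, L)            # (L//2 + 1,)
    X = np.fft.rfft(x, L, axis=0)         # (L//2 + 1, d)
    y = np.fft.irfft(X * K[:, None], L, axis=0)
    return y[:n]

# Toy usage: one "head" with three physics parameters (damping, frequency, phase).
seq_len, d_model = 16, 8
x = np.random.randn(seq_len, d_model)
k = wave_kernel(seq_len, alpha=0.1, omega=0.5, phi=0.0)
print(causal_fft_conv(x, k).shape)        # (16, 8)
```

In this sketch, only α, ω, and φ would be learned per head; the O(n log n) cost comes entirely from the FFTs.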

Results (WikiText-2, 6M params, character tokenizer):

Model                  PPL   Accuracy   Complexity
Standard Transformer   5.9   51.0%      O(n²)
Wave Field V3.5        6.2   50.5%      O(n log n)

At longer sequences the savings grow: 31x at 2K tokens, 107x at 8K, 367x at 32K.

Known limitations:

  • With a BPE tokenizer (8K vocab), there is a significant capacity gap versus the standard transformer

  • This is a model capacity issue at small scale, not an architecture flaw

  • Currently scaling to 100M params to see if the gap closes

What's unique:

  • Every bug during development was found through physics-based diagnostics (energy flow, conservation, causality tests), not guessing; a toy causality check is sketched after this list

  • Cross-head field coupling and wave interference for information routing

  • Not a Mamba/Hyena variant — different approach entirely
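
To make the causality-test bullet concrete, here is a small, self-contained check in the same spirit. Again, this is my own sketch rather than the repo's diagnostics, and wave_mix is a hypothetical stand-in for one wave-field head: perturb the token at position j and assert that no output before j changes.

```python
import numpy as np

def wave_mix(x, alpha=0.1, omega=0.5, phi=0.0):
    """Stand-in for one wave-field head: causal FFT convolution with a damped wave kernel."""
    n, _ = x.shape
    t = np.arange(n)
    k = np.exp(-alpha * t) * np.cos(omega * t + phi)
    L = 2 * n
    y = np.fft.irfft(np.fft.rfft(x, L, axis=0) * np.fft.rfft(k, L)[:, None], L, axis=0)
    return y[:n]

# Causality check: perturbing token j must leave every output at positions < j unchanged.
rng = np.random.default_rng(0)
x = rng.normal(size=(32, 8))
j = 20

x_perturbed = x.copy()
x_perturbed[j] += 1.0

leak = np.abs(wave_mix(x)[:j] - wave_mix(x_perturbed)[:j]).max()
assert leak < 1e-9, f"future information leaked into the past: {leak}"
print("causality test passed, max leak =", leak)
```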

Code: https://github.com/badaramoni/wave-field-llm

Happy to answer questions about the physics, architecture decisions, or results.