feat(TaskProcessing): Add OCR TaskType#56908

marcelklehr · 2025-12-08T11:40:29Z

Summary

Adds a task processing task type for doing OCR

TODO

Ideas for more inputs?

Checklist

Code is properly formatted
Sign-off message is added to all commits
Tests (unit, integration, api and/or acceptance) are included
Screenshots before/after for front-end changes
Documentation (manuals or wiki) will be updated once the PR is merged
Backports requested where applicable (ex: critical bugfixes)
Labels added where applicable (ex: bug/enhancement, 3. to review, feature component)
Milestone added for target branch/version (ex: 32.x for stable32)

janepie · 2025-12-08T12:47:47Z

Looks good! We could add an input for the language to be extracted and have it default to automatic detection, or add that as optional input only for providers that make use of it. Both fine for me, wdyt @julien-nc @kyteinsky ?

julien-nc · 2025-12-08T12:50:18Z

Not sure the OCR libraries take a "language" param to help them perform an optimal extraction. @marcelklehr Do they?
If so, I'm ok with adding an input field. It's also fine to let the providers add an optional one as not all the providers might support the param.

marcelklehr · 2025-12-08T12:55:44Z

Not sure the OCR libraries take a "language" param to help them perform an optimal extraction. @marcelklehr Do they?

The latest models don't require a language input, but older libraries like tesseract may require this. I think an optional input is fine.

Signed-off-by: Marcel Klehr <mklehr@gmx.net>

kyteinsky · 2025-12-09T06:31:21Z

lib/public/TaskProcessing/TaskTypes/ImageToTextOpticalCharacterRecognition.php

+	public function getInputShape(): array {
+		return [
+			'input' => new ShapeDescriptor(
+				$this->l->t('Input Image'),
+				$this->l->t('The image to extract text from'),
+				EShapeType::Image
+			),
+		];
+	}


it would be nice if it were a ListOfFiles so it can accept images and pdfs both, and multiple of them instead of a single one for a single task, which also keeps the task list shorter in the DB.

susnux · 2026-01-28T13:28:03Z

New public API (interface and classes in OCP) need to be mentioned here:
https://github.com/nextcloud/documentation/blob/1a0415c8bf1541e90ae0b4da487a362a51a1cfe2/developer_manual/app_publishing_maintenance/app_upgrade_guide/upgrade_to_33.rst?plain=1#L184

see nextcloud/server#56908 see nextcloud/server#56717 Signed-off-by: Marcel Klehr <mklehr@gmx.net>

see nextcloud/server#56908 see nextcloud/server#56717 Signed-off-by: Marcel Klehr <mklehr@gmx.net> [skip ci]

see nextcloud/server#56908 see nextcloud/server#56717 Signed-off-by: Marcel Klehr <mklehr@gmx.net>

marcelklehr added this to the Nextcloud 33 milestone Dec 8, 2025

marcelklehr requested a review from a team as a code owner December 8, 2025 11:40

marcelklehr added the 3. to review Waiting for reviews label Dec 8, 2025

marcelklehr requested review from ArtificialOwl, CarlSchwan, icewind1991, julien-nc, kyteinsky and leftybournes and removed request for a team December 8, 2025 11:40

julien-nc approved these changes Dec 8, 2025

View reviewed changes

marcelklehr force-pushed the feat/tasktype-ocr branch from bca2b42 to e339591 Compare December 8, 2025 11:48

marcelklehr added enhancement feature: TaskProcessing labels Dec 8, 2025

marcelklehr requested a review from janepie December 8, 2025 12:34

janepie approved these changes Dec 8, 2025

View reviewed changes

marcelklehr force-pushed the feat/tasktype-ocr branch from e339591 to 483a4b2 Compare December 8, 2025 13:49

marcelklehr enabled auto-merge December 8, 2025 13:49

marcelklehr force-pushed the feat/tasktype-ocr branch from 483a4b2 to 42bf379 Compare December 8, 2025 16:41

feat(TaskProcessing): Add OCR TaskType

3355e6a

Signed-off-by: Marcel Klehr <mklehr@gmx.net>

marcelklehr force-pushed the feat/tasktype-ocr branch from 42bf379 to 3355e6a Compare December 8, 2025 16:44

kesselb disabled auto-merge December 8, 2025 16:45

kesselb merged commit b7b4a3a into master Dec 8, 2025
173 of 179 checks passed

kesselb deleted the feat/tasktype-ocr branch December 8, 2025 16:53

kyteinsky reviewed Dec 9, 2025

View reviewed changes

susnux added the pending documentation This pull request needs an associated documentation update label Jan 28, 2026

marcelklehr added a commit to nextcloud/documentation that referenced this pull request Feb 2, 2026

fix(developer_manual): Add TaskProcessing updates to 33 upgrade guide

f0c1a6f

see nextcloud/server#56908 see nextcloud/server#56717 Signed-off-by: Marcel Klehr <mklehr@gmx.net>

marcelklehr mentioned this pull request Feb 2, 2026

fix(developer_manual): Add TaskProcessing updates to 33 upgrade guide nextcloud/documentation#14046

Merged

backportbot bot pushed a commit to nextcloud/documentation that referenced this pull request Feb 2, 2026

fix(developer_manual): Add TaskProcessing updates to 33 upgrade guide

c31c4ea

see nextcloud/server#56908 see nextcloud/server#56717 Signed-off-by: Marcel Klehr <mklehr@gmx.net> [skip ci]

marcelklehr added a commit to nextcloud/documentation that referenced this pull request Feb 19, 2026

fix(developer_manual): Add TaskProcessing updates to 33 upgrade guide

17128a7

see nextcloud/server#56908 see nextcloud/server#56717 Signed-off-by: Marcel Klehr <mklehr@gmx.net>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Comments

feat(TaskProcessing): Add OCR TaskType#56908