Commit Graph

19 Commits

Author SHA1 Message Date
9d712c921a feat: add PDF document recognition with 10-page pre-hook
- Migrate recognition_results table to JSON schema (meta_data + content),
  replacing flat latex/markdown/mathml/mml columns
- Add TaskTypePDF constant and update all formula read/write paths
- Add PDFRecognitionService using pdftoppm (Poppler) for CGO-free page
  rendering; limits processing to first 10 pages (pre-hook)
- Reuse existing downstream OCR endpoint (cloud.texpixel.com) for each
  page image; stores results as [{page_number, markdown}] JSON array
- Add Redis queue + distributed lock for PDF worker goroutine
- Add REST endpoints: POST /v1/pdf/recognition, GET /v1/pdf/recognition/:task_no
- Add .pdf to OSS upload file type whitelist
- Add migrations/pdf_recognition.sql for safe data migration

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-31 14:17:44 +08:00
e538553045 fix: remove unused time import in task.go
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-27 02:41:40 +08:00
fcd9816b0b refact: add log for export 2026-03-25 18:28:12 +08:00
liuyuanchuang
18597ba7fa feat: add log for export error 2026-03-12 11:43:26 +08:00
liuyuanchuang
81c2767423 feat: add mml from backend 2026-02-05 10:44:11 +08:00
liuyuanchuang
a5f1ad153e refactor: update package path 2026-01-27 21:56:21 +08:00
97c3617731 feat: replace export url 2026-01-25 09:10:54 +08:00
3293f1f8a5 fix: downgrade error 2025-12-27 22:21:34 +08:00
7c4dfaba54 feat: modify field 2025-12-26 17:27:35 +08:00
a538bd6680 fix: modify ip 2025-12-26 16:41:36 +08:00
cd221719cf fix: http req 2025-12-26 16:38:04 +08:00
d0c0d2cbc3 fix: query by task-no 2025-12-26 16:28:49 +08:00
0aaafdbaa3 feat: add file export 2025-12-26 15:48:14 +08:00
e0904f5bfb feat: add mml 2025-12-20 22:57:53 +08:00
203c2b64c0 feat: add mathpixel 2025-12-20 21:42:58 +08:00
5a1983f08b feat: update oss download url 2025-12-18 15:14:42 +08:00
8a6da5b627 feat: add list api 2025-12-18 12:39:50 +08:00
0bc77f61e2 feat: update dockerfile 2025-12-10 23:17:24 +08:00
liuyuanchuang
48e63894eb init repo 2025-12-10 18:33:37 +08:00