Since #ChatGPT already does a decent job of postcorrecting #OCR errors, it would be interesting to see what it can achieve with fine-tuning now that it is available. #DigitizedNewspaper @cneud https://platform.openai.com/docs/guides/fine-tuning
Fine-tuning could be made with some ground truth data where the original OCR is available. I guess we should add the original OCR for our #OpenData on @huggingface @danielvanstrien https://huggingface.co/datasets/biglam/bnl_ground_truth_newspapers_before_1878