Contributors mailing list archives

contributors@odoo-community.org

Browse archives

Avatar

Re: Module to read and extract information from PDF's

by
ClosingAp Open Source Integrators Europe, LDA, Daniel Reis
- 07/09/2023 09:21:08
Out of the box Odoo is capable to extract the text content from a it.attachment.
You just need to make sure the pdfminer.six Python library is installed.

When hat is the case, the attachment document text is extracted and written in a ir.attachment text field.
You can then do content search or even implement business logic based on it.

Reference:
https://github.com/odoo/odoo/blob/55423cbdeeb1ce35fb257624ea0d04d4be99a943/addons/attachment_indexation/__manifest__.py#L13

Thanks
Daniel

On 06/09/2023 21:42, Samuel Macias Oropeza wrote:
Hello everyone. 

We have a client using Odoo 16 that needs to extract information from a PDF file and update a res.partner record with this info. The PDF contains data like name, address, ZIP Code, VAT number, etc. Does anyone know of any module/python library that could help us with this?

Thank you!

--

SAMUEL MACIAS OROPEZA

TECH LEAD

smacias@opensourceintegrators.com

P.O. BOX 940, HIGLEY, AZ 85236

_______________________________________________
Mailing-List: https://odoo-community.org/groups/contributors-15
Post to: mailto:contributors@odoo-community.org
Unsubscribe: https://odoo-community.org/groups?unsubscribe


--
DANIEL REIS
MANAGING PARTNER

M: +351 919 991 307
E: dreis@OpenSourceIntegrators.com
A: Avenida da República 3000, Estoril Office B, 3º Escr.34, 2649-517 Cascais

Reference