Pierre Guillou

1.4K Followers

4 days ago

Document AI | APP to compare the Document Understanding LiLT and LayoutXLM (base) models at line level

Following the publication of the DocLayNet dataset (IBM Research) and of Document Understanding models on Hugging Face (for example, the LayoutLM series and LiLT), two line-level Document Understanding models have already been published: a LiLT base model and a LayoutXLM base model (Microsoft), both fine-tuned on the DocLayNet dataset…

Deep Learning

4 min read

Mar 6

Document AI | Inference APP and fine-tuning notebook for Document Understanding at line level with LayoutXLM base

Following the publication of the DocLayNet dataset (IBM Research) and of Document Understanding models on Hugging Face (for example, the LayoutLM series and LiLT), a line-level Document Understanding model had already been published: a LiLT base model fine-tuned on the DocLayNet base dataset with overlap chunks of…

Microsoft

5 min read

Feb 16

Document AI | Inference APP and fine-tuning notebook for Document Understanding at paragraph level

Following the publication of the DocLayNet dataset (IBM Research) and of Document Understanding models on Hugging Face (for example, the LayoutLM series and LiLT), a paragraph-level Document Understanding model has been fine-tuned and published: a LiLT base model with overlap chunks of 512 tokens that uses the…

Deep Learning

5 min read

Feb 14

Document AI | Inference APP for Document Understanding at line level

Following the publication of the DocLayNet dataset (IBM Research) and of Document Understanding models on Hugging Face (for example, the LayoutLM series and LiLT), a Document Understanding model has been fine-tuned and published: a line-level LiLT base model with overlap chunks of 384 tokens that uses the…

Deep Learning

4 min read

Feb 10

Document AI | Document Understanding model at line level with LiLT, Tesseract and DocLayNet dataset

The publication of the DocLayNet dataset (IBM Research) and of Document Understanding models on Hugging Face (for example, the LayoutLM series and LiLT) makes it possible (at last!) to train such models on any document containing text (for example, PDFs, slides, images with text) with labels that interest the greatest number…

Hugging Face

5 min read

Jan 31

Document AI | DocLayNet image viewer APP

After creating three downloadable formats of DocLayNet (small, base, and large), which also make it easier to use DocLayNet data in Hugging Face notebooks for fine-tuning document layout models, it was important to have an APP for viewing annotated images with labeled bounding boxes of paragraphs and lines. Indeed, this visualization…

Hugging Face

4 min read

Jan 27

Document AI | Processing of DocLayNet dataset to be used by layout models of the Hugging Face hub (finetuning, inference)

Document AI is still a new area of NLP, but it affects all businesses and individuals. It consists of using AI models to understand the content of documents such as PDFs both visually and textually. This makes it possible to categorize the text, images and tables of documents (header, main text…

Hugging Face

7 min read

Dec 9, 2022

Speech-to-Text & AI | Transcribe any audio into Portuguese with Whisper (OpenAI)… at no cost!

Are you paying for an online service to get text transcriptions of your audio files? Why not use an OpenAI Whisper model to do that job… for free! Need to specialize a Whisper model for the linguistic peculiarities of your audio files? No problem, there are scripts and…

Whisper

7 min read

Nov 22, 2022

AI & business | Reduce the inference time of Transformer models with BetterTransformer

How can you get faster inference (and in a very simple way!) from your existing Transformer models (that is, without having to retrain them) and thereby reduce inference server costs? The solution comes through code: BetterTransformer from Hugging Face! …

Hugging Face

7 min read

Nov 11, 2022

NLP & Code for everyone | Weighted loss function for (multiclass) text classification

This article explains why and how to use a weighted loss function to train a text classification model. Notebook: Text_Classification_on_GLUE_with_weighted_Loss.ipynb. One of the most common tasks in NLP: consciously or not, one of the most frequent intellectual activities in business is text classification. …

NLP

4 min read

Pierre Guillou

AI, Deep learning, NLP models author | Brazil & France
