Scientific article summarization
Technical skill used
- Python
- Tensorflow
- Transformers
- nltk
- HuggingFace
Synopsis
This model generate a summarized scientific article from arXiv. The goal is to understand in a few sentences the content of the article. It is not aimed to replace the article abstract.
Model Description
The model used is a T5-small with Hugging-Face weight (Transfert Learning)
Inputs
- text as a token format
outputs
- A text with a number of word. The general meaning is as close as the article abstract.
Model limitation
- The article must be available at arXiv.
- The article must come in
tex
format. - The `tex' should be in a valid format.
- The article should be written in English.
Tradeoff
- A complete resume requires a longer inference time.
- Mathematics formulas and citations are hidden to not have to make another train for understanding these formulas