Advanced searches left 3/3

Abstractive Summarization - Arxiv

Summarized by Plex Scholar
Last Updated: 05 November 2022

* If you want to update the article please login/register

FRSUM: Towards Faithful Abstractive Summarization via Enhancing Factual Robustness

Current Seq2Seq summarization models are also struggling from the unfaithful generation issue despite being able to produce fluent and grammatical text. We examine the accuracy of existing methodologies from a new perspective of truthfulness, which is the ability to efficiently gather reliable data against contradictory untrue information. When collecting factual data, we first measure a model's factual robustness by its success rate to shield against adversarial attacks. The factual robustness analysis on a variety of current technologies shows its high consistency with human judgments on faithfulness.

Source link: https://arxiv.org/abs/2211.00294v1


Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling

Abstract summaries that include factual mistakes or hallucinated information are often created by parametric summaries that lack clear information or hallucinated text. However, creating non-factual summaries using heuristics does not always lead to correct model errors. We'll be able to produce hard, accurate synthetic examples of non-factual summaries by infilling language models in this study. According to multiple summarization schemes, our model, FactEdit, raises truthuality scores by over 111 points on CNN/DM and over 316 on XSum on average, producing more factual summaries while maintaining competitive summarization quality.

Source link: https://arxiv.org/abs/2210.12378v2


Leveraging Locality in Abstractive Text Summarization

On several common natural language processing tasks, neural attention technologies have made significant strides. However, long text summarization is difficult for them because of the quadratic memory complexity of the self-attention module with respect to the input length. We explore whether models with a restricted context can have competitive results in comparison to memory-efficient attention models that keep a global context by treating the input as a series. Instead of designing more advanced attention modules, we're exploring if models with a limited context can have competitive outcomes in comparison to the memory-efficient attention models that maintain a global context by treating the input as a single sequence.

Source link: https://arxiv.org/abs/2205.12476v2


How Far are We from Robust Long Abstractive Summarization?

We show that, rather than factual summaries, the constant quest for state-of-the-art ROUGE findings can lead to more precise summaries but not literal ones. ROUGE is the best at assessing the relevancy of a summary's findings, according to long document review studies, human evaluation findings show that ROUGE is the best at determining the relevancy of a summary. It also reveals significant shortcomings of factuality statistics in detecting various types of factual anomalies and the reasons behind BARTScore's success.

Source link: https://arxiv.org/abs/2210.16732v1


Mutual Information Alleviates Hallucinations in Abstractive Summarization

With a lot of model uncertainty, we find a simple criterion under which models are significantly more likely to assign more credibility to hallucinated material during creation: high model uncertainty. We recommend a decoding scheme that shifts to optimizing for pointwise mutual knowledge of the source and target token rather than solely the likelihood of the target token--in the case where the model displays uncertainty. Our experiment results on the XSum dataset show that our method reduces the chance of hallucinated tokens while keeping the Rouge and BertS rankings of top-performing decoding strategies.

Source link: https://arxiv.org/abs/2210.13210v2


Improving abstractive summarization with energy-based re-ranking

At the same time, automated evaluation tools such as CTC scores have been recently introduced that show a greater correlation with human judgments than traditional lexical-overlap metrics such as ROUGE. We've tried various metrics to prepare our energy-based re-ranker and found that it consistently raises the predicted summaries' scores. Nevertheless, human evaluation results suggest that the re-ranking strategy should be used with caution for extremely abstract summaries, as the available metrics are not yet appropriate for this purpose.

Source link: https://arxiv.org/abs/2210.15553v1


Factorizing Content and Budget Decisions in Abstractive Summarization of Long Documents

We claim that disentangling content selection from the budget used to produce salient content improves abstract summary's effectiveness and applicability of abstract summaries. FactorSum converts summarization into two steps through an energy function, resulting in a final report based on budget and content recommendations, according to our method, FactorSum. This factorization achieves much higher ROUGE scores on three benchmarks for long document summarization, namely PubMed, arXiv, and GovReport.

Source link: https://arxiv.org/abs/2205.12486v2


Analyzing Multi-Task Learning for Abstractive Text Summarization

Despite the recent success of multi-task learning and pre-finetuning for natural language acquisition, no studies have looked at the effects of task families on abstract text summarization. Task families are a form of task grouping during the pre-finetuning process to teach fundamental skills such as reading comprehension. For the English abstract text summarization task, we investigate the impact of multi-task learning strategies using task families to close the void.

Source link: https://arxiv.org/abs/2210.14606v1


Salience Allocation as Guidance for Abstractive Summarization

Abstractive summarization schemes commonly learn to gather the salient data from scratch implicitly. It is difficult to find a definite threshold determining which content should be included in the guidelines as the number and distribution of salience content pieces vary. SEASON's use of salience expectation guides abstractive summarization and adapts well to articles with different abstractness. Empirical results on more than one million news articles show a natural fifteen-fifty salience split for news article sentences, providing valuable insight for writing news articles.

Source link: https://arxiv.org/abs/2210.12330v1


Taxonomy of Abstractive Dialogue Summarization: Scenarios, Approaches and Future Directions

Abstractive dialogue summarization aims to produce a short and fluent summary covering the salient points in a discussion between two or two interlocutors. In recent years, it has drew soaring interest in recent years, due to the rapid emergence of social media platforms and a pressing need for effective dialogue information processing and digestion. Dialogs are unique in contrast to traditional document summarization or papers in traditional newspaper summarization, conveying different language styles and designs, scattered data, flexible discourse models, and unclear subject boundaries are all typical. It assigns a taxonomy of existing techniques in three directions, namely, injecting dialogue features, designing auxiliary training tasks, and using additional measurements. According to a list of research studies, the task is broken into two broad categories, namely, open-domain and task-oriented, and widely accepted evaluation metrics are summarized for completeness.

Source link: https://arxiv.org/abs/2210.09894v1

* Please keep in mind that all text is summarized by machine, we do not bear any responsibility, and you should always check original source before taking any actions

* Please keep in mind that all text is summarized by machine, we do not bear any responsibility, and you should always check original source before taking any actions