Studies that remove or modify certain components of the model to understand their contribution to the overall performance.
Produces coherent summaries by conducting cross-sentence information ordering, compression, and revision
A semantic formalism that encodes core concepts and models relations in the text at a high level of abstraction.
A reward used in the mixed training objective of a summarization model to encourage generating words not found in the source document
Methods that construct new sentences as summaries; they require a deeper understanding of the text and the ability to generate new sentences, which offers a clear advantage in improving a summary's focus, reducing redundancy, and maintaining a good compression rate.
Summarization algorithms that may generate new text that is not present in the initial document.
Summaries that use natural language to convey the meaning of the original document.
A subjective task in natural language processing that involves summarizing text in a way that captures the main ideas and meaning of the original text.
Preferred by users over extractive summaries
A type of QFS that incorporates the query relevance into existing neural summarization models.
Summarization systems that generate summaries based on specific queries or questions
The process of summarizing questions in a concise and informative manner
Automatically generating concise and coherent summaries of user reviews.
A subtask in CASAS that generates aspect/sentiment-aware abstractive summaries of reviews.
Summarization techniques that generate summaries with novel words.
Models that generate a summary of a text by paraphrasing and rephrasing the original content, rather than simply selecting and extracting sentences
Summarisers that generate new sentences that capture the essence of the input document to form the summary
A technique for generating summaries that go beyond verbatim copying of the original text and instead generate new and abstract concepts that reflect high-level semantics.
A summarization strategy that involves paraphrasing the source document.
A system that produces summaries that resemble reference summaries on a word-to-word basis.
A summary that uses natural language to convey the meaning of the original text
Summarization systems that generate summaries by paraphrasing and rephrasing the input text
Generate novel words and sentences
Attentional feed-forward network and recurrent neural network-based encoder-decoder models.
Defining bins corresponding to three abstractiveness levels and designing constraints that allow users to control the summary’s abstractiveness
A component of the hybrid architecture that rewrites and compresses each of the extracted sentences.
A metric introduced to extract summaries by exploiting the abstract of a paper
Functions used in Bayesian optimization to quantify the value of querying the user about a particular pair of candidates.
Extracting action triplets from the text and constructing an action graph to encode the structural information in the unstructured text.
A common solution to reduce the number of labels a user must provide by iteratively acquiring labels and training a model on the labels collected so far.
A reinforcement learning technique used to connect the extractor and abstractor networks and learn sentence saliency.
Existing methods designed for specific scenarios, limiting their applicability in a unified framework.
Summaries that possess desired properties and accurately convey the main concepts of the original text.
A technique used to generate hard factual inconsistent examples for training a model
A framework that jointly trains a generative model G and a discriminative model D, where G takes the original text as input and generates the summary, and D learns to classify the generated summaries as machine or human generated
A methodology for evaluating the performance of factuality metrics by applying adversarial transformations to test datasets with different distributions.
A summary given to patients after their clinical visit, intended to summarize patients’ clinical visits and help their disease self-management.
A framework in which the similarity of two summaries is calculated based on an alignment between the summaries’ tokens
Multiple perspectives on the same event.
A model that rewrites pronominal mentions to increase expressivity.
A specific implementation of the protocol that utilizes a set of lexical items in the source document to compute the ROUGE metric.
The design of the manual annotation process, including the number of annotators and the distribution of annotators to annotation items
A method for collecting human judgements on the factuality of model-generated summaries
The degree to which different annotators agree on the same evaluation
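A common way to quantify this agreement, beyond raw percent overlap, is Cohen's kappa, which corrects for the agreement expected by chance. A minimal sketch for two annotators (the label values and data here are illustrative, not from the source):

```python
from collections import Counter

def cohens_kappa(labels_a, labels_b):
    """Agreement between two annotators, corrected for chance agreement."""
    assert len(labels_a) == len(labels_b)
    n = len(labels_a)
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    # Chance agreement: probability both annotators pick the same label independently.
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    return (observed - expected) / (1 - expected)

a = ["good", "good", "bad", "good", "bad", "bad"]
b = ["good", "bad", "bad", "good", "bad", "good"]
print(round(cohens_kappa(a, b), 3))  # → 0.333
```

Kappa near 0 indicates chance-level agreement; values approaching 1 indicate strong agreement.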
A set of guidelines used to annotate or mark up text for specific purposes.
Refers to a study conducted to identify errors made by state-of-the-art summarizers on two benchmarks
The process of evaluating the overlap between the answers generated by the QA model and the selected answers.
A score that predicts the answer relevance of the given source documents to the query.
A task in CQA that involves selecting the correct answer from a set of candidates.
A method for answer summarization that involves selecting relevant sentences from a set of candidate sentences to answer a question.
The paper introduces the concept of answer summarization, which is a form of query-based, multi-document summarization that involves several subtasks, including query sentence relevance, clustering, cluster summarization, and fusion.
The task of verifying whether the QA model's prediction is correct
A noise-estimating model that can be trained from a single noisy corpus to distinguish appropriate and inappropriate pairs of source and target texts.
An aspect that may not be explicitly mentioned but is related to portions of the document
Models used to evaluate the quality of arguments
The process of mining product-related aspects and identifying sentences related to those aspects.
Words indicating product information
The process of analyzing the sentiment expressed towards different aspects of a product or service in a review.
A type of opinion summarization that allows for the control of the number and type of aspects included in the summary
Keywords that represent specific aspects or topics within a document
Summarization that focuses on specific aspects or topics within a document
Representations used in QT for identifying specific aspects of an entity.
Detailed summaries on individual aspects of an entity
A type of opinion summarization that focuses on creating summaries for individual aspects of a product or service
Transcriptions of speech generated by Automatic Speech Recognition technology
Individuals hired to collect relevance judgments for tasks such as test collection construction
Articles from alternative news resources covering the same news event that can complement the background knowledge in a generated summary.
A variant of sequence-to-sequence models that attend to relevant parts of the input when producing an output
A method of ensuring factual consistency by focusing on triples of subject, predicate, and object
An update unit based on current decoding state, designed to retain the attention on salient parts but weaken the attention on irrelevant parts of input.
A technique used in recurrent neural networks to inform content selection and boost summarizer performance.
An encoder-decoder approach that uses attention mechanisms to focus on important parts of the input
A model relevant to abstractive summarization, originally developed for machine translation, where it has produced state-of-the-art performance.
Information about users who post reviews, such as gender, age, and occupation.
A model that incorporates attribute information into review summarization using a sequence to sequence (S2S) model with an attribute encoder, attribute-aware review encoder, and attribute-aware summary decoder.
Words and phrases that are specific to certain user attributes and can be incorporated into review summarization to improve performance.
A method of compressing data into a latent representation without supervision
A decoding method where the model generates one token at a time based on the previously generated tokens.
A type of neural network architecture used for text generation that generates one word at a time based on the previous words generated.
A type of language model that generates text one word at a time, conditioned on the previous words.
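The entries above describe the autoregressive loop shared by these decoders: each step conditions on everything generated so far. A minimal greedy-decoding sketch, using a toy bigram table as a stand-in for a trained model (the table and token names are illustrative):

```python
def greedy_decode(next_token_scores, bos, eos, max_len=20):
    """Autoregressive loop: each step conditions on the tokens generated so far.
    `next_token_scores(prefix)` maps a token prefix to a score per vocab item
    (here a stand-in for a neural language model)."""
    tokens = [bos]
    for _ in range(max_len):
        scores = next_token_scores(tuple(tokens))
        nxt = max(scores, key=scores.get)  # greedy: pick the highest-scoring token
        tokens.append(nxt)
        if nxt == eos:
            break
    return tokens

# Toy "model": a bigram table; a real decoder would condition on the full prefix.
bigram = {"<s>": {"the": 2.0, "a": 1.0},
          "the": {"cat": 1.5, "dog": 1.0},
          "cat": {"</s>": 3.0},
          "dog": {"</s>": 3.0}}
model = lambda prefix: bigram[prefix[-1]]
print(greedy_decode(model, "<s>", "</s>"))  # → ['<s>', 'the', 'cat', '</s>']
```

Beam search replaces the single greedy choice with the top-k prefixes kept at each step.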
Using previous predictions to inform future predictions in sentence extraction
Algorithms that automatically generate new, smaller, customized models for a custom dataset.
Highly desirable, since generating summaries manually from varied information sources such as social media, databases, and web articles requires considerable effort and language skill.
The evaluation of a summarization model without human intervention
The generation of a summary without human intervention
A strategy to automate the pipeline for model creation, including automated generation of the model itself.
Different types of measures, including question-answering, text reconstruction, semantic similarity, and lexical overlap, with various families of measures within each type
The process of automatically finding and retrieving information
The process of generating a summary sentence from a set of input sentences.
An efficient evaluation method for the quality of learner summaries that can enhance reading comprehension tasks.
The paper discusses the growing demand for automated summarization of online conversations due to the increasing amount of information exchanged.
Systems that produce a summary of a text automatically
The process of generating a summary of a given text using machine learning techniques.
A method for producing concise and coherent summaries to facilitate quick information consumption
Designed for comparative summaries on a new dataset of three ongoing controversial news topics.
Automatic detection of factual inconsistency in summarization models
The process of creating a shorter version of a longer text document while retaining its most important information.
Summarizing to-do items from given emails to help people overview overwhelming numbers of emails they receive every day and schedule their daily work.
A metric used to evaluate system performance for the task of document summarization.
Challenging and often inconsistent with human evaluation
The use of fact descriptions brings significant improvement in this aspect.
Method for fully automatic key point analysis that selects short, high quality comments as key point candidates and leverages previous work on argument-to-key-point matching to select a subset of candidates that achieve high coverage of the data
Transcriptions of speech from meetings that are generated automatically using speech recognition technology
Methods used to automatically evaluate the quality of summarization
A model that captures the unidirectional dependency between sentences, where the state of the current sentence is based on previous sentence labels.
A solution to the issue of redundant phrases by introducing an autoregressive decoder, which extracts sentences one by one and allows different sentences to influence each other.
A type of pre-training objective for language models where the model is trained to predict the next word in a sequence
A technique that encourages the model's scores for sentences to mimic an estimated score distribution over the sentences
Added to strengthen the robustness of the node representations. Aims to classify the shortest path length (e.g., N-hops) from query nodes to others.
Additional tasks designed to improve the performance of the main summarization task, such as recognizing salient entities and ensuring factual consistency.
A technique used in training neural networks to adjust the weights of the model based on the error between the predicted output and the actual output.
A method of representing text data as a bag of its words, disregarding grammar and word order.
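A minimal sketch of the bag-of-words representation, counting word occurrences while discarding order (tokenization here is naive whitespace splitting for illustration):

```python
from collections import Counter

def bag_of_words(text):
    """Represent text as word counts, discarding grammar and word order."""
    return Counter(text.lower().split())

bow = bag_of_words("the cat sat on the mat")
print(bow["the"])  # → 2
```

Note that "the cat sat" and "sat the cat" map to the same representation, which is exactly the information the model gives up.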
The parameterized models between two learning stages are relatively independent, preventing the meta model from fully utilizing the knowledge encoded in the base systems.
Algorithms used to compare and evaluate the performance of example-based summarization systems
A parameter used in the beam search algorithm for text generation. Raffel et al. (2019) used a beam search length penalty of 0.6 in their study.
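One common formulation (assumed here: the GNMT-style penalty of Wu et al., 2016, which is what the 0.6 value typically parameterizes) divides a hypothesis's log-probability by a length-dependent factor so longer outputs are not unfairly penalized:

```python
def length_penalty(length, alpha=0.6):
    """GNMT-style length penalty: lp = ((5 + len) / 6) ** alpha."""
    return ((5 + length) / 6) ** alpha

def rescore(log_prob, length, alpha=0.6):
    """Beam hypotheses are ranked by log-prob divided by the penalty,
    so a longer hypothesis with a slightly lower raw log-prob can still win."""
    return log_prob / length_penalty(length, alpha)

print(rescore(-3.2, 8) > rescore(-3.0, 4))  # → True: the longer hypothesis wins
```

With alpha = 0 the penalty is 1 (no normalization); larger alpha favors longer outputs more strongly.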
An optimisation strategy that iteratively evaluates the semantic coverage scores of a set of candidate summaries to obtain the optimal extraction.
A mechanism that helps to ensure that the generated summary is similar to the reference summary.
A ranking-based method in which the annotator selects the best and worst example out of a set of examples
The labeling of summaries as either factual or non-factual, which can be difficult to determine and may not provide a fine-grained understanding of factual errors
A method of solving the non-convex sparse optimization problem in compressive summarization.
Resampling techniques used to generate new instances of matrices of values, which are used to calculate the correlation of an evaluation metric to human judgments.
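A sketch of the basic procedure, assuming paired metric and human scores per summary (the data below is illustrative): resample the pairs with replacement, recompute the correlation each time, and read a confidence interval off the resulting distribution.

```python
import random
from statistics import mean

def pearson(xs, ys):
    mx, my = mean(xs), mean(ys)
    num = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    den = (sum((x - mx) ** 2 for x in xs) * sum((y - my) ** 2 for y in ys)) ** 0.5
    return num / den

def bootstrap_corr(metric, human, n_resamples=1000, seed=0):
    """Resample (metric, human) pairs with replacement and recompute the
    correlation each time, yielding a 95% confidence interval."""
    rng = random.Random(seed)
    pairs = list(zip(metric, human))
    corrs = []
    for _ in range(n_resamples):
        sample = [rng.choice(pairs) for _ in pairs]
        xs, ys = zip(*sample)
        if len(set(xs)) > 1 and len(set(ys)) > 1:  # skip zero-variance resamples
            corrs.append(pearson(xs, ys))
    corrs.sort()
    return corrs[int(0.025 * len(corrs))], corrs[int(0.975 * len(corrs))]

metric = [0.31, 0.45, 0.22, 0.50, 0.38, 0.41, 0.28, 0.47]
human  = [2.0, 4.0, 1.5, 4.5, 3.0, 3.5, 2.5, 4.0]
lo, hi = bootstrap_corr(metric, human)
print(lo < hi)
```

A wide interval signals that the observed metric-human correlation is unstable under resampling and should be reported with caution.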
An algorithmic approach that divides a problem into smaller sub-problems and solves them recursively.
The quality of being concise and to the point
Given a collection S of sets with associated costs and a budget L, find a subset S′ ⊆ S such that the total cost of sets in S′ does not exceed L, and the total weight of elements covered by S′ is maximized.
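The problem above is NP-hard, but a standard greedy approximation repeatedly picks the affordable set with the best ratio of newly covered weight to cost. A minimal sketch (the example sets and costs are illustrative):

```python
def greedy_budgeted_coverage(sets, costs, weights, budget):
    """Greedy approximation to budgeted maximum coverage: repeatedly pick the
    affordable set with the highest (newly covered weight / cost) ratio."""
    covered, chosen, spent = set(), [], 0.0
    while True:
        best, best_ratio = None, 0.0
        for name, elems in sets.items():
            if name in chosen or spent + costs[name] > budget:
                continue
            gain = sum(weights[e] for e in elems - covered)  # only new elements count
            if gain / costs[name] > best_ratio:
                best, best_ratio = name, gain / costs[name]
        if best is None:
            return chosen, covered
        chosen.append(best)
        covered |= sets[best]
        spent += costs[best]

sets = {"s1": {"a", "b"}, "s2": {"b", "c", "d"}, "s3": {"e"}}
costs = {"s1": 2.0, "s2": 3.0, "s3": 1.0}
weights = {e: 1.0 for e in "abcde"}
chosen, covered = greedy_budgeted_coverage(sets, costs, weights, budget=4.0)
print(sorted(chosen), sorted(covered))
```

In this toy instance the greedy choice (s1 then s3, weight 3) is suboptimal (s2 + s3 would cover weight 4 within budget), illustrating that greedy gives an approximation rather than the optimum.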
A technique used to split rare words into sub-units to eliminate the out-of-vocabulary (OOV) problem.
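The best-known instance of this idea is byte-pair encoding (BPE): repeatedly fuse the most frequent adjacent symbol pair, so frequent subwords become single units and rare words decompose into known sub-units. A minimal merge-learning sketch (the toy word list is illustrative, and production BPE implementations differ in details such as end-of-word handling):

```python
from collections import Counter

def learn_bpe(words, num_merges):
    """Learn BPE merges: each round fuses the most frequent adjacent pair."""
    vocab = Counter(tuple(w) + ("</w>",) for w in words)  # words as symbol tuples
    merges = []
    for _ in range(num_merges):
        pairs = Counter()
        for word, freq in vocab.items():
            for a, b in zip(word, word[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break
        (a, b), _ = pairs.most_common(1)[0]
        merges.append((a, b))
        new_vocab = Counter()
        for word, freq in vocab.items():  # apply the merge everywhere
            merged, i = [], 0
            while i < len(word):
                if i + 1 < len(word) and (word[i], word[i + 1]) == (a, b):
                    merged.append(a + b)
                    i += 2
                else:
                    merged.append(word[i])
                    i += 1
            new_vocab[tuple(merged)] += freq
        vocab = new_vocab
    return merges

print(learn_bpe(["low", "low", "lower", "lowest"], 2))
```

After enough merges, frequent words become single tokens while an unseen word like "lowish" still segments into known sub-units, eliminating OOV tokens.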
The quality of candidate summaries generated by the abstractive model, which can be estimated more accurately using the proposed training paradigm.
A type of architecture for neural text summarization that involves a pipeline approach where one or two sentences are selected from the source document, their summary-worthy segments are highlighted, and a summary sentence is composed using a neural generator.
A method used to align each section of the structured summary with a set of input sentences classified as the same type.
A method for determining causal relationships between variables
Identifying the most important content in a document cluster
A measure used to determine the importance of each sentence in a document
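One simple centrality measure, in the spirit of degree centrality, scores each sentence by its summed similarity to every other sentence in the document; a sketch using bag-of-words cosine similarity (graph-based methods like LexRank refine this with PageRank-style iteration):

```python
from collections import Counter
from math import sqrt

def cosine(a, b):
    num = sum(a[w] * b[w] for w in a if w in b)
    den = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return num / den if den else 0.0

def centrality_scores(sentences):
    """Score each sentence by its summed similarity to all other sentences:
    sentences that resemble many others are treated as more important."""
    bows = [Counter(s.lower().split()) for s in sentences]
    return [sum(cosine(bows[i], bows[j]) for j in range(len(bows)) if j != i)
            for i in range(len(bows))]

sents = ["the model summarizes the document",
         "the document is summarized by the model",
         "bananas are yellow"]
scores = centrality_scores(sents)
print(scores.index(max(scores)) in (0, 1))  # → True: off-topic sentence scores lowest
```

Selecting the top-scoring sentences under a length budget yields a basic centrality-based extractive summarizer.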
Factors considered by selection models in extraction-based summarization systems
An attention function that is extended to consider the centrality score of source words.
Important for generating high-quality summaries, by allowing models to learn from reference papers.
The set of citing sentences to a paper, which contains extra information that does not appear in paper abstracts and is more focused in describing the paper's main contributions.
A short textual description regarding related work in scientific literature that highlights certain contributions of the referenced paper and can provide useful information about that paper.
A type of scientific summary which is formed by utilizing a set of citations to a referenced article. This set of citations has been previously indicated as a good representation of important findings and contributions of the article. Contributions stated in the citations are usually more focused than the abstract and contain additional information that is not in the abstract.
The textual spans in the reference articles that reflect the citation.
A process involving clinical evidence acquisition with integration and abstraction over medical knowledge to synthesize a conclusion in the form of a diagnosis.
Terms related to the clinical domain, which can be used to improve the content selection and summary generation of radiology reports.
An algorithm that takes two review sets as input to compare and contrast the token probability distributions of the models to generate more distinctive summaries.
A matrix that shows the frequency of co-occurrence of words or phrases in a corpus
A technique used to compensate for the missing regularization requirement of abstractive summarization in the standard framework, learning a category-specific text encoder and improving the quality of locating salient aspect information of the review.
A two-layer hierarchical attention method that divides the input into chunks and sparsely attends to one or a few chunks at a time using hard attention, then applies full attention over those chunks
Linguistically motivated patterns created by entities in connected sentences that ensure coherence in automatic summarization.
A method that ranks summaries based on their coherence using human annotated coherence scores.
Summaries that are logically connected and easy to read
Words or phrases that tie sentences together into a coherent text.
Dividing the task of encoding a long text across multiple agents, each in charge of a different subsection of the text
The process of generating knowledge or solutions through the collaboration and input of multiple individuals
The most related sentence to the to-do item.
Prior knowledge that facilitates and enhances human reading, which is expected to have a large influence on the reading process as it helps the reader to construct a coherent mental representation of the document and gain an overview of the content in the document.
A type of question answering system with real-world applications such as Yahoo! Answers and StackExchange.
A comparison of well-known summarization systems regarding their implicit choices of θ by measuring the correlation of their θ functions with human judgments on two datasets from the Text Analysis Conference (TAC).
A novel task that generates two contrastive summaries and one common summary by comparing multiple entities.
Selecting documents that represent each group and highlight differences between groups.
A method for modeling the interaction between QA pairs by aggregating comparison signals from low-level elements into high-level representations.
Post-editing is less likely to lead to problems of factual correctness and consistency.
The existence of complementarity introduced by decoding algorithms or system combination among the current state-of-the-art summarization systems, which can be effectively utilized by Refactor for boosting the system performance.
The different types of guidance signals investigated in the paper are complementary to each other, and their outputs can be aggregated together to obtain further improvements in summarization quality.
The property of word embeddings that allows the combination of vectors to represent the meaning of a phrase or sentence.
Lacking it may result in puzzling wording, where some subtopics are unnecessarily visited multiple times, and in faulty summaries in which some salient information is mistakenly left unexplored.
Summaries that are compressed automatically for each sentence
A technique historically used in heuristic-driven systems or in systems with only certain components being learned. It involves identifying permissible deletions in text.
Summarization techniques used to reduce the length of the summary
A technique used in hybrid methods to discard uninformative phrases in selected sentences.
A paradigm used to train unsupervised encoder-decoder-based summarizers in the absence of paired data
A system that compresses text in a data-driven way, offering a tradeoff between the robustness of extractive models and the flexibility of abstractive models.
Visual summaries structured as directed graphs
A novel model that encourages the generation of conceptual and abstract words by leveraging context-aware conceptualization and a concept pointer, both of which are jointly integrated into the generator to deliver informative and abstract-oriented summaries.
The process of reducing the number of concepts in the model to find optimal solutions efficiently.
A knowledge graph used to expand the aspect scope and enrich the supervisions
The quality of being concise, expressing much in few words.
A method of generating summaries that uses control codes to condition the output.
Models used for single-document news summarization that include sequence-to-sequence architectures with attention and copy mechanisms, Transformers, and pre-trained language modeling.
A method to directly test abstractive summarizers in terms of how they score potential candidate summaries, allowing for testing of specific desired qualities such as semantic consistency and entailment by the source text.
A type of recurrent neural network that generates output based on a conditioning input
The selection of the most salient points and how those points are expressed are explicitly conditioned on an ad-hoc context, such as a question or topic of interest.
Results are reported on three canonical conditional text generation tasks of increasing complexity
A range of values that is likely to contain the true value with a certain level of confidence
The impact of output length on the evaluation of summarization systems.
Summaries that represent dominant opinions in reviews and can be useful for quick decision making and getting an overall feel for a product or business.
A process of generating summaries that represent dominant opinions in reviews, which can be useful for quick decision making and to get an overall feel for a product or business.
Improvement of controllable summarization models based on two different architectures
Providing summarized content with an additional constraint, i.e., the commonality criteria.
Ensuring that critical pronoun references are clear in the final summary.
Ensuring that sentence realizations are well-formed.
Questions related to health asked by patients and their families, which tend to include numerous peripheral details that are not always needed to find correct answers.
The degree to which the summary accurately represents the content of the original reviews.
The degree of importance of information in a given document
The degree to which the content of different summaries of the same editorial overlap.
The process of planning the content of a generated text.
The quality of the content in a summary
Measures used to evaluate the quality of a summary, such as relevance, coherence, and completeness.
The degree to which a summary accurately captures the information in the original text
A module designed for coarse filtering that retains the most promising candidates from a sheer number of sentences in the original document.
A phenomenon in abstractive summarization where the contents of the original document may be reordered in its summary.
The process of selecting important content from the source text to generate a summary.
A system that decides on relevant aspects of the source document by identifying tokens from a document that are part of its summary.
Similar text segments that form a content unit, where the contributing text segments of a content unit should have similar semantic meanings
Errors in the content of the summary that can be verified by checking the source document.
A new evaluation metric that abstracts away from the particular surface form of the target summary, representing it instead as facts using Semantic Role Labelling (SRL).
Interprets textual sequential meaning on the Transformer.
A method of formulating extractive summarization that greatly reduces the size of the space that must be explored, removes the need to perform supervised pre-training, and prevents systematically privileging earlier sentences over later ones.
A problem where the model takes an action which is a to-be-selected sentence set and then receives a reward based on the correlation between extractive summary and gold-standard reference summary.
Used for aligning facts present in the source document with the facts selected by a human-written summary.
Information from the source article that is lost in post-processing based approaches
A model used in the data-driven approach that learns a latent soft alignment over the input text to help inform the summary
A network responsible for extracting and compacting the source document in a summarization model
A method used to build a content selection model that can identify correct tokens with a recall of over 60%, and a precision of over 50%.
Adding the appropriate context from the reference article to the citation texts to better understand the context for the ideas, methods or findings stated in the citation text.
A model that can be finetuned based on the deep bi-directional Transformer for content selection in summarization.
A type of neural network model that is pre-trained on large amounts of data and can be fine-tuned for specific tasks.
A representation that takes the whole document into consideration to learn the document-level context.
Word embeddings that capture the meaning of a word in context
The approach used in this paper to rely on continuous latent representations, in contrast to MeanSum, which treats the summary itself as a discrete latent representation of a product.
A method for correcting hallucinations in which named entities in a potentially hallucinated summary are replaced with ones with compatible semantic types that are present in the source, and variants of candidate summaries are created and ranked with a discriminative model trained to distinguish between faithful summaries and synthetic negative candidates generated given the source.
A mechanism that encourages the contribution from the conventional attention that attends to relevant parts of the source sentence, while penalizing the contribution from an opponent attention that attends to irrelevant or less relevant parts
Clearly incorrect summaries generated by a rule-based procedure, used to test how well neural abstractive summarizers can distinguish them from human-written abstracts.
Summaries that highlight the differences between two entities.
A module that encourages the model to better differentiate factual summaries from nonfactual ones by paying attention to the document using contrastive learning.
Codes used to condition summary generation based on sub-aspect functions, such as importance, diversity, and position.
A proposed model for abstractive summarization that incorporates entity information and generates summaries with selected entities, resulting in improved content accuracy and topic coherence.
A model that enables personalized generation of summaries and allows the reader to control important aspects of the generated summary, such as length, focus on entities, style, and portion of the article to be summarized.
Algorithms that allow for controlling various dimensions of the output summary, such as length, entities, and topics.
The ability to generate summaries that conform to specific topic distributions or sentiment polarity.
Text that scatters main points across multiple utterances and between numerous writers.
A method of summarization that formulates the problem as a decomposable row-sparsity regularized optimization problem.
Associates each word with a topic vector capturing whether it is representative of the document's content.
A unit used in global encoding to perform global encoding on the source context
A type of neural network architecture that uses convolutional layers instead of recurrent layers, which can be faster and more stable when processing long sequences.
Models that have scores that are coordinated with the actual quality metrics by which the summaries will be evaluated – higher model scores should indicate better quality summaries.
Aggregations that can be directly copied from the source text
A bias observed in pseudo summaries generated from a Seq2Seq teacher model, where more continuous text spans from original documents are copied than in reference summaries.
A neural component that allows a summarization system to copy words or phrases from the input text to the summary
The percentage of summary n-grams (sequences of words) appearing in the source text.
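This copy rate is straightforward to compute; a minimal sketch with naive whitespace tokenization (the example texts are illustrative):

```python
def copy_rate(summary, source, n=2):
    """Fraction of summary n-grams that also appear in the source text:
    a rough measure of how extractive (vs. abstractive) a summary is."""
    def ngrams(text):
        toks = text.lower().split()
        return [tuple(toks[i:i + n]) for i in range(len(toks) - n + 1)]
    summ, src = ngrams(summary), set(ngrams(source))
    if not summ:
        return 0.0
    return sum(g in src for g in summ) / len(summ)

source = "the quick brown fox jumps over the lazy dog"
summary = "the quick fox jumps"
print(copy_rate(summary, source, n=2))  # → 0.666... (2 of 3 bigrams copied)
```

Higher n makes the measure stricter: a purely extractive summary keeps a high copy rate even for large n, while an abstractive one drops off quickly.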
Tokens in the generated summary that are copied directly from the input document
An indicator of a word's saliency in terms of forming an impression, which can be learned via a sequence-tagger.
Relations between mention phrases of the same entity
The bias towards certain sub-aspect functions in different types of documents.
An estimation of the accuracy of generated summaries in abstractive summarization.
A module that removes hallucinations existing in reference summaries, allowing training on the full training set without learning unfaithful behaviors.
The process of comparing system-generated texts with human-generated reference texts to evaluate the quality of the system
The degree to which the generated summary matches the judgment of a human evaluator.
A technique used to evaluate the model's performance with different subsets of the input data.
A method for estimating the causal effect of language prior on the generated summary and removing it from the total causal effect
A proposed evaluation metric for text summarization that evaluates factual consistency via counterfactual estimation and does not rely on auxiliary tasks
Extrinsic hallucinations that introduce information that is not true in real life.
Added to prevent repetitions on long summaries
A model that takes as input the original document with keywords masked out and uses the current best automatically generated summary to try to uncover the missing keywords
A report generated by a script reader consisting of a logline, a synopsis, comments explaining its appeal or problematic aspects, and a final verdict as to whether the script merits further consideration
Used to track and control coverage of the source document, remarkably effective for eliminating repetition
A regularizer that helps to ensure that all parts of the source text are covered in the summary.
The relationship between the prototype document-summary pair used to obtain a summary pattern and prototype facts.
Applications where the summarization model is tested on data from a different domain than the one it was trained on
Evaluating a summarization system on a range of out-of-dataset corpora to test its generalization ability
A scenario where the model is tested on a dataset that is different from the one it was trained on.
The ability of the proposed model to perform well on another publicly available clinical dataset (OpenI).
A core step in extractive document summarization, where the relations between sentences are modeled to effectively extract summary-worthy sentences.
Particular biases in the data that the summarizer can learn to exploit.
The ability of a model to perform well with a small amount of training data
Includes synthesis and augmentation
Incomplete or irrelevant information in a dataset that can negatively impact model performance.
The degree to which data is accurate, reliable, and consistent, and must be evaluated either during dataset construction or post hoc.
A method of summarization that selects sentences that can best reconstruct the original document.
The limited availability of high-quality data for training and evaluating automatic summarization systems.
A property of the content selection model that can be trained with less than 1% of the original training data, providing opportunities for domain-transfer and low-resource summarization.
The tendency of a summarization system to perform better on certain types of datasets due to biases in the training data
The issue that released datasets do not contain both annotation and social information, making it challenging to evaluate summarization systems that consider both sentences and social messages.
A proposed method to effectively demote the lead bias learned by the neural news summarizer and improve its generalizability.
A desideratum that suggests the selected sentences should lead to the same decision as using the full text based on the model.
A summarization task that emphasizes supporting decision making by identifying the most relevant information for decisions.
A novel architecture that adds another ‘closed book’ decoder without attention layer to a popular pointer-generator baseline, such that the ‘closed book’ decoder and pointer decoder share an encoder.
A method suggested in a previous study to regulate the overconfidence of the decoder in a summarization model
Part-of-speech tagging (POS), Dependency Labeling (DEP), Semantic Role Labeling (SRL), and Named Entity Labeling (NEL) used to explore the information encoded in the role, filler, and TPR space.
The entropy of decisions made by the model during generation
A type of abstractive summarization framework that uses only a decoder, as opposed to an encoder-decoder architecture.
Reinforcement learning methods that use feedback to improve the quality of generated summaries
A neural extractive summarizer that estimates salience to guide the extraction procedure instead of learning an end-to-end mapping
A lightweight method for post-editing extractive summaries, which involves predicting whether articles in the summary should be kept as is or modified
The process of discussing and considering different options or viewpoints in order to make a decision or reach a conclusion
A graph built from sentences, from which a tree is decoded using integer linear programming and finally linearized to generate a summary sentence.
A graph that represents the syntactic structure of a sentence or set of sentences.
A rule-based sentence compression module that operates on the dependency parse of the answer sentence can yield better results than query-based extractive summarizers trained for the specific dataset.
A structure that naturally combines with the copy mechanism of an abstractive summarization system to encourage salient source words/relations to be preserved in summaries.
The relationships between words in a sentence that ensure grammaticality in compressed sentences.
The factors that need to be taken into account when designing human-AI interaction in text summarization and broader text generation tasks.
The potential areas for improvement in the design of human-AI interaction in text summarization and broader text generation tasks.
Properties of a summary such as capturing the most important information, being faithful to the original text, grammatical and fluent.
Optimization method that selects a diverse subset from a ground set of items, characterized by quality and diversity scores
Computerized systems that assist healthcare providers in accurately understanding a patient’s condition and reducing the effort in document review during time-sensitive hospital events.
Computer systems that can engage in conversation with humans.
The trend of diminishing benefits from intermediate pretraining as the amount of pretraining data increases
An approach for measuring centrality in single-document summarization that uses directed edges and considers the relative position of nodes
Augmenting the document graph with directionality and hierarchy to reflect the rich discourse structure of long scientific documents.
The unit of information used to relate sentences in automatic summarization, typically the head nouns of noun phrases.
Words or phrases that connect sentences or parts of sentences to show the relationship between them.
Techniques used to analyze the relationships between sentences in a text
Relations between Elementary Discourse Units (EDUs) within a document
Patterns in the structure of long scientific documents that are highly useful for determining sentence importance in summarization systems.
A method of representing text as a tree structure, where each node represents a sentence and its children provide additional information about the parent sentence
A model component that generates the summary and attends to different discourse sections
A method of ranking the importance of each sentence in terms of the number of descendants to generate a summary that focuses on the main review point
Probability distributions that take on a finite or countably infinite number of values
The ability to distinguish between relevant and non-relevant documents
The variables used in the deterministic transformations of existing seq2seq models, which limit the ability to represent latent structure information.
A transformer-based model that scores the factuality or discourse quality of candidate summaries using one of four different objectives
Improving human judgments of linguistic clarity and referential structure.
Interruptions, hesitations, and filler words such as "um" and "uh-huh" that occur in speech
A model that computes two scalar values, one from a content feature and the other from a context feature, to predict the popularity of a post and exclude the effect of context.
Constraints used to ensure the specific embeddings in each feature space.
Labels used to train models for summarization tasks that do not require manual labeling, such as categories of news articles and ratings of online reviews.
A pre-training method that uses filtered sentences of the documents as noisy targets to pre-train all the parameters of the NHG model.
A training strategy that uses external knowledge sources to provide supervision for the model, allowing it to adapt and generalize better.
A method of optimizing the information selection process
A reward signal that measures the similarity between the generated summary and the input based on their distributional semantics
The subject and structure of the document that the system should have a global view of in order to decide whether to choose a particular sentence.
A hierarchical architecture that suits the compositionality of documents
A layer in the model that encodes the document
A module that learns representations of documents.
The model consists of a document reader (encoder) and a sentence extractor (decoder).
A method for segmenting the document into facet-aware semantic blocks
The way a document is organized, including sections and paragraphs, can facilitate information searching, reading comprehension, and knowledge acquisition.
Topics, categories, sentiments, and other meta-information about a document.
Summaries of documents created based on a model of relevance for the topic
Generating a summary for a long document or multiple documents on the same topic
Features that depend on the specific document being summarized, such as term frequency or position.
Features that do not depend on the specific document being summarized, such as stopword ratio or word polarity.
A feature that captures global information and plays a key role in sentence selection for summarization.
Terminology specific to a particular domain.
differences in the language and terminology used in different domains or fields
A pre-training method based on an unlabeled substantial domain-related corpus.
The effectiveness of summarization models varies depending on the domain of the text being summarized.
Resources that are customized for a specific domain, such as chemistry, to aid in NLP pipelines.
Enables dropping out hallucinated entities from the predicted content plan and prompting the decoder with this modified plan to generate faithful summaries
Alleviates the problem of generating repetitive words and incomplete summaries, allowing the model to track comprehensive information for each salient facet of the source document.
Methods that focus on when to output eos (end of sequence), which marks the end of the summary. An ad-hoc method forces eos at test time by assigning a score of −∞ to all other candidate words at the position of the desired length; other methods learn the relationship between length and the decoder state at training time.
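The ad-hoc length-forcing trick can be sketched in a few lines. This is a minimal illustration over a toy score list; the function name and token ids are hypothetical, not any particular model's API.

```python
def force_eos_at(logits, step, desired_length, eos_id):
    """At the desired output position, assign -inf to every candidate
    word except eos, so the decoder must end the summary there."""
    if step == desired_length:
        return [0.0 if i == eos_id else float("-inf")
                for i in range(len(logits))]
    return logits

# Toy vocabulary of 5 word ids; id 4 plays the role of eos.
scores = [1.2, 0.3, -0.5, 2.0, 0.1]
masked = force_eos_at(scores, step=10, desired_length=10, eos_id=4)
```

At any other position the scores pass through unchanged, so decoding proceeds normally until the length budget is hit.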
The degree to which a reference summary is easy to summarize.
A method that maximizes a heuristically defined scoring function to evaluate the quality of the generated summary
A technique that generates a summary by operating an edit action (e.g., keep, remove, or replace) for each word in the input sentence
The three parts of an editorial (lead, body, and conclusion), each making a specific contribution to the overall argument.
Improvement in faithfulness over a baseline system operating at the same level of extractiveness
Summaries that convey the most important information in a text
Comparisons with multiple abstractive and extractive baselines, including traditional syntax-based systems, integer linear program-constrained systems, information-retrieval style approaches, and statistical phrase-based machine translation
The effectiveness of selecting sentence singletons and pairs for abstractive summarization.
The ability of the AI-assisted text generation system to generate summaries quickly and accurately.
The idea that the effectiveness of content consumption is not only determined by the information contained within it, but also by the tone and style of presentation.
The process of creating a shorter version of a longer document while retaining its key information.
A benefit of curriculum learning without the need for external data
Vector representations of text that encode meaning and allow the application of statistical and geometrical methods to words, sentences, and documents.
A framework for generating abstractive summaries using two complementary models, transformer and seq2seq.
The part of the model that encodes the source passage to a fixed-size memory-state vector.
A component of the model that processes the input sentence and produces a representation that is used by the decoder
Proposed models for extractive summarization using structured transformers that enable stepwise summarization by injecting the previously planned summary content into the structured transformer as an auxiliary sub-structure.
A type of neural network architecture used for sequence-to-sequence learning
A common approach for abstractive summarization based on encoding the original text sequence and decoding the summary sequence
Methods for abstractive summarization that use a neural network to encode the input and decode the summary
A type of attention mechanism used in sequence-to-sequence models where the decoder attends to the encoder's hidden states.
A framework that encodes a document and decodes its summary
A method widely used for single-document extractive summarization, where the encoder encodes each sentence into a vector representation and the decoder, using a top-k strategy, predicts probability scores for the sentence vectors, sorts the sentences in descending order, and picks sentences until the length limit is exceeded.
Modern neural summarization systems that aim at producing abstractive summaries and rely on the attention mechanism to focus on different parts of input during the decoding stage.
Benefit several tasks including document classification, natural language inference, and machine translation
The procedure of encoding a document into high-dimensional representations
Recent work used to instantiate the theoretical graph framework for conversation summarization.
A type of training where the entire system is trained together, rather than training individual components separately.
A framework that directly learns to detect summary-worthy content as well as generate fluent sentences, circumventing efforts in feature engineering and template construction.
Gives higher weight to summaries whose sentences logically follow from the ground-truth summary.
Summaries whose sentences logically follow from the ground-truth summary.
Knowledge that ensures a correct summary is semantically entailed by the source sentence
A model that uses entailment RAML training to encourage the decoder of the summarization system to produce summary entailed by the source
A model that incorporates entailment knowledge into summarization models by jointly modeling summarization generation and entailment recognition
Metrics that determine whether the content in the summary is entailed by the input document.
Ordered sequences of entities in the summary
A desirable summary aspect that is encouraged in reinforcement learning approaches.
Designing constraints that guide the generated summary to cover the salient information of user-specified entities
A technique that guides the model learning process by utilizing entity coverage precision between the training document and its reference summary as faithfulness guidance
A graph used for summarization where one set of nodes corresponds to entities
A local coherence model used for summary coherence evaluation
A coherence modeling method that constructs a grid to represent grammatical and semantic transitions of entities between sentences.
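A simplified version of the grid construction can be sketched as follows. This records only the presence or absence of entity mentions via token matching; real entity grids use coreference resolution and syntactic roles, which are omitted here.

```python
def entity_grid(sentences, entities):
    """Build a grid with one row per sentence and one column per
    entity; 'X' marks a mention, '-' marks absence. A full grid
    would also record syntactic roles (S/O/X), which requires a
    parser and is omitted in this sketch."""
    grid = []
    for sent in sentences:
        tokens = set(sent.lower().split())
        grid.append(["X" if e.lower() in tokens else "-" for e in entities])
    return grid

grid = entity_grid(["John saw Mary", "Mary left"], ["John", "Mary"])
```

Coherence models then score how entities transition between adjacent rows (e.g., mentioned-then-dropped vs. carried over).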
A problem in text summarization where a model-generated summary contains named entities that never appeared in the source document.
The type of hallucination that occurs when the generated summary contains entities that do not exist in the source document
A system used to extract linked entities from the original text
Nodes representing entities in the document
A method of sentence fusion that involves replacing entities in the original sentences with pronouns or other entities
A component of the proposed controllable neural model that identifies the most important entities and sends their representations to the summary generation phase.
An NLP task on TV show transcripts that focuses on identifying and tracking entities
Selects important sentences from the input that includes references to salient entities
Enables enhanced input text interpretation, salient content selection, and coherent summary generation
A majority of work on opinion summarization is entity-centric, aiming to create summaries from text collections that are relevant to a particular entity of interest, e.g., product, person, company, and so on.
Metrics proposed to evaluate the quality of generated plot summaries based on bags of characters and character relations
A measure of the amount of uncertainty or randomness in a system, used in Peyrard's method to model importance
A model used to treat extractive summarization as a multi-step process that is aware of the extraction history.
A mechanism that automatically detects errors in the generated after-visit summary, including missing medical events and hallucinations.
Issues faced by autoregressive models.
An open challenge, as decoders are prone to pathologies such as hallucination and/or omission of important information, which are hard to capture using existing evaluation metrics
The paper discusses the challenges of evaluating the quality of summaries and the importance of human evaluation as the gold standard.
A method of answer verification that compares the prediction to the expected answer by exact match
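A minimal sketch of exact-match verification, assuming the common normalization of lowercasing, punctuation stripping, and whitespace collapsing (an illustrative choice, not a fixed standard):

```python
import string

def normalize(text):
    """Lowercase, strip punctuation, and collapse whitespace:
    a typical normalization applied before exact-match comparison."""
    text = text.lower().translate(str.maketrans("", "", string.punctuation))
    return " ".join(text.split())

def exact_match(prediction, expected):
    """True iff the normalized prediction equals the normalized answer."""
    return normalize(prediction) == normalize(expected)
```

Under this scheme "Paris." matches "paris", while any extra or missing word makes the comparison fail.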
A type of automatic evaluation metric that counts the number of matching n-grams between the generated summary and a reference summary
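Such n-gram matching can be sketched as follows, here as a ROUGE-style recall over whitespace tokens; real implementations add stemming and other preprocessing:

```python
from collections import Counter

def ngrams(tokens, n):
    """Multiset of the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def ngram_recall(candidate, reference, n=2):
    """ROUGE-style recall: clipped count of matching n-grams divided
    by the number of n-grams in the reference summary."""
    cand = ngrams(candidate.split(), n)
    ref = ngrams(reference.split(), n)
    matches = sum(min(count, ref[g]) for g, count in cand.items())
    total = sum(ref.values())
    return matches / total if total else 0.0
```

For example, "the cat sat" shares two of the five bigrams of "the cat sat on the mat", giving a recall of 0.4.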
A paradigm that subsumes generic, query-based, question-based, and even abstractive summarization systems
A method of summarization that introduces an additional sentence dissimilarity term to encourage diversity in summary sentences.
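A well-known instance of adding a dissimilarity term is greedy maximal-marginal-relevance selection. The sketch below assumes Jaccard word overlap as the similarity measure and precomputed relevance scores; both are illustrative choices, not the cited method's exact formulation.

```python
def jaccard(a, b):
    """Word-set overlap between two sentences."""
    sa, sb = set(a.split()), set(b.split())
    return len(sa & sb) / len(sa | sb) if sa | sb else 0.0

def select_diverse(sentences, relevance, k, lam=0.7):
    """Greedily pick k sentences, trading off relevance against
    similarity to the sentences already chosen."""
    chosen = []
    candidates = list(range(len(sentences)))
    while candidates and len(chosen) < k:
        def score(i):
            redundancy = max((jaccard(sentences[i], sentences[j])
                              for j in chosen), default=0.0)
            return lam * relevance[i] - (1 - lam) * redundancy
        best = max(candidates, key=score)
        chosen.append(best)
        candidates.remove(best)
    return chosen
```

With a duplicate sentence in the pool, the dissimilarity term makes the selector skip it in favor of novel content.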
The process of identifying all possible content points in a text passage.
A technique that leverages a small pool of strong sampled candidates to smartly inform the reward function
The process of identifying the reasons behind a label
Specialized modules added to the factual consistency model that explain which portions of both the source document and generated summary are pertinent to the model’s decision.
Supervision signals that prevent the model from violating the specified attribute requirement
Incorporates external linguistic structure (e.g., coreference links)
A problem where the model is trained with teacher forcing, which leads to a discrepancy between training time and inference time.
Systems that are difficult to control to produce correct and fluent output.
A model that selects important content with lexical features and allows aggressive compression of individual sentences by combining two different formalisms.
A summary of a long document, typically containing 400-600 terms, that conveys more detailed information than a short summary
A large scale set of experiments comparing different approaches to summarization.
Data sources such as Web search results, clickthrough data, query logs and Wikipedia, comments from news readers, and tweets corpus used for improving summary quality.
Representations of external knowledge that can be used in summarization
An extractive model first selects a subset of opinions, and an abstractive model then generates the summary while conditioning on the extracted subset
A method that first extracts the summary-worthy sentences and then abstracts each of them, but suffers from an information loss in abstract stage and lacks an effective reinforcement learning framework to bridge together two modules.
A subaspect of summarization that is determined by how well an extractive model performs on the data, compression ratio between the source document and summary, and lead bias.
The process of automatically extracting a set of sentences that represent the information of a whole document by ranking the importance of sentence features.
A model that combines representations of the topic and partial summary with representations of the document sentences through an attention mechanism to extract one reference sentence.
A component of the hybrid architecture that selects salient sentences or highlights from the original document.
A new single-document summarization task that requires an abstractive modeling approach and aims to create a short, one-sentence news summary answering the question "What is the article about?"
Producing one or two summary sentences with extreme compression and a high degree of abstraction.
Refers to errors in summaries that are related to the source text and errors that are unrelated to the source text, respectively.
The type of hallucination that occurs when the generated summary contains information that is present in the source document, but is not relevant to the main content
The problem of centrality-based models tending to select sentences from one facet of a document, rather than important sentences from different facets.
A proposed evaluation metric for summarization that measures whether the system summary covers the facets in the reference summary
A model proposed in this paper to address the facet bias problem in document summarization.
Mappings from each facet (sentence) in the reference summary to its support sentences in the document
A method that forces a centrality-based model to select summary sentences from different facets by incorporating the relevance between the candidate summary and the document
A measure of information coverage in summarization based on facet overlap
Summarizing an article from distinct aspects including purpose, method, findings, and value.
A dataset consisting of 60,024 scientific articles collected from Emerald journals, each associated with a structured abstract that summarizes the article from distinct aspects including purpose, method, findings, and value.
A mechanism designed to prevent the generator from copying irrelevant facts from the prototype by providing mutual information between the generated summary and the input document.
Fact checking focuses on verifying facts against the whole of available knowledge, whereas factual consistency checking focuses on adherence of facts to information provided by a source document without guarantee that the information is true.
A framework proposed to assess the factual consistency of summarization models
Short sentences formed by merging the words in a triple or tuple to represent a fact.
A task that aims to verify the truthfulness of a given statement.
A type of question answering system that can be answered by a certain word or a short phrase.
Inaccuracies in the information presented in a summary compared to the source document
The process of verifying the credibility and usability of models by checking for factual consistency
The paper introduces entailment-based and semantic area RL rewards to analyze their effect on factual consistency and semantic coverage, ensuring that all factually relevant perspectives are captured.
A machine learning model trained to identify factual inconsistencies in summaries.
A model used to evaluate the factual consistency of a summary
Improvement in the factual consistency of summarization models
Existing abstractive summarization models are optimized to generate summaries that highly overlap with human references, but this does not guarantee factual correctness. Maintaining factual correctness of generated summaries remains a critical yet unsolved problem.
Can be objectively annotated with detailed classification of factual errors
A graph that contains all the entities with their relationships mentioned in the documents, extracted from the original documents to support the summarization task.
A model capable of leveraging fine-grained human annotations to detect errors in generated texts.
A metric that shows the degree of factuality degradation observed for different models.
A model used to evaluate the factual consistency of a generated summary
Measurements used to evaluate the faithfulness of abstractive summarization.
Models trained on synthetic data to detect and correct errors in generated summaries.
The accuracy of the generated summary in reflecting the content of the source document.
Summaries that accurately reflect the content of the source document.
Generating information in the summary that is not present in the original text
A measure of how well a summary conveys the same meaning as the original document
The quality of accurately representing the content of the source text
The degree to which a summary accurately conveys the main concepts of the original text.
An algorithm developed to address the problem of multiple optimal solutions and achieve near-optimal performance without the need for concept pruning.
Techniques used to understand the contribution of input features to the output of a model.
The process of selecting and designing features to be used in a machine learning model.
The ability of a model to perform well on a task with only a small amount of labeled data.
A type of content selection that guides the neural text generator to stitch selected segments into a coherent sentence.
Annotations at the word- or span-level used to detect errors and localize them within generated texts.
A metric developed to evaluate the output quality of the proposed method at the semantic vector level, assessing not only the relevance between model outputs and manual references but also extraneous and omitted information in generated summaries.
The process of adapting a pre-trained language model to a specific task using supervised learning.
A method of content selection that can benefit from splitting documents into sub-sentential segments following its discourse structure, as it provides a more refined granularity of semantic information.
Finetuning the model using a labeled dataset for the target task
Demands extra focus on controlling the length of output summaries; specifically, it requires generating summaries with a preset number of characters or words
The issue of a fixed number or proportion of summary sentences in the top-k strategy, which has been a popular method employed by many models. A flexible extractor should generate a non-fixed number of summary sentences based on source document length, topics, or other aspects.
A type of summarization model that does not take into account the word-sentence hierarchy.
The ability to customize content selectors to be combined with general-purpose neural text generators.
Structured transformers are flexible in terms of content type (e.g., text or tables) that can be modeled.
The proposed approach is easily adaptable from single-document to multi-document summarization tasks.
The ability of the architecture to extract any number of sentences with a threshold, instead of a constant number.
A model that ensures the generated language is fluent
A technique for improving document summarization by focusing on supported and topical content.
A technique for generating diverse yet topically consistent and faithful summaries.
A mechanism that employs a learnable Gaussian focal bias as a regularization term on attention scores to emphasize the corresponding part of the document
Produced by the model according to human evaluations
Summaries generated by the same model, tailored towards different levels of formality.
A stylistic parameter that can affect the tone and style of text, with formal language being more appropriate for certain audiences, such as corporate executives.
The degree to which the generated text conforms to a specific format.
The different ways in which a summary can be presented
A discussion initiated by a user on an online forum
The task of generating concise summaries of lengthy and comprehensive forum threads
The authors propose a framework where an external information extraction system is used to extract information in the generated summary and produce a factual accuracy score by comparing it against the human reference summary. They further develop a training strategy where they combine a factual correctness objective, a textual overlap objective, and a language model objective, and jointly optimize them via reinforcement learning (RL).
A type of generation where the model is not constrained to copying from the input document and can generate new text
Failing to understand the sender's intent and failing to identify the roles of the sender and receiver
Common data mining techniques for calculating frequent sequences of words in transactional data.
The original pre-training objective of PEGASUS that transforms any text into a pseudo-summarization dataset by selecting important sentences using ROUGE as output summaries.
A part of the information selection layer that filters unnecessary information in the original document
A type of activation function used in neural networks that can help prevent the vanishing gradient problem.
A bias term used in CoCoNet to model positional correlations between source words, considering the relative distances between them and the scope of the local context when copying.
A type of summarization that differs from scientific summarization in three main aspects. First, scientific papers are usually much longer than general articles. Second, the goal of scientific summarization is typically to provide a technical summary of the paper, including its important findings, contributions, or impact on the community. Finally, scientific papers follow a natural discourse structure.
The ability of a model to generate summaries that are applicable to different domains.
The ability of the model to perform well on different datasets and scenarios
Improved by saliency and entailment skills.
The ability of a model to perform well on new, unseen data.
A model that replaces the hard copy component of the pointer generator architecture with a soft "editing" function by learning a relation embedding to transform the pointed word into a target embedding.
The probability of generating the next word in a summary
The style in which the summary is generated, which may differ between domains.
The process by which neural summarizers generate summary text.
A component of the proposed pipeline framework that is trained to reconstruct reviews from their extracted opinion phrases and can then generate opinion summaries based on the selected opinions.
A summarization method that creates a single summary for a document, not taking into account user preferences or control aspects.
Describes how much attention all reference tokens should assign to each source token; it can be predicted from the source by training an attention-prediction model.
The overall coherence and consistency of the summary
A method for summarisation that considers the entire document at once
The overall information contained in a document
A model proposed to tackle the problems in attention mechanism in abstractive summarization
Methods that formulate summarization as a combinatorial optimization problem, selecting a subset of sentences that maximizes an objective function under a length constraint, and use Integer Linear Programming (ILP) to solve it exactly.
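For tiny inputs, the ILP optimum can be reproduced by exhaustive search, which makes the combinatorial formulation concrete. The word-level concept weights and length budget below are illustrative assumptions, not a particular system's objective.

```python
from itertools import combinations

def best_subset(sentences, weights, budget):
    """Exhaustively find the subset of sentences that maximizes
    weighted concept (word) coverage under a length budget: the
    same optimum an ILP solver would return, but only feasible
    for very small inputs."""
    best, best_score = [], float("-inf")
    for r in range(len(sentences) + 1):
        for subset in combinations(range(len(sentences)), r):
            length = sum(len(sentences[i].split()) for i in subset)
            if length > budget:
                continue  # violates the length constraint
            covered = set(w for i in subset for w in sentences[i].split())
            score = sum(weights.get(w, 0.0) for w in covered)
            if score > best_score:
                best, best_score = list(subset), score
    return best
```

In practice the same objective and constraint are encoded as binary variables and handed to an ILP solver, which scales far beyond brute force.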
A suggested training method for abstractive summarization models
Proposed method for neural summarization models to control the output length better than existing methods
A hypothesis of the highest probability among all possible sentences, consisting of words in vocabulary V.
A novel mechanism composed of attention scores and length rewards to guide beam search based on the predicted global attention distribution.
The overall meaning and context of a document.
A loss function introduced to directly optimize the attention from the global perspective by preventing assigning high weights to the same locations multiple times.
High-quality summaries created by expert writers, requiring a lot of effort
The summary of a document that is considered to be the most accurate and informative.
Annotations that are considered to be the most accurate and reliable.
A network that simultaneously updates the representations of sentence and topic nodes in a heterogeneous document graph.
A method that pushes close topic representations of documents and sentences that have high semantic similarity with the gold summary and pulls away otherwise.
A model that incorporates the graph contrastive topic model (GCTM) empowered by the semantic information of the gold summary and the global document context, with PLMs for long document extractive summarization.
A technique used in the decoding procedure of the graph-based encoder-decoder model to guide the summary generation process by incorporating the graph structure.
A structured representation that produces a summary and highlights the proximity of relevant concepts.
Intuitive way to model long-range dependencies among text spans throughout a document
Methods for summarization that use graph-based representations of the input
A novel approach inspired by graph-based extractive summarization methods, introduced in the encoder-decoder framework to discover the salient information of a document.
A model that improves both the document representation and summary generation process of the Seq2Seq architecture by leveraging the graph structure.
A method to impose structure information on BERT directly to jointly learn contextual representations of different text granularities within a single BERT
A method that uses sentences as nodes and weighted edges to represent the degree of similarity between sentences.
A technique for summarization that uses graphs to represent the relationships between sentences
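A minimal sketch of building such a sentence-similarity graph, using bag-of-words cosine similarity and an arbitrary edge threshold (both illustrative choices):

```python
import math
from collections import Counter

def cosine(a, b):
    """Cosine similarity between bag-of-words vectors of two sentences."""
    ca, cb = Counter(a.split()), Counter(b.split())
    dot = sum(ca[w] * cb[w] for w in ca)
    na = math.sqrt(sum(v * v for v in ca.values()))
    nb = math.sqrt(sum(v * v for v in cb.values()))
    return dot / (na * nb) if na and nb else 0.0

def similarity_graph(sentences, threshold=0.1):
    """Edges between sentence nodes whose similarity exceeds the
    threshold, weighted by the similarity score."""
    edges = {}
    for i in range(len(sentences)):
        for j in range(i + 1, len(sentences)):
            w = cosine(sentences[i], sentences[j])
            if w > threshold:
                edges[(i, j)] = w
    return edges

g = similarity_graph(["the cat sat", "the cat ran", "dogs bark"])
```

Centrality over this graph (e.g., PageRank-style iteration) then ranks sentences for extraction.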
A method of adding one sentence at a time incrementally to a summary, with the goal of maximizing the ROUGE score and stopping when no remaining candidate sentences improve the score.
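The greedy procedure can be sketched as follows, using unigram recall as a simple stand-in for ROUGE:

```python
def greedy_oracle(sentences, reference):
    """Add one sentence at a time, keeping an addition only if it
    improves unigram recall against the reference; stop when no
    remaining candidate improves the score."""
    ref = set(reference.split())
    def recall(indices):
        covered = set()
        for i in indices:
            covered |= set(sentences[i].split()) & ref
        return len(covered) / len(ref) if ref else 0.0
    selected = []
    while True:
        remaining = [i for i in range(len(sentences)) if i not in selected]
        scored = [(recall(selected + [i]), i) for i in remaining]
        if not scored:
            break
        best_score, best_i = max(scored)
        if best_score <= recall(selected):
            break  # no candidate improves the score
        selected.append(best_i)
    return selected
```

This is the standard way to derive extractive "oracle" labels from abstractive reference summaries.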
A method used to solve the maximal coverage problem by ranking sentences by their coverage of best compressing frequent word sequences and selecting the top-ranked sentences to a summary.
Summaries that are closely tethered to the original audio, providing a preview of notable podcast clips and reducing the risk of misleading or inaccurate information.
Factors that arise in summarization evaluation when one annotator rates multiple summaries
Specialized techniques employed during training to guide the model away from pathological behavior, including reducing repetition, encouraging the model to complete sentences, and avoiding frame filling patterns
Input to enhance factual consistency of the summary
Methods that provide various types of guidance signals to constrain the summary so that the output content will deviate less from the source document and allow for controllability through provision of user-specified inputs.
Signals extracted from the input source document such as keywords, highlighted sentences, and others to aid the model architecture in summarizing the input document.
A model proposed in this paper that combines extractive and abstractive models and uses keywords as guidance for the latter
Quantities that are not supported by the source text and are introduced by abstractive summarization models
A known shortcoming of current text generation and summarization models, which has been established for both abstractive and extractive summarization models.
A measurement of the risk of generating false or inaccurate information in the summary
Features that are manually designed by experts.
Attention methods that perform abstraction only on text regions that were initially selected by some extraction process
0/1 labels used in traditional training methods.
A proposed approach for extractive summarization that introduces more semantic units as additional nodes in the graph to enrich the relationships between sentences.
Aspects of text semantics that can be derived from word embeddings computed from a general corpus.
A method for question-driven extractive answer summarization that integrates hierarchical interaction information between question-answer pairs in both word-level and sentence-level into a sequential extractive summarization model.
A technique used in the student model of DistilSum to encode sentences into hidden vectors.
A novel supervised thread summarization approach that learns effective sentence and thread representation by attending to important words and sentences
Learnable biases that adjust attention weights between tokens based on their relative positions with regard to the document structure, enabling summarizers to capture long-range relatedness for better document understanding.
A type of decoder architecture that generates a summary from latent sentence representations.
A proposed method to tackle the constraints of saliency, nonredundancy, information correctness, and fluency under a unified framework.
A model component that captures the discourse structure of the document
A model proposed in this work that consists of a summarization layer and a sentiment classification layer.
A graph constructed from a document where both words and sentences are nodes and the relations between them are constructed as different types of edges.
Incorporated into word embedding and paragraph attention to expose the critical words and paragraphs for summarization.
A model proposed by Cheng and Lapata (2016) to encode a document and predict binary labels for each sentence
Deep learning based sequence learning methods for summarization
End-to-end deep learning framework leveraging encoder-decoder transformer architecture
Required to comprehend the hierarchical, multi-faceted information within the article, encoding the temporal dependencies with different timescales.
Captures multiple hierarchical levels of abstraction of the source document, encoding the temporal dependencies with different timescales.
Networks that model a document as a sequence of sequences to tackle the challenge of modeling long-range inter-sentence relationships for summarization.
A neural network used for rough reading which consists of a neural net to encode the whole document and another one to capture features in paragraphs.
A new summarization task that produces hierarchically organized question-summary pairs to facilitate information consumption, inspired by the top-down knowledge learning process.
Incorporates topic information into word embeddings and paragraph attention.
A model that fully embeds the global context of long documents, to inform topic representations of documents and sentences.
The crucial criterion for selecting a good evaluation measure
Data that is accurate, reliable, and consistent, and forms the foundation for building meaningful statistical models in NLP.
A metric for which a small difference in score reliably indicates a difference in quality
A summary that is shorter than the original document, conveys only the most important and no extraneous information, and is semantically and syntactically correct
Text operations such as paraphrasing, generalization, text reduction, and reordering that pose a considerable challenge to natural language understanding.
The range of scores where summarization systems aim to perform well.
The ability of a model to holistically learn document-level coherence properties, such as saliency, redundancy, and ordering, embodied in the gold summaries.
Used to derive labels for extraction units, but not necessarily the best method due to generalization, paraphrasing, and words not present in the source text.
The process of manually marking or labeling data by humans.
Two evaluations conducted to assess which type of summary participants prefer and how much key information from the document is preserved in the summary.
A method of training language models by collecting human preferences between pairs of summaries and using reward learning to fine-tune the model.
The gold standard for evaluating the quality of generated summaries, but is time-consuming and labor-intensive.
Evaluation of the generated summaries by human judges.
People who evaluate the quality of summaries.
Summaries written by humans that are used as a benchmark for evaluating system-generated summaries.
Can improve summarization performance by taking advantage of complementary strengths.
The interaction between humans and artificial intelligence in the context of text summarization and broader text generation tasks.
A method of incorporating human input to improve the performance of a system.
A concept that allows humans to actively participate in supervising AI systems by approving, rejecting, or re-labeling current outputs, and providing expert-guided advices to the system. It also acts as the unique source of external knowledge from humans.
A three-phase process that includes general understanding of the document, task-specific reading comprehension, and polishing process. It involves leveraging prior knowledge, making inferences, and evaluating and polishing the generated summary.
A representation of the input source text in an auto-encoder architecture that complies with human grammar and can be comprehended by humans
Typically high quality, but require heavy cognitive load due to limited time and energy.
A combination of both extractive and abstractive summarization methods, using policy-based reinforcement learning to bridge the two networks.
A novel approach that mimics the human-like reading strategy for abstractive text summarization. It consists of three components.
A data-driven, end-to-end enhanced encoder-decoder based deep network that summarizes a news article by extracting salient sentences.
Text summarization methods that combine both abstractive and extractive approaches.
Facilitates copying words from the source text via pointing, which improves accuracy and handling of OOV words, while retaining the ability to generate new words
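This copy/generate mixture can be sketched at the token level as follows (a simplified sketch of the pointer-generator's final distribution; the function name and toy vocabulary are illustrative assumptions):

```python
def final_distribution(p_gen, vocab, vocab_dist, src_tokens, attention):
    """P(w) = p_gen * P_vocab(w) + (1 - p_gen) * sum of attention over
    source positions holding w; OOV source words gain probability via copying."""
    dist = {w: p_gen * p for w, p in zip(vocab, vocab_dist)}
    for tok, a in zip(src_tokens, attention):
        dist[tok] = dist.get(tok, 0.0) + (1 - p_gen) * a
    return dist
```

Because attention mass is redistributed onto source tokens, a word like an out-of-vocabulary name can still receive nonzero probability even though the generator's vocabulary does not contain it.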
A combination of token-level and sentence-level representations used to encode text.
A model used in the document summarization process to import both structural-compression and structural-coverage regularizations.
Combines coreference based dependency graph and latent structure attention module output
A combination of extractive and abstractive methods that employ specialized components.
Summarization methods that combine extraction and abstraction techniques
The process of selecting the best parameters for a model.
A task of selecting the best output from multiple models, related to text-generation tasks.
Statistical tests used to determine whether the difference between two metrics' correlations is reflective of one metric being better than the other or if it is an artifact of random chance.
Texts are divided into IUs to deal with complex sentences that convey multiple ideas. The IU is defined as a minimal fragment of a text that conveys an "idea" or "thought" coherently.
Stressed by the poor correlation between automatic metrics and human judgment
The paper proposes the use of "the probability of being quoted" as an alternative aspect for summarization. The ability to extract quotes can help extract important sentences irrespective of how frequently the same topic appears in the text.
A section of the radiology report that summarizes the most prominent observation.
A novel loss function to encourage the consistency between two levels of attentions in the unified model
Incorporating source syntactic structure in neural sentence summarization to help the system identify summary-worthy content and compose summaries that preserve the important meaning of the source texts.
Information in the summary that is not accurate or true to the source text
A measure of centrality that assumes a word receiving more relevance score from others is more likely to be important.
Summaries that require the reader to infer information not explicitly stated in the original text.
The time it takes for a model to generate an output.
A mechanism proposed in this work that works on encoder-decoder attentions to underscore salient content from the source and improve the quality of abstractive summaries.
Techniques used to understand the effect of individual training examples on the model's output.
The focus of the methodical empirical evaluation, which assesses the summary quality with reading comprehension tasks and compares favorably with automatic metrics against state of the art.
The process of selecting important information from a document for summarization
Two-stage methods that first extract the top l most important tokens from the source document as a prototype summary, where l is the desired length, and then encode the original source document and the prototype summary with a dual-encoder.
Measures used for controllable summarization, such as relevance and redundancy.
A technique used to understand the contribution of input features by computing the gradients of the output with respect to the input.
A novel method for training models with stylistic feedback on sampled and ground-truth summaries together.
An RL-based summarization system that employs a handcrafted reward function to learn a policy specifically for a given input, without requiring parallel data or a reward oracle.
The degree to which different automatic metrics agree in ranking summaries.
A reasoning module in the encoder that infers useful information from historical summaries to learn a more comprehensive representation for the given input review.
The ability to connect sentences in a summary to create a coherent text
A framework that continuously collects user-feedback to improve model prediction robustness. The user's intention is implicitly acknowledged as a factor influencing the extraction of important sentences from the source documents. The AI model is trained with human-produced summaries and adapted as more human-feedback is fed in.
A concept map interface with which users can query relationships.
A method that efficiently gathers user feedback and combines it with predictions from pretrained, generic models to solve text ranking tasks.
Supervised pretraining using labeled datasets from different domains for a task that is related to or is the same as the target task
A visualisation of the semantic coverage of a generated summary by visualising the transport plan between summary tokens and document tokens.
A personalized attention mechanism that selects informative words in the input review.
Attention that records previous attention weights for each of the input tokens
The type of hallucination that occurs when the generated summary contains information that is not present in the source document
Metrics that are based on the properties of the dataset itself, rather than external factors, and are used to evaluate the quality of summarization datasets.
The problem of performing the two tasks of relevance ranking and saliency ranking in isolation.
Problems with verbosity and coherence, as well as coreference and pragmatic context issues.
A process of refining sentence representation by fusing redundant information between selected sentences through iterative refinement, which is supervised by knowledge distillation.
Iterative Text Summarization (ITS) is an iteration-based summary generator that uses a sequence classifier to extract salient sentences from documents. It consists of a novel "iteration mechanism" and a "selective reading module": the model reads through the document many times, and in each iteration an encoder, a decoder, and an iterative unit work together to polish the document representation. The final labeling step uses the outputs from all iterations to generate summaries.
A news structure commonly used in journalism where a narrative hook is presented first to catch the reader's attention, followed by the main story presented in a Synopsis and Body Section.
A short list of high-level arguments summarizing the pros and cons of a proposal, where the salience of each key point is represented by the number of matching arguments. Key points can be composed by domain experts without looking at the arguments themselves, and are used for measuring the distribution of key points in a large collection of arguments, interactive exploration of arguments, and novelty detection. The proposed method demonstrates the feasibility and effectiveness of summarizing a large set of arguments by mapping them to a small set of key points. The ArgKP dataset, the first dataset for the argument-to-key-point mapping task, comprises about 24,000 (argument, key point) pairs labeled as matching/non-matching; an empirical evaluation and analysis of various classification methods is performed.
a type of summarization that incorporates knowledge structures to improve the quality of the summary.
The process of constructing a graph that captures interactions among entities in a document.
The delay between the news exposure and the tweets linking to it, which might affect the summarization performance.
Alignment relations that cannot be identified by the standard pointer generator.
A discourse tree that is induced without a parser
A model for topic modeling where topic probabilities are assigned to words in documents
Topic modeling method for summarization
A representation of chemical texts produced by deep learning models.
A higher-level representation of the input document that captures the main ideas and concepts.
The inherent structures that people may naturally follow when they write abstractive summaries, such as "What", "What-Happened", "Who Action What", etc.
A model that views labels of sentences in a document as binary latent variables and directly maximizes the likelihood of human summaries given selected sentences
Sentences in news articles that contain key information and are placed early due to journalistic conventions
A baseline method for summarization that selects the first three sentences of the original text as the summary.
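This baseline is trivial to implement; the sketch below assumes a naive period-based sentence splitter, which real implementations would replace with a proper sentence tokenizer:

```python
def lead3(text):
    """Naive Lead-3 baseline: split on periods and keep the first three sentences."""
    sentences = [s.strip() for s in text.split('.') if s.strip()]
    return '. '.join(sentences[:3]) + ('.' if sentences else '')
```

Despite its simplicity, Lead-3 is a strong baseline on news corpora because of the lead bias of journalistic writing.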
A bias observed in pseudo summaries generated from a Seq2Seq teacher model, where the summaries tend to summarize the leading part of a document.
Algorithms that approximate the ground-truth reward oracle from weak supervisions, such as numeric scores indicating the quality of the summary or preferences over summary pairs.
A decoder that does not have complete context when predicting each word
An effective length controlling unit that allows summarizers to generate high-quality summaries with a preset number of characters, breaking the trade-off between length controllability and summary quality
A constraint that the summary be as long as the width of target devices such as smartphones and digital signage.
Dividing summary length into disjoint length bins and restricting the summary length according to the desired length bin
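The binning step can be sketched as a simple discretization (the bin size and bin count here are illustrative assumptions, not values from any particular paper):

```python
def length_bin(num_tokens, bin_size=10, num_bins=5):
    """Map a summary length to a discrete bin id; the last bin is open-ended."""
    return min(num_tokens // bin_size, num_bins - 1)
```

At training time the gold summary's bin id is fed to the model as a control signal; at inference time the user picks the bin corresponding to the desired length.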
The ability to generate summaries with a preset number of characters or words
Added to the Entail reward to avoid misleadingly high entailment scores for very short sentences.
A metric used to evaluate the ability of a summarization model to generate summaries of different lengths.
Extends a transformer seq2seq model with the ability to select information in the context according to the length constraint. LAAM re-normalizes the attention between encoder and decoder to boost the tokens with higher attention scores based on the desired length, helping with selecting length-aware information from source document.
A dataset created by predefining the length ranges and constructing extractive summaries within different length ranges, which helps model to select different information from source document via desired lengths.
A method based on dynamic programming to satisfy the constraint of output lengths
A multi-objective optimization problem that includes generating complete summaries within desired lengths and selecting proper information to summarize based on desired lengths. Existing length-controllable summarization based on encoder-decoder models can be divided into two categories
The degree to which a summary is concise and does not only copy long passages from the source document
Relating to the vocabulary or words used in a language.
A measure of the importance of a passage in a text where the sentences of the document are connected by the similarity of their vocabularies.
The process of selecting the specific words to use in the summary
Simple features used to pre-compute review relevance in training, as opposed to deep encoder representations, allowing for selection of reviews from large collections without a significant computational burden.
The frequency of occurrence of words in a text.
A commonly used evaluation metric for summarization that measures the n-gram overlap between system and reference summaries
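The recall variant of this overlap can be sketched as follows (a minimal sketch; the official ROUGE toolkit additionally applies stemming, stopword handling, and other preprocessing):

```python
from collections import Counter

def ngram_counts(tokens, n):
    """Multiset of n-grams in a token sequence."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def rouge_n_recall(system_tokens, reference_tokens, n=2):
    """Clipped n-gram overlap divided by the number of reference n-grams."""
    sys_ng = ngram_counts(system_tokens, n)
    ref_ng = ngram_counts(reference_tokens, n)
    total = sum(ref_ng.values())
    return sum((sys_ng & ref_ng).values()) / total if total else 0.0
```

The `Counter` intersection clips each n-gram's count at its reference count, so repeating an n-gram in the system summary does not inflate the score.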
The use of the same words or phrases in different texts
The use of different expressions to communicate the same or similar meanings
The analysis of both the lexical and semantic aspects of a text
A rating scale used to measure attitudes or opinions
A framework for analyzing factuality in summarization systems based on frame semantics and linguistic discourse theory
Included in the study of coherent abstractive summarization
A modeling approach for abstractive summarization
Entities found in the original text that can be used to infer the summary topic
Information about the immediate surroundings of a token in a document that contributes to producing high-quality summaries with adequate salient information
A method of exploiting standard transformer models by constraining the attention mechanism to be local, allowing longer input spans during training.
A part of the information selection layer that selects important sentences while generating each summary sentence sequentially
A loss function introduced to encourage the model to put most of the attention on just a few parts of input states at each decoding step.
A method proposed by ETC and Longformer to reduce computational overhead of fully-connected attention in Transformers by limiting each token to attend to a subset of other tokens.
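The core idea can be sketched as a sliding-window attention mask (a toy boolean mask only; the actual ETC and Longformer implementations also add global tokens and dilated patterns):

```python
def local_attention_mask(seq_len, window):
    """Boolean mask: token i may attend to token j only if |i - j| <= window."""
    return [[abs(i - j) <= window for j in range(seq_len)]
            for i in range(seq_len)]
```

With a fixed window, the number of allowed attention pairs grows linearly in sequence length instead of quadratically, which is what makes long-document inputs tractable.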
Biases in summarization models that result from the tendency of key sentences to be located at the beginning of the text.
The ability to logically follow and entailed by the input document
Summarization of documents with input sequences that are longer than the limits of transformer models.
A complex summarization scenario that poses challenges to Seq2Seq models due to the even distribution of numerous details and salient content.
Tasks that involve processing long documents, such as scientific paper summarization and long-text reading comprehension, are challenging in natural language processing due to the varying details and subjects covered in such documents.
Document types that are significantly longer and structured, such as scientific papers
Modelling concepts that span more than a few sentences, which is still a challenging task
Techniques that rely on long-range dependencies within a large local context for sentence ranking tasks.
Techniques that estimate word probabilities by considering long-range dependencies within a large local context.
Dependencies among distant tokens in a document that contribute to producing high-quality summaries with adequate salient information
Summaries with more sentences
A technique for modifying the loss computation during training to alter the learned behavior of summarization models
A key challenge in summarization is to optimally compress the original document in a lossy manner such that the key concepts in the original document are preserved, whereas in machine translation, the translation is expected to be loss-less.
The process of summarization, which involves compressing a text while retaining its meaning.
Another common drawback of neural abstractive summarization models where the generated content is inconsistent with the source document, also known as hallucination.
A common drawback of neural abstractive summarization models where the generated summaries fail to capture critical facts in the source document.
Languages that have limited resources and tools available for natural language processing tasks
Units that are less informative than units connected with verbs
An approach that approximates the co-occurrence matrix to tackle the challenge of lexical variety in summarizing student feedback
A setup where there are limited training examples or no training examples available for the downstream task.
A method of selecting the most frequent output from multiple models in post-processing.
A high time and cost demanding task that is often replaced with multiple choice or short answer questions in modern English exams.
An earlier approach for extractive summarization that involves implementing graphs and integer linear programming.
Parameters that are adjusted by hand to optimize the performance of a model
The paper addresses the lack of relevant datasets and techniques for effective answer summarization by developing a human-annotated dataset for multiperspective abstractive answer summarization.
Randomly masks some sentences and predicts the missing sentence from a candidate pool.
A STEP objective that learns to recover a masked document to its original form.
A novel method proposed in the paper for generating negative summaries.
A technique used to constrain copying words to the selected parts of the text, which produces grammatical outputs.
An unsupervised approach for extractive text summarization where the extract that maximally covers the information contained in the source text is selected.
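Since exact maximal coverage is intractable, a common surrogate is greedy selection; a minimal sketch under a bag-of-words notion of "information covered" (the tokenization and budget are illustrative assumptions):

```python
def greedy_coverage(sentences, budget):
    """Greedily add the sentence covering the most not-yet-covered words."""
    covered, selected = set(), []
    while len(selected) < budget:
        gains = [(len(set(s.split()) - covered), i)
                 for i, s in enumerate(sentences) if i not in selected]
        if not gains:
            break
        gain, i = max(gains)
        if gain == 0:
            break
        selected.append(i)
        covered |= set(sentences[i].split())
    return sorted(selected)
```

Because coverage is submodular, this greedy procedure comes with a well-known (1 - 1/e) approximation guarantee relative to the optimal selection.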
A training method used to optimize the parameters of a model to maximize the likelihood of generating the correct output.
A method used in the mixed objective to optimize the n-gram overlap with the ground-truth summary in a summarization model
A loss function used in prior work to train the model
A representative example of early work in extractive summarization for selecting sentences based on both their relevance (to the central theme of the document) and the diversity of the selected sentences.
An approach for extracting relevant sentences into a summary that represents documents as a sequential transactional dataset and then compresses it by replacing frequent sequences of words by codes.
A form of content representation for factuality evaluation.
A metric used to indicate the popularity of a post in social media, such as the number of votes, shares, or bookmarks.
A system that takes a meeting recording and its transcript as input and produces a concise text summary as output, which preserves the most important content of the meeting discussion.
Difficulty in scaling complex summarization models to long-form documents due to memory concerns.
A technique used to preserve salient information learned from previous windows and enrich local texts
The use of memory networks and convolutional bidirectional long short-term memory networks to capture better document representation.
The process of evaluating the quality of evaluation metrics.
How similarly a metric replicates human judgments of systems
The consistency and accuracy of automatic evaluation metrics in measuring the quality of summaries.
An approach to training Seq2Seq models that directly optimizes the model with the corresponding evaluation metrics.
Two kinds of methods proposed to adapt DPPs to large scale computing
A game where the generator G and the discriminator D are optimized, with D trying to distinguish the ground truth summaries from the generated summaries by G, and G trying to maximize the probability of D making a mistake
A principle defining the best summary as the one that leads to the best compression of the text by providing its shortest and most concise description.
An alternative to RL-based training, used in language generation tasks, but the accuracy of the estimated loss is restricted by the number of sampled outputs.
Used to optimize a model globally for an arbitrary evaluation metric
Language models that generate summaries that contain errors or inaccuracies.
The discrepancy between the objective of fine-tuning language models (maximizing the likelihood of human-written text) and the objective of generating high-quality outputs as determined by humans.
Conclusions that are based on incomplete or biased information.
A discrepancy or lack of alignment between the current research directions in automatic summarization and the needs of users, specifically university students
Relevant concepts that are not displayed in the search result summary
Important details such as medication dosage and route that are left out in a summary, undermining patients’ medical self-management.
A method of training an abstractive summarizer to generate summaries with varying amounts of copied content by gradually transitioning from copying only to both copying and generating new words not present in the source text.
Statistical models that account for both fixed and random effects
An objective function used to capture importance, non-redundancy, and coherence in automatic summarization.
An objective that jointly optimizes the n-gram overlap with the ground-truth summary while encouraging abstraction in a summarization model
A multi-task learning framework that optimizes jointly over several measures.
Maximum likelihood estimation, a standard training approach for seq2seq learning
Maximum Marginal Relevance, a strategy to control redundancy
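The classic MMR trade-off between relevance and redundancy can be sketched as follows (a minimal sketch; here precomputed relevance scores stand in for query similarity, and the similarity function is supplied by the caller):

```python
def mmr_select(candidates, relevance, sim, k, lam=0.7):
    """MMR: score(i) = lam * relevance[i]
    - (1 - lam) * max similarity to already-selected candidates."""
    selected, remaining = [], list(range(len(candidates)))
    while remaining and len(selected) < k:
        def score(i):
            redundancy = max((sim(candidates[i], candidates[j])
                              for j in selected), default=0.0)
            return lam * relevance[i] - (1 - lam) * redundancy
        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected
```

With lam close to 1 the selection is purely relevance-driven; lowering lam increasingly penalizes candidates similar to what is already in the summary.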
The speed and ease with which a model can be created.
A method of combining multiple models to improve performance, often used in text-generation tasks.
Human-generated summaries used as a benchmark to evaluate the quality of system-generated summaries.
An evaluation approach that does not rely on human-generated model summaries.
A proposed architecture that can dynamically select salient input sentences to constrain the encoder-decoder attention without having to compute complete attention at inference time.
Variants of the transformer architecture that reduce the quadratic complexity of the self-attention mechanism.
An objective that leverages information about error spans in gold summaries, derived from factuality models, to train the summarizer.
Summarization of text in a single language.
A method of statistical analysis that uses random sampling to simulate possible outcomes
Leads to better performance when the decoder interacts with multiple agents
An automatic evaluation metric that performs well on the TAC dataset but is significantly worse than ROUGE-2 on the CNNDM dataset.
A policy used to select sentences during careful reading, with an adapted termination mechanism to select various but proper numbers of sentences.
A network designed to explicitly integrate summary-related features, like sentence semantics, importance, and position.
Articles from a variety of genres, including newswire, academic papers, movie scripts, and product or restaurant reviews.
The problem of correcting multiple erroneous facts in generated summaries.
A model that exploits the recent success of the encoder-decoder framework to generate aspect/sentiment-aware review summaries. It uses a mutual attention mechanism to capture the correlation of context words, sentiment words, and aspect words, and explores three kinds of attentions (i.e., semantic attention, sentiment attention, and aspect attention) to selectively attend to the context information when decoding summaries.
A feature in Transformers where multiple attention heads are used at all layers to highlight salient content.
Considering the semantic relevance to the question as well as the information consistency among different sentences to enable human-like multi-hop reasoning in question-driven summarization.
A task where data points are assigned multiple labels
A dataset where each data point can have multiple labels
A selector architecture that can be used for content selection in summarization.
Novel multi-task learning architectures
A novel memory network model for abstractive summarization that stores information from different levels of abstraction and can build representations of multiple ranges.
An approach to topic-focused summarization that uses multi-modality manifold ranking.
State-of-the-art solutions in summarization utilize multi-objective training strategies, including reinforcement learning techniques.
Conversations involving more than two participants.
The paper emphasizes the importance of multi-perspective summaries that cover the varying perspectives found in the answers to a question.
Datasets with multiple summaries, which are unrealistic to expect at scale for neural network training
Optimizes multiple rewards simultaneously in alternate mini-batches.
Including paragraph-level and document-level ones, induced to capture local and global semantic and syntactic information of a document.
Summarization tasks where the goal is to generate a summary consisting of multiple sentences
Proposed methods for generating meeting summaries
Utilized in current approaches to text summarization
Using multiple tasks to fine-tune a model for a downstream task.
Proposed approach to solve the problem of extractive summarization of the MD&A section of 10-K reports
An approach that involves training a model to perform multiple tasks simultaneously
A technique proposed in this work to obtain different representation of the texts for summarization and sentiment classification.
Addressing the repetition issue along with the multiview pointer network and generating informative answers.
A framework that can effectively incorporate multiple guided signals for the scientific document summarization task.
A learning paradigm in semi-supervised learning that encourages models to learn from multiple views of the same data.
A type of machine learning where the input data consists of bags of instances, rather than individual instances
The problem that arises when reducing the number of concepts in the model, allowing different sentences to have the same score, and ultimately leading to multiple optimal summaries.
A type of graph that involves multiple types of edges to capture different types of relationships among sentences and words.
A method that ranks sentences through an extractive labeling-based module and an attention-based module.
A machine learning approach where a model is trained to perform multiple tasks simultaneously
Terms that describe molecules and techniques in chemistry that can be accurately substituted for chemical formulae or accurate hyponyms like ketone without ontological knowledge.
Surveys conducted by a municipality to gather feedback from citizens
Metrics such as ROUGE that poorly reflect human preference for summarization
The amount of copying performed by different models when generating summaries.
A model used to convert traditional CNN architecture into an unsupervised learning regime
A commonly used automatic evaluation metric based on n-gram overlap.
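As an illustration, a minimal sketch of ROUGE-N recall (n-gram overlap with the reference), assuming whitespace tokenization; real implementations add stemming, stopword handling, and precision/F-measure variants:

```python
from collections import Counter

def rouge_n_recall(reference: str, candidate: str, n: int = 1) -> float:
    """ROUGE-N recall: overlapping n-grams / total n-grams in the reference."""
    def ngrams(text):
        tokens = text.lower().split()
        return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))
    ref, cand = ngrams(reference), ngrams(candidate)
    if not ref:
        return 0.0
    overlap = sum(min(count, cand[gram]) for gram, count in ref.items())
    return overlap / sum(ref.values())

# 3 of the 4 reference unigrams appear in the candidate.
print(rouge_n_recall("the cat sat down", "the cat sat"))  # → 0.75
```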
Queries for health-related content in the form of natural language
Outputs that are close to optimal in terms of the objective function being optimized.
Models that look up similar transcripts or recaps
A novel baseline introduced in this work that uses a sentence containing information that the summarization model should not focus on.
A technique used in machine learning to train models to distinguish between positive and negative examples.
Summaries that are factually inconsistent with the source text.
A model that estimates the coherence degree between two sentences by their distributed representation in an end-to-end fashion.
Recent advances that led to a number of single-document summarisation systems that exhibit some level of abstraction in their outputs
A type of neural network used for natural language processing tasks
A framework used to generate summaries tuned to the target topic of interest.
Produces a short summary for a document by selecting a set of representative sentences.
Over-emphasize sentence importance and pay little attention to reducing redundancy in the selection phase.
State-of-the-art systems that use machine learning to automatically generate summaries
A novel approach that exploits pre-trained LMs for sentence classification in abstractive summarization
A novel neural framework for the identification and extraction of salient customer opinions that combines aspect and sentiment information and does not require unrealistic amounts of supervision.
The process of automatically generating a headline based on the text of the document using artificial neural networks.
Models that can provide an approximation of the global semantics captured from document contents, i.e., latent topics, as well as their posterior topic representations.
Models that use a seq2seq framework to generate a summary after encoding a full document.
Summarization models that use encoder-decoder structures with either recurrent neural networks or Transformer.
Five defined elements in a news story, including two ledes (Standard Lede and Image Lede) and three other categories (Synopsis, Narration, and Body Section), each with a specific function in building a news story and featuring a writing style (narrative or expository).
The process of creating a brief and concise summary of news articles.
A STEP objective that generates the next segment of the original document given the first segment of a document.
The process of selecting nodes (i.e., sentences) that are semantically similar to other nodes to be included in the final summary.
Techniques introduced to make both the teacher and student robust to noise
A probabilistic approach for sentence-level and document-level compression
Extracting sentences independently without using previous predictions
A method that utilizes non-autoregressive decoders to generate all output tokens in parallel
A decoder that extracts a non-fixed number of summary sentences simultaneously and individually, formulated as an alternative to extracting sentences one by one to form a top-k summary.
A target distribution for abstractive models in which candidate summaries are also assigned probability mass according to their quality, as opposed to the one-point deterministic distribution assumed by MLE training.
Questions that require detailed analysis to explain or justify the final answers, such as questions in community QA or explainable QA.
Part of the current research setup for text summarization
The supplemental information that explains the introductory information in more detail
A type of QA that deals with questions that do not have a straightforward factual answer, such as opinion-based or explanatory questions.
Approaches that use frequency and information theoretic measures as proxies for content salience in summarization
ROUGE scores that are adjusted to account for differences in summary length.
A technique that combines deep learning models of encoder-decoder architecture and semantic-based data transformations to generate summaries in a generalized form.
Aggregations that are not found in the source text and must be generated by the summarization system
Words that are not present in the source document but are generated by the model in the summary.
A concept in summarization that refers to the degree to which a sentence contributes new information to the summary
The final summary is formed by maximizing both novelty and informativeness of the sentences in the summary.
The degree to which a model generates novel output
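One common way to quantify this is the novel n-gram ratio: the fraction of summary n-grams that do not appear in the source document. A minimal sketch, assuming whitespace tokenization:

```python
def novel_ngram_ratio(source: str, summary: str, n: int = 2) -> float:
    """Fraction of summary n-grams that are absent from the source document."""
    def ngrams(text):
        tokens = text.lower().split()
        return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}
    src, summ = ngrams(source), ngrams(summary)
    if not summ:
        return 0.0
    return len(summ - src) / len(summ)

# 2 of the 3 summary bigrams do not occur in the source → ≈0.667
print(novel_ngram_ratio("dogs bark loudly at night", "dogs bark very loudly"))
```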
The process of aggregating customer satisfaction scores across different aspects of an entity
Information gathered by the care team that is based on observable and measurable data.
A technique used to understand the contribution of input features by masking them and observing the change in the model's output.
A reinforcement learning algorithm that uses a pre-collected dataset to train a model.
Existing components for tasks other than query-based summarization can be competitive with state-of-the-art methods in the field.
Frameworks such as entailment models or QA systems that have been explored in past work to detect and correct errors in generated summaries.
A summarizer that mimics how a human might approach a lengthy transcript, identifying important and relevant portions to produce a new summary piece.
The assumption that model output is evaluated based on similarity to general-purpose reference summaries reflecting the full content of the original document.
A set of concepts and categories in a subject area or domain that shows their properties and the relations between them.
Information about the relationships between medical concepts.
Out-of-vocabulary words or words of limited occurrences that the proposed framework is capable of coping with, achieving semantic content generalization.
A pre-trained component of the proposed pipeline framework that identifies opinion phrases in reviews.
A subsequence of tokens within a review that expresses the attitude of the reviewer towards a specific aspect of the entity. It may not be contiguous in the review, and a word can be part of multiple opinions.
A simple and controllable component of the proposed pipeline framework that merges, ranks, and optionally filters the extracted opinions.
The set of opinion phrases within a review, defined as O_r = {(o_i, pol_i, a_i)} for i = 1 … |O_r|, where pol_i is the sentiment polarity of the i-th phrase (positive, neutral, or negative) and a_i is the aspect category it discusses.
A dataset consisting of Amazon reviews from six product domains, and includes development and test sets with gold standard aspect annotations, salience labels, and multi-document extractive summaries.
A summary that is considered to be the best possible version of the original text
A novel non-learning based extractive summarisation method that formulates extractive summarisation based on the optimal transport theory.
A method for measuring the distance between probability distributions
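For discrete one-dimensional distributions on the same equally spaced support, one such distance (the 1-D Wasserstein, or earth mover's, distance) reduces to the summed absolute difference between the two cumulative distribution functions. A minimal sketch, assuming both histograms are already normalized and measuring distance in units of one bin width:

```python
def wasserstein_1d(p, q):
    """1-D Wasserstein distance between two normalized histograms on the
    same equally spaced support: sum over bins of |CDF_p - CDF_q|."""
    assert len(p) == len(q)
    cdf_p = cdf_q = 0.0
    total = 0.0
    for pi, qi in zip(p, q):
        cdf_p += pi
        cdf_q += qi
        total += abs(cdf_p - cdf_q)
    return total

# Moving all probability mass one bin to the right costs exactly 1 bin width.
print(wasserstein_1d([1.0, 0.0, 0.0], [0.0, 1.0, 0.0]))  # → 1.0
```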
A problem of finding the best solution among all possible solutions.
A method used at training time to select informative guidance signals to encourage the model to pay close attention to the guidance.
Approaches that extract the most important sentences from the input document to generate a summary
One-to-one matching of each fact in the gold summary to one fact in the source text to obtain the oracle label
Extractive labels created using a greedy oracle labeling algorithm for the Pubmed and arXiv datasets
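A common form of such a greedy oracle, sketched below with a simple unigram-recall gain criterion (an assumption for brevity; actual implementations typically use ROUGE): repeatedly add the sentence that most improves overlap with the gold summary, and stop once no sentence helps.

```python
def greedy_oracle_labels(sentences, gold_summary):
    """Greedily pick sentence indices that maximize unigram recall against
    the gold summary; returns a 0/1 extractive label per sentence."""
    gold = set(gold_summary.lower().split())

    def recall(indices):
        covered = set()
        for i in indices:
            covered |= set(sentences[i].lower().split())
        return len(covered & gold) / len(gold) if gold else 0.0

    selected, best = [], 0.0
    while True:
        gains = [(recall(selected + [i]), i)
                 for i in range(len(sentences)) if i not in selected]
        if not gains:
            break
        score, idx = max(gains)
        if score <= best:  # no remaining sentence improves recall
            break
        selected.append(idx)
        best = score
    return [1 if i in selected else 0 for i in range(len(sentences))]

sents = ["the model uses attention", "weather was nice", "results improve rouge"]
print(greedy_oracle_labels(sents, "the model improves rouge results"))  # → [1, 0, 1]
```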
Sentences that tend to be crowded at the beginning of news articles while distributed more evenly in scientific papers.
A solution proposed to prevent the repeated phrases problem by ensuring that successive context vectors are orthogonal to each other.
Experiments conducted to examine the cross-domain generalizability of the compressive system.
A problem in natural language processing where a word is not present in the vocabulary of a model, making it difficult to generate accurate summaries.
Words that are not present in the vocabulary of the model
Words that are not present in the vocabulary of a language model, making it difficult for the model to handle them.
A measure of centrality that assumes a word sending out more relevance score to others is more critical.
The length of the summary produced by a summarization system.
A category of ensemble algorithms that corresponds to the majority vote method in classification tasks.
The length of the summary produced by the system
When a model performs well on the training data but poorly on new, unseen data
when a model is too complex and fits the training data too closely, resulting in poor performance on new, unseen data
Words that appear in both the query and the source document
Summaries that exceed the desired length constraint
A method used to derive sentence prestige by building a connectivity graph
A manual evaluation approach that compares two summaries
An approach to summarization evaluation that collects preference labels over sentences in documents or over summaries from a human assessor, requiring less cognitive effort than writing a reference summary or manually scoring a machine-generated summary.
Labels provided by the user that compare two candidates and label the best one.
Simple and inexpensive annotations used to train the evaluation model
A set of texts in two or more languages that are aligned at the sentence or phrase level
A method of decoding the summary in parallel, rather than sequentially.
A technique used in global encoding to refine the representations at each time step with consideration of the global context
Methods used to induce a latent variable model for unsupervised sentence summarization tasks
A task where the model determines if two sentences have the same meaning
The ability of models to generate novel text by rephrasing the source document.
A technique used in this work where only some attention heads are masked to guide the summarization process.
Generating summaries that conform to a specific pattern, such as court judgments, diagnosis certificates, abstracts in academic papers, etc.
One of the link analysis algorithms applied to graph attention network (GAT) to obtain node representations that better reflect the query and document relationships.
An algorithm used to leverage repetitive random walks on a semantic network to identify the relevance of different senses of a word
A technique based on person names to reduce extrinsic hallucinations involving named entities.
A method that creates summaries from extracted phrases rather than from sentences
Models that condition the generation process on themes of interest and text style transfer controls selected attributes, such as politeness, emotions, or humor of the generated text.
The ability of the proposed algorithm with global awareness to enhance beam search for neural abstractive summarization without requiring any model or parameter modification.
A condensed version of the podcast episode that can serve as a basis for decision-making or as a synopsis
A mode in the pointer generator architecture where words are directly copied from an aligned source context.
Statements made by participants in online discussions, either agreeing or disagreeing with others.
The establishment of correspondence between sentences, which is crucial for fusing sentences. It includes entity and event coreference, shared words/concepts between sentences, and more.
Text chunks that convey the same or similar meanings and tie two sentences together into a coherent text.
A metric used to measure the relevance and redundancy of a summary. It is based on the probability of a sentence or summary occurring in a document.
A model trained via reinforcement learning to generate summaries that maximize the score given by the reward model.
A method used in the mixed objective to encourage abstraction in a summarization model
A set of 8 metrics on the Accuracy and Fluency aspects designed to quantify the primary sources of errors over representative models
A problem in extractive summarization where sentences appearing earlier in a document tend to be selected as the most important, resulting in sub-optimal models.
Indicators of important content in news articles based on the sentence's position in the document
A channel used to learn sentence position features.
The idea that important sentences appear in preferred positions in a document.
Information about the position of words in the input sentence
An adversary introduced to optimize the neural extractive summarizer in an alternating manner.
A method that significantly outperforms strong baselines in single-document summarization by using position information to transform undirected edges into directed ones
Editing generated summaries to improve their quality
The process of correcting errors in a generated summary after it has been generated
The probability of an entity being in a summary given the source document.
Language models that are trained on large amounts of data and can be used to improve the decoder's ability to learn summary representations, context interactions, and language modeling together
A pre-trained encoder is an essential component for sequence generation tasks and often these tasks benefit from sharing the weights between the encoder and the decoder.
A common approach in natural language processing where a model is first pre-trained on a large corpus of text and then fine-tuned on a specific task.
The objective used during pre-training to improve the quality of the generated output in downstream tasks.
The degree to which repeated measurements under unchanged conditions show the same results
A mechanism that predicts the extent of key information covered in the final summary to further guide the summary generation
A problem formulation for reward learning that involves predicting the relative preference between two summaries
A form of feedback in which a user provides a preference over a pair of predictions, which is less cognitively burdensome than providing ratings or categorical labels.
Teaching the model how to rewrite a summary which is a directed-logical subset of the input document
Embeddings that are trained on large amounts of data and used in the summarization task
A task designed to summarize a patient’s problems and generate relevant diagnoses to assist healthcare providers and overcome the cognitive burden and information overload.
A combination of the pretrained language model and a smoothed problem specific target language model to guide the fluency of the generation process
Techniques used to guide the generation process of text generation models.
Features of speech such as intonation, stress, and rhythm that convey meaning and emotion
Models that use prototype document-summary pairs to improve summarization performance.
An effective distillation method for Seq2Seq models, where the teacher model generates pseudo summaries for all documents in the training set and the resulting document–pseudo-summary pairs are used to train the student model.
The reason why a summary is being created
An analysis that brings insights as to what current unsupervised models are missing in automatic text summarization
An approach to topic-focused summarization that uses query attention.
A type of summarization that highlights those points that are relevant in the context of the query.
Interactive and non-interactive techniques proposed to translate input questions into structured queries covering specific elements of the questions.
Summaries that consist of a few sentences around query terms in the results
Features like query word overlap that are designed to learn the relevance ranking.
A technique that creates a brief, well-organized and fluent summary that answers the need of the query. It is useful in many scenarios like news services and search engines, etc.
The process of ordering a set of systems based on their performance.
A function that maps text documents to scores and is used to rank candidates.
Metrics used to evaluate the informativeness of posts based on their ranking in a summary.
Models used to rank sentences in order of importance for summarization.
An additional module in a summarization system that re-scores candidate summaries generated by the main summarizer.
A type of question answering system that focuses on answer span extraction in long documents.
The proportion of true positives that are correctly identified by a model
A system that provides recommendations to users based on their preferences and behavior.
A type of neural network that can process sequences of inputs
Successful in a variety of NLP tasks where an encoder obtains representations of input sequences and a decoder generates target sequences
Information that is repetitive and should not be included in summaries
The issue of redundant phrases among selected sentences that arises in the naive approach to the decoder's first prediction step, which makes an independent binary decision for each sentence and therefore does not model overlap or redundancy between the selected target sentences.
A general framework that serves as a base system to construct a summary and as a meta system to select the best system output from multiple candidates, allowing the base and meta learners to share a set of parameters.
The bias that occurs when evaluating against a single reference summary
Texts written by humans used as a basis for comparison with generated summaries.
Reference-free evaluation metrics focus on evaluating summaries without the need for human-annotated summaries as reference. A high-quality summary should be concise and contain the most important information of its document. Some reference-free evaluation metrics unsupervisedly construct a pseudo-reference summary by selecting salient sentences from the source document, while others evaluate summary quality by measuring how much information from the document is represented in the summary. QA-based evaluation metrics achieve this by first asking the same questions of the document and the summary and then comparing their answers.
Evaluating single document summaries without reference summaries, using embedding similarity between the full document and system summaries.
A statistical method used to analyze the relationship between variables
A problem formulation for reward learning that involves predicting a continuous value
A technique used by the summarizer to visit portions of the transcript in chronological order, while allowing zigzags to produce a coherent summary.
A term added to the loss function during training to prevent overfitting.
A problem in text summarization where the entities in the summary exist in the source document, but the relations between them are not accurately reflected in the summary.
An easy-to-compute model-free metric that evaluates factual consistency given a summary and the article, employing the extracted relations and not requiring human-labelled summaries.
A technique for maintaining reasonable performance even in the case of a sub-sequence with errors, which involves accurately estimating the relative quality of different generated outputs, since effective inference requires comparison among these candidates.
The process of assessing the relevance of documents for a given task or search topic
The degree to which a metric can be trusted to produce consistent and accurate results.
A metric that can automatically evaluate the content of a summary
A recurring problem in models based on the encode-attend-decode paradigm where the summaries produced by such models contain repeated phrases.
The quality of the summary in terms of how much it repeats information from the original text
Different ways of representing text, including sentence embedding, un-contextualized word embedding, and contextualized word embedding.
A process of learning unbiased sentence representations using deep neural networks
A technique used in neural networks to help prevent the degradation of performance that can occur when training very deep networks.
A simpler manual evaluation approach that does not rely on reference summaries and can be attained via crowdsourcing
A manual evaluation method where human annotators score summaries on a Likert scale ranging from 1 to 5.
The process of adjusting and updating a machine learning model to improve its performance on a specific task or in a specific domain
A model that utilizes word embeddings and domain-specific knowledge for finding the appropriate context of citations, aimed at capturing terminology variations and paraphrasing between the citation text and its relevant reference context.
A function that can guide RL-based summarisers to generate more human-appealing summaries
A model trained via supervised learning to predict the human-preferred summary.
A method used to alleviate the sparsity of training signals in abstractive summarization models
A theory that inspires the formulation of sub-sentence highlights, where sub-sentence highlights resemble the nuclei, which are text spans essential to express the writer's purpose.
A theory that provides a coherent and well-organized representation of documents and suggests discourse-level segmentation can help model semantic information with more refined granularity.
The agents that are misled by the poor performance of ROUGE at summary level, as existing RL-based summarisation systems rely on summary-level ROUGE scores to guide the optimisation direction
The ability of a model to perform well on different document genres and lengths.
Refers to tasks where mistakes made by AI systems can have serious consequences.
Two aspects rated by humans to evaluate the model-generated summaries
The balance between selecting sentences with high semantic similarity to the gold summary and resolving redundancy between selected sentences.
The process of estimating the importance of each sentence in a document
Labels provided by externally trained content selectors that indicate the importance of different parts of the source text.
The process of extracting the most salient sentences to form a summary
Metrics used to evaluate the importance of the information in the generated summary.
A network that manages the information flow from encoder to decoder explicitly and assigns a salient score for each token in source documents according to their encoded representations
The most significant clinical terms occurring in the findings, which can be used to improve the final impression generation.
The procedure of information representation and discrimination to ensure generated summaries contain adequate salient information of the original documents
Opinions that are important or relevant to the product or service being reviewed.
Important phrases that are not common across different domains
A third type of guidance signal investigated in the paper, which involves providing the model with salient relational triples in the form of (subject, relation, object).
The inability of a machine learning model to generalize well to new data due to insufficient training examples
A potential problem when recruiting participants to evaluate summarization systems, as different demographics may exhibit different preferences in rater studies
Mechanisms that selectively sample offline data in favor of human feedback learning. The sampling strategies focus on low-rewarded samples or documents that are similar to fine-tuning data.
The process of summarizing research papers, usually generating paper abstracts.
The process of generating summaries from professional texts like COVID-19-related papers, which are difficult due to their long texts with complicated structures.
Summary content unit, which is a unit of information in a summary
A strategy for summarization where an extractor selects salient sentences, then an abstractor generates a summary
A technique used to solve the issue of extracted linked entities being too ambiguous and coarse to be considered relevant to the summary
A modified version of a Gated Recurrent Unit (GRU) network, which can decide how much of the hidden state of each sentence should be retained or updated based on its relationship with the document.
The average cosine similarity between sentences in a paper's citation summary, which is consistently higher than in its abstract and measures the focus in describing the paper's main contributions.
The characteristic of a summary that conveys the majority opinion of the reviews and does so in a self-consistent manner.
Highlighted text that is understandable on its own, without the need for specific information from surrounding context
A baseline used in reinforced abstractive summarization methods that is obtained by greedily searching for a sequence that maximizes the likelihood probability of the current model.
A method used to train the network end-to-end
A technique employed in this paper to optimize the ConvS2S architecture enhanced by topic embedding and SCST, which yields high accuracy for abstractive summarization, advancing the state-of-the-art methods.
Clusters with titles that accurately reflect their contents
A score that measures the degree of semantic similarity between different parts of a summary.
Repetition of n-grams within the output of a model
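This can be measured, for instance, as the fraction of n-gram occurrences in the output that are repeats. A minimal sketch, assuming whitespace tokenization:

```python
from collections import Counter

def repeated_ngram_rate(text: str, n: int = 3) -> float:
    """Fraction of n-gram occurrences in the text that are repeats."""
    tokens = text.lower().split()
    counts = Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))
    total = sum(counts.values())
    if total == 0:
        return 0.0
    repeats = sum(c - 1 for c in counts.values() if c > 1)
    return repeats / total

# 1 of the 4 trigram occurrences repeats an earlier trigram.
print(repeated_ngram_rate("the cat sat the cat sat", n=3))  # → 0.25
```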
A method of training a summarization model without the need for human-generated reference summaries.
Methods that do not require labeled data for training
A method to boost the zero-shot capability of the model.
Pretraining with prohibitively-large datasets to facilitate adaptation to new tasks with less abundant data.
A training method that uses unlabeled data to train error detectors.
The process of combining multiple entities into a more general expression in order to change the level of detail in a summary
An unsupervised extractive model that learns a representation of text over latent semantic units using dictionary learning.
Continuing sentences that describe the same facet
A desirable summary aspect that is encouraged in reinforcement learning approaches.
The meaning and context of the text beyond its literal words
The aim of the new evaluation metric to better capture this aspect of a summary, i.e. be more sensitive to hallucinations and omissions
A representation of predicate-argument relations between content words in a sentence that can guide summary generation.
A model that leverages the input sentence and semantic dependency graph to generate a summary in a complementary way.
The output summaries of existing graph-based methods tend to deviate from the input text because the methods rely on statistical-level graphs.
A metric clustering paradigm used to estimate the student coverage of each phrase in the summary
The semantics of the document may drift from section to section.
A channel used to learn sentence linguistic features.
The degree to which a summary conveys the same meaning as the input text
A type of multiplex graph that connects two sentences sharing similar meanings.
The process of using context and background knowledge to understand the meaning of a text
The exploration of more compelling ways to evaluate summarization, translation, and dialog beyond token overlap, including using word embeddings and universal sentence representation.
The meaning of words and phrases in context
Evaluation of abstractive summarization needs a semantic-overlap-based method.
A new NLP task for summarizing multiple alternative narratives with different perspectives by cross-verifying their information contents against each other.
The relationships between sentences that are ignored when existing language models model sentences word-by-word.
A type of relational information among words that has been proven to be useful for downstream tasks.
A large-scale scientific papers summarization dataset with citation graph
Another task that can be aided by the transfer of the latent semantic representation into useful editing tasks.
The similarities in meaning between different words or phrases
Scores that measure the degree of similarity in meaning between different words or phrases
The ability of models to understand the meaning of the source document and generate meaningful paraphrases.
The degree of overlap between the meaning of the reference and model summaries.
A set of diagnostics proposed in this work for measuring the sensitivity of factuality metrics to factual inconsistency.
The method of shortening text by removing words or rephrasing parts of a sentence
State-of-the-art techniques used to prepare groundtruth rankings of sentences from the original document by computing the semantic similarity between each individual sentence of the original document and the entire human-written summary.
Design choices for encoding sentences in neural network architectures for summarization
A crucial step in extractive summarization where a representative subset of sentences is selected, which contains the information of the entire set.
Methods that select sentences in a document to create its summary, with advantages of truthfulness compared with abstractive methods and of fluency compared with word extraction methods
Built with recurrent neural networks that remember the partial output summary and provide a sentence extraction state to score sentences
Design choices for extracting sentences in neural network architectures for summarization
The degree to which the generated text is grammatically correct and coherent.
Extracting full sentences as the extraction unit
A model that includes a planning step at the sentence level before generating the summary word by word, in order to generate more abstractive summaries without sacrificing ROUGE and coherence.
The bias in news summarization where sentence position dominates the learning signal
A process in sentence regression that evaluates the importance of each sentence with a ranking model
Extractive summarization is modeled as a sentence ranking problem with length constraints
A branch of extractive summarization methods that models the relative importance of a sentence given a set of sentences
The first stage of the fact consistency assessment framework where top-K pieces of evidence are selected from the original document
Individual sentences selected for summarization.
A type of extractive summarization that involves selecting important sentences from the original text to create a summary.
A component of ESCA that ensures the abstractive summary focuses on both correct and desired concepts
A measure of the relevance between summary sentences and the original document.
A method for extracting the final summary from candidate sentences
The proper way to evaluate the SOS task, which improves the inter-rater agreement compared to the traditional ROUGE metric and shows a higher correlation with human judgments.
Focusing on summarizing individual sentences
A sequence-to-sequence model that uses attention mechanism.
A neural sequence-to-sequence architecture used for text generation tasks like machine translation and image captioning.
A framework that has achieved state-of-the-art performance on abstractive sentence summarization task.
A type of machine learning problem where the input and output are both sequences, and the goal is to learn a mapping between them.
A statistical machine translation model that uses an encoder to convert the input text as a vector representation, and then feeds this representation into a decoder to generate summary.
A type of neural network architecture that has been demonstrated to be the state-of-the-art for SEQ2SEQ modeling in natural language generation tasks, such as abstractive summarization.
A model that uses a sequence-to-sequence architecture to incorporate the salient clinical terms into the summarizer.
A method of extractive summarization that formulates it as a problem of labeling each sentence in the original text as either included or excluded in the summary.
A deep learning-based approach that encodes input documents as vector representations with a long short-term memory (LSTM) and uses another LSTM as the decoder to generate corresponding summaries. It captures the semantic and syntactic relations between raw documents and their summaries in a scalable and end-to-end way.
A neural network architecture that learns to compare two inputs and output a similarity score.
A method of computing the similarity between the source document and the candidate summary in extractive summarization, using a pre-trained BERT model in a Siamese network structure to derive semantically meaningful text embeddings that can be compared using cosine-similarity.
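As an illustration, a minimal sketch of the comparison step (the toy embedding vectors below are hypothetical; in practice they would come from a pre-trained BERT encoder with pooling, as in Sentence-BERT):

```python
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity between two embedding vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Hypothetical sentence/document embeddings (illustrative values only).
doc_embedding = np.array([0.2, 0.8, 0.1])
candidate_a = np.array([0.19, 0.81, 0.12])  # semantically close to the document
candidate_b = np.array([0.9, 0.05, 0.4])    # semantically far from the document

# The candidate summary whose embedding is closest to the document is preferred.
best = max([("a", candidate_a), ("b", candidate_b)],
           key=lambda kv: cosine_similarity(doc_embedding, kv[1]))
```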
A statistical test used to determine if the differences between two sets of data are significant
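One common choice for this in summarization evaluation is paired bootstrap resampling; a minimal sketch (the scores and resample count are illustrative, and the specific test used may differ):

```python
import random

def paired_bootstrap(scores_a, scores_b, n_resamples=1000, seed=0):
    """Paired bootstrap resampling: resample test instances with replacement
    and count how often system A's total score beats system B's.
    Returns an estimated p-value for 'A is not better than B'."""
    rng = random.Random(seed)
    n, wins = len(scores_a), 0
    for _ in range(n_resamples):
        idx = [rng.randrange(n) for _ in range(n)]
        if sum(scores_a[i] for i in idx) > sum(scores_b[i] for i in idx):
            wins += 1
    return 1.0 - wins / n_resamples
```

A small p-value indicates the observed difference is unlikely to have occurred by chance.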
A method of aggregating latent vectors by taking their average.
A phenomenon where different conclusions are drawn depending on which subset of a population is considered.
Datasets with only one summary, which might not be optimal for summarization due to human variation
Story highlights of an article provided by news websites consisting of three or four succinct itemized texts for readers to quickly capture the gist of the document.
A technique that generates a general summary of popular opinions for a single entity.
Summaries created by humans that serve as the sole reference for evaluating the quality of machine-generated summaries.
A practical solution for summarizing long-form documents by processing them separately in multiple windows
A task that extracts important sentences and representative comments as the summarization, utilizing the social information of a web document to support sentences for generating a high-quality summarization.
Informal language and massive noise within social media content make training deep neural networks on such datasets challenging.
A way of aligning the input sentence with the summary that allows the decoder to focus on the most important parts of the input sentence
The de facto standard attention mechanism that assigns attention weights to all input encoder states.
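A minimal sketch of this soft (global) attention, using dot-product scoring over hypothetical toy vectors:

```python
import numpy as np

def soft_attention(query: np.ndarray, encoder_states: np.ndarray):
    """Assign a softmax weight to every encoder state, then return the
    weights and the weighted-sum context vector."""
    scores = encoder_states @ query        # one score per encoder state, shape (T,)
    scores = scores - scores.max()         # numerical stability
    weights = np.exp(scores) / np.exp(scores).sum()
    context = weights @ encoder_states     # weighted sum of states, shape (d,)
    return weights, context

# Illustrative decoder query and encoder states (hypothetical values).
query = np.array([1.0, 0.0])
encoder_states = np.array([[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]])
weights, context = soft_attention(query, encoder_states)
```

States better aligned with the query receive proportionally more weight, but every state receives some weight.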
A mechanism that calculates aspect/sentiment-aware review representations.
Attention methods that first locate salient text regions within the input text and then bias the abstraction process to prefer such regions during decoding
Labels that are not binary or definitive, but rather represent the degree of relevance or importance
A method of optimizing shared parameters that achieves higher performance than hard sharing
Class probabilities produced by the teacher model in knowledge distillation.
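These soft targets are typically produced with a temperature-scaled softmax; a minimal sketch (the logits below are hypothetical):

```python
import numpy as np

def soft_labels(logits: np.ndarray, temperature: float = 2.0) -> np.ndarray:
    """Temperature-softened class probabilities, as produced by a teacher
    model in knowledge distillation. Temperature > 1 flattens the
    distribution so information about non-target classes survives."""
    z = logits / temperature
    z = z - z.max()                 # numerical stability
    e = np.exp(z)
    return e / e.sum()

teacher_logits = np.array([4.0, 1.0, 0.5])
hard = soft_labels(teacher_logits, temperature=1.0)   # peaked distribution
soft = soft_labels(teacher_logits, temperature=4.0)   # flattened distribution
```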
The original document(s) from which the summary is generated.
The domain used for training the neural summarization system.
A knowledge structure that contains few connections between concepts or ideas.
A transformer model with the sparse attention mechanism for abstractive summarization, which supports the encoder to model longer input sequences with limited GPU memory.
The degree to which the attention weights are concentrated on a small subset of the input sentences, which can be exploited to reduce computation cost.
Achieving faster processing time compared to existing methods
Summaries that do not adapt to the search query
Models for abstractive sentence summarization trained on relatively small scale training corpora
A method of selecting a sample of data from a larger population for analysis
Improvements that are unlikely to have occurred by chance
Three sequence-to-sequence pre-training objectives, namely Sentence Reordering (SR), Next Sentence Generation (NSG), and Masked Document Generation (MDG), which can be used to pretrain a SEQ2SEQ model on unlabeled text.
A method where a summary is constructed incrementally by choosing new content conditioned on previously planned content.
A measure of how much a summarization system's performance changes when evaluated on different datasets
Added to articles to provide a teaser summary of the most important points of the article, but most straplines in the Newsroom corpus are not summaries of their associated articles. Distinguishing straplines aimed at piquing a reader’s interest from abstractive summaries is necessary to obtain high quality data.
Patterns in the organization of text that can be exploited by extractive summarization methods.
Each summary sentence is generated by compressing several specific source sentences.
Different summary sentences usually focus on different sets of source sentences to cover more salient information of the original document.
Mechanisms that facilitate copying source words and relations to the summary based on their semantic and structural importance in the source sentences.
Incorporating structured action representations to generate more faithful todos.
Used as both the objective and attention weights for extractive summarization.
Enable the model to better control both the content to be conveyed and the syntactic structure needed to express it, ultimately improving the factuality and grammaticality of the generated summaries.
A representation that facilitates the connection of relevant subjects and the preservation of global context.
Potentially beneficial for reducing data sparsity and localizing generation errors in abstractive scenarios.
Summaries that allow humans to browse specific aspects of interest more readily.
Transformer-based architectures that have the flexibility to model some form of structure of the input, e.g., hierarchical document structure.
The degree to which the generated text matches the style of the input or a specific genre.
The non-informational or non-factual aspect of text that drives the quality of response from its audience.
Summaries that are tailored towards specific stylistic preferences, such as formality.
Functions of summarization including position, importance, diversity, and information.
The idea that summarization is a combination of sub-aspect functions, such as information and layout.
Identifying a single most informative textual unit from each sentence to create highlights.
Identified from a document to strike a balance between the quality and amount of highlights
Extracting non-terminal nodes in a constituency parsing tree to separate important and unimportant contents
Position, importance, and diversity, which determine the output form of summarization.
Based on personal opinions, feelings, and attitudes
Information gathered by the care team that is based on the patient’s experiences and perceptions.
Intents that are difficult to express through queries and require more personalized examples
Metrics used to measure the quality of generated summaries are important for the development of summarization systems. Previous evaluation metrics require human-annotated summaries as reference and measure summary quality through the similarity between generated summaries and their reference summaries. However, such reference-based evaluation metrics cannot accurately evaluate the summary, because a document has many correct but different summaries. Thus, it is useful to develop reference-free evaluation metrics for this task.
Graphs built among sentences to capture inter-sentence relationships and rank them by estimating summary-worthy features of sentence importance.
A shortened version of a text document which maintains the most important ideas from the original article. Automatic text summarization is a process by which a machine gleans the most important concepts from an article, removing secondary or redundant concepts. Extractive summarization is a technique for generating summaries by directly choosing a subset of salient sentences from the original document to constitute the summary.
A new task for content selection in topic-focused summarization that involves producing the next sentence in the summary given a topic, a partial summary, and a reference document(s).
The degree to which the summary is logically connected and easy to understand
A layer in the model that decodes the selected information into a summary
Techniques used to assess the quality of system-generated summaries, including manual and automated pyramid and ROUGE scores.
The generated summaries must accord with the facts expressed in the source.
Overlaid sub-sentence segments on source documents to enable users to quickly navigate through content
The length of a summary, which can be controlled by the generator and predicted by the selectors.
Phrases or groups of words that appear in a summary
The fact that certain sentences are more appropriate for inclusion in a summary regardless of the specific document they appear in.
Different aspects of a summary that can be specified for evaluation, such as factual consistency, fluency, coherence, and informativeness.
A phenomenon where the decoder generates less informative and overly generic summaries due to simple average vector aggregation.
A complex task involving various linguistic operations that is useful for developing students' linguistic proficiency, including text comprehension and composition.
A scenario where the correlation between the candidate metric and human judgments is computed for each topic individually and then averaged over topics.
The process of evaluating the quality of a summary as a whole, rather than evaluating individual sentences or phrases.
A novel approach to extractive summarization that formulates it as a semantic text matching problem, where a good summary should be more semantically similar, as a whole, to the source document than unqualified candidate summaries.
A training strategy that optimizes the summary-level quality of a summary, rather than optimizing individual sentences or phrases.
Data that has been labeled or annotated by humans for machine learning algorithms to learn from
Training models with large training corpora comprising pairs of long texts and their summaries
The process of providing labeled data to train a machine learning model
Features like the TF-IDF cosine similarity between a sentence and the query that are inadequate to measure the query relevance.
A measure of how unexpected a word or phrase is in a given context
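Formally, surprisal is the negative log-probability of a token given its context; a minimal sketch (the probabilities are assumed to come from some language model):

```python
import math

def surprisal(prob: float) -> float:
    """Surprisal in bits: -log2 p(word | context).
    Rarer, less expected words carry higher surprisal."""
    return -math.log2(prob)
```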
The components of a sentence that provide information about the verb, such as the subject and object.
A method of analyzing the grammatical structure of a sentence by breaking it down into its constituent parts.
Changes in the form of a word to indicate tense, number, or gender.
A type of relational information among words that has been proven to be useful for downstream tasks.
Beneficial for generating compressed yet informative summaries
A related task that exploits word-level syntax to generate high-quality summaries from the language modeling perspective, and thus alleviates the issues of incomplete sentences and duplicated words.
Artificial pairs of summaries and reviews created for training a summarization model.
Data generated specifically to train models on the factuality detection task.
Oracle methods, baselines, and state-of-the-art approaches are used to evaluate summarisation quality.
The bias towards certain sub-aspect functions in different summarization systems.
Bias learned on a news corpus that can be reduced by modulation with semantic sub-aspects.
Summaries that match the interests of the reader and are required in manifold settings, such as summarization of complex event streams with a focus on regions, entities or topics of interest for journalists or analysts, understanding reviews or opinions from different perspectives, the summarization of electronic health records with a focus on the medical sub-specialty of the physician reader, or any other form of personalized summarization targeting explicitly defined or implicitly mined preference parameters.
The domain for which the neural summarization system is being trained.
Strategies for combining multiple tasks during fine-tuning.
A pre-training method based on an unlabeled small-scale task-related corpus.
The model's task-agnostic approach allows it to implicitly learn and leverage content plans directly from the data.
These are pretraining methods that do not take into account the specific downstream task. Examples of such methods include corrupted span prediction (T5), masked language model (BERT), denoising objective (BART), and vanilla language model (GPT).
Modules added to the model to enable it to effectively share knowledge from multiple tasks.
A classification system used to categorize different types of interactions in AI-assisted text generation.
A traditional approach to abstractive summarization that uses manually defined rules to fill in incomplete sentences.
An approach to traditional abstractive summarization that involves manually creating hard templates by domain experts and populating key snippets to form the final summaries.
Explicitly-compositional vector embeddings of symbolic structures that encode a constituent in a symbolic structure as a composite of a role (encodes structural information) and a filler (encodes content).
The discourse and terminology variations between the citing and the referenced authors, which render traditional IR models that rely on term matching ineffective at finding relevant information.
Neural networks that can encode and decode text.
A related task that improves the quality of locating salient information of the text by learning a category-specific text encoder.
A theory that covers a broad range of points of correspondence, including entity and event coreference, shared words/concepts between sentences, and more.
A model that converts text into a numerical representation
Techniques used to convert text into numerical representations that can be processed by neural networks
Model that takes as input a pair of a document and a corresponding gold summary and perturbs the summary to render it factually inconsistent with the original document
Systems that aim to produce text that is fluent, coherent, relevant, and factually correct.
Tasks that involve generating new text based on existing text.
Automatic evaluation metrics that compare two summaries based on matching their tokens, either through some lexical or embedding-based similarity
Tasks that involve ranking text documents based on their relevance to a user's topic of interest.
The step where the proposed architecture conducts sentence search based on fluency to return the final extracted summaries.
A way to evaluate summarization systems by comparing machine-generated summaries to human summaries
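Such comparisons are most often made via n-gram overlap, as in ROUGE; a simplified sketch of ROUGE-N recall (a toy reimplementation, not the official scorer):

```python
from collections import Counter

def rouge_n_recall(candidate: str, reference: str, n: int = 1) -> float:
    """Simplified ROUGE-N recall: fraction of reference n-grams (with
    clipped counts) that also appear in the candidate summary."""
    def ngrams(text: str) -> Counter:
        toks = text.lower().split()
        return Counter(tuple(toks[i:i + n]) for i in range(len(toks) - n + 1))
    cand, ref = ngrams(candidate), ngrams(reference)
    if not ref:
        return 0.0
    overlap = sum(min(c, ref[g]) for g, c in cand.items() if g in ref)
    return overlap / sum(ref.values())
```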
The process of making a text easier to read and understand, often by using simpler vocabulary and sentence structures.
A self-supervised objective used in CoCoNet's pre-training, where each sequence in the corpus is divided into two spans with some overlapping words, and the first span is used to generate the second by copying.
Short overviews of long documents or document collections that allow readers to understand the content without the need to read full documents.
The coherence of a summary achieved by considering keywords in a sentence for coherence to other sentences and capturing interactions between sentences through discourse dependency trees
Natural language descriptions of the relationships between concepts shown on the edges
A desideratum that encourages summaries to cover diverse information in the input documents.
Methods for modeling importance that formalize the concept of importance and develop general-purpose systems by modeling the background knowledge of readers
The main argument or point of an editorial.
Document encoding, information selection, and summary decoding
The likelihood of a token appearing in the summary
A specific type of loss truncation that involves downweighting certain tokens during training
The optimization of the likelihood of individual tokens in a summary.
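This token-level objective is ordinarily the mean negative log-likelihood of the reference tokens; a minimal sketch (the per-token probabilities are assumed given by the model):

```python
import math

def token_nll(token_probs) -> float:
    """Token-level maximum-likelihood objective: negative mean log
    probability the model assigns to each reference token."""
    return -sum(math.log(p) for p in token_probs) / len(token_probs)
```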
A strategy used by previous models to extract a constant number of sentences from every document, which is at odds with real-world documents, whose appropriate summary lengths vary.
The ratio of sentences within a dataset that are relevant to the query
Produces a document-level topic vector and merges it into the traditional dense word embeddings obtained by an extension of BERT.
Important words or phrases that capture the main idea of a text.
An alternative approach to document clustering that discovers hidden thematic structure
A vector representing the topic of the summary to be generated
Introduces topics as guidance to help generate abundant topic-related words and maintain the original ideas of the documents.
Summaries produced by extractive summarization using sentence-level features that have been leveraged for producing query-focused or topic-based summaries.
A novel approach used to artificially create a dataset containing articles with multiple topic-oriented summaries.
The task of generating a summary given a source text and a specific query or topic.
The measure of semantic relatedness between words and the topical coherence of a document
A graph used for summarization where one set of nodes corresponds to topics
A graph that consists of sentence and topic nodes and efficiently captures inter-sentence relationships for summarization.
A method of ordering nodes in a tree based on their dependencies.
The limitation of current models that requires a balance between generating abstract summaries and maintaining faithfulness to the source documents.
A curve indicating the trade-off between abstractiveness and faithfulness
A distribution gap between the training and test distributions in the meta-learning stage.
IQE uses pairs of a post and a reply candidate to train the model; replies are required only during training, not during evaluation.
The data used to train and evaluate a model
Requires large amounts of data
Abundance of training data is necessary for the performance of neural models. Lack of sufficient training data worsens the model’s ability to generalize patterns in training data to unseen data.
Training data is generated from source documents by applying a series of rule-based transformations inspired by error-analysis of neural summarization model outputs.
The study of how models learn during the training process
The paper uses a Transformer conditional language model (CLM) that is trained with a ‘leave-one-out’ objective by attending to other reviews of the product.
An alternate method for summarization model generation that is superior in terms of performance and resource utilization.
A negative sampling strategy that considers not only negative examples directly sampled from the data, but also negative examples that are indirectly related to the positive examples.
The novel conceptualization of extractive summarization as a problem of inducing document-level dependency trees.
A decoder that predicts dependency arcs between words in the partial summary.
Guides neural summarization system to identify summary-worthy content and compose summaries that preserve vital meaning of source texts
a neural network architecture that integrates three separate encoders to consider the context of the original text, topic keywords, and knowledge structure simultaneously.
A method of removing duplication in extractive summarization by skipping sentences that have trigram overlapping with the previously selected sentences.
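A minimal sketch of this trigram-blocking heuristic during greedy sentence selection (the ranked sentences below are toy examples):

```python
def trigrams(sentence: str) -> set:
    """Set of word trigrams in a sentence."""
    tokens = sentence.lower().split()
    return {tuple(tokens[i:i + 3]) for i in range(len(tokens) - 2)}

def select_with_trigram_blocking(ranked_sentences, max_sentences=3):
    """Greedy extractive selection: skip any candidate sentence that
    shares a trigram with an already selected sentence."""
    selected, seen = [], set()
    for sent in ranked_sentences:
        tri = trigrams(sent)
        if tri & seen:          # trigram overlap -> likely redundant, skip
            continue
        selected.append(sent)
        seen |= tri
        if len(selected) == max_sentences:
            break
    return selected

# Toy ranked list: the third sentence repeats a trigram of the first.
ranked = ["the cat sat on the mat",
          "a dog barked loudly at night",
          "the cat sat on the sofa"]
picked = select_with_trigram_blocking(ranked)
```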
The process of cutting off parts of a document, which can lead to information loss
The level of confidence users have in the accuracy and reliability of the AI-assisted text generation system.
Concern about whether all facts of a generated summary are mentioned in the source text
The important feature of summarization for it to be widely accepted in real-world applications
A parameter that can be adjusted to change the output of the system
One for words and the other for sentences
A process that generates the summary using a left-context-only decoder in the first stage and predicts the refined word one-by-one using a refine decoder in the second stage
False positives in statistical analysis
A dataset created by permuting the order of sentences in training articles to reduce position bias
Part of the current research setup for text summarization
The summarization task is underconstrained in that the importance of a piece of information highly depends on the expectations and prior knowledge of a reader
Inaccurately reproducing factual details, inability to deal with out-of-vocabulary (OOV) words, and repeating themselves
Content that is not faithful to the original text.
The generation of summaries that contain information that is not factually consistent with the source documents.
Query-specific information utilized by recent unsupervised approaches for effective performance.
A method of summarizing text without paired summaries
One of the reasons why the model sometimes exhibits such untruthful behavior lies in the untruthful article-headline pairs used to train the model
The highest possible performance that can be achieved in the task of generating summary-worthy aggregations
The maximum possible performance of a metric
The degree to which a summary is helpful or valuable
Reviews written by users on products sold online.
A user-based selective mechanism that considers different user preferences on review content when summarizing a review and applies user-specific vocabulary to consider user’s writing styles when generating a summary.
Content created by users, such as comments, that can be combined with a news article to provide a perspective viewpoint regarding an event.
The wants and needs of people who use automatic text summarization
the requirements and expectations of individuals who use automatically generated summaries
Fragments of speech transcripts that are not well-formed grammatical sentences
A function that represents the goodness of an action on a given state in RL
An RL technique that takes a value-based approach such as Q-learning or the combination of policy and value-based approaches such as Asynchronous Advantage Actor-Critic
Summaries that can have varying lengths as opposed to fixed lengths
Traditional models used to characterize sentence singletons and pairs.
Incorporating linguistic preferences of the readers into the summary
Supervision data that is not directly paired with documents but is used to train models
A pooling technique that represents the document by taking the sum of sentence embeddings weighted by the automatically learned query relevance of a sentence.
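A minimal sketch of this relevance-weighted pooling (the embeddings and relevance scores below are hypothetical toy values):

```python
import numpy as np

def weighted_doc_embedding(sentence_embeddings: np.ndarray,
                           relevance_scores: np.ndarray) -> np.ndarray:
    """Represent a document as the sum of its sentence embeddings,
    weighted by normalized query-relevance scores."""
    weights = relevance_scores / relevance_scores.sum()
    return (weights[:, None] * sentence_embeddings).sum(axis=0)

# Two toy sentence embeddings; the first is three times as query-relevant.
emb = np.array([[1.0, 0.0], [0.0, 1.0]])
scores = np.array([3.0, 1.0])
doc_vec = weighted_doc_embedding(emb, scores)
```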
The accuracy of aligning target words with their corresponding source words.
Scores derived from matching words against hidden topics, used to determine the importance of a sentence in a text.
A constructed heterogeneous graph that connects each sentence to its contained words, allowing for different granularities of information to be fully used through multiple message passing processes.
A structure that takes into account the relationship between words and sentences in a document.
The process of directly applying a pre-trained model to a target task without fine-tuning
Allows users to control important aspects of the generated summary at test time