Extracting Software Requirements from Unstructured Documents

Standard

Extracting Software Requirements from Unstructured Documents. / Ivanov, Vladimir; Sadovykh, Andrey; Naumchev, Alexandr и др.

Extracting Software Requirements from Unstructured Documents. Springer, 2022. стр. 17-29 2 (Recent Trends in Analysis of Images, Social Networks and Texts; Том 1573).

Результаты исследований: Публикации в книгах, отчётах, сборниках, трудах конференций › глава/раздел › научная › Рецензирование

Harvard

Ivanov, V, Sadovykh, A, Naumchev, A, Bagnato, A & Yakovlev, K 2022, Extracting Software Requirements from Unstructured Documents. в Extracting Software Requirements from Unstructured Documents., 2, Recent Trends in Analysis of Images, Social Networks and Texts, Том. 1573, Springer, стр. 17-29. https://doi.org/10.1007/978-3-031-15168-2_2

APA

Ivanov, V., Sadovykh, A., Naumchev, A., Bagnato, A., & Yakovlev, K. (2022). Extracting Software Requirements from Unstructured Documents. в Extracting Software Requirements from Unstructured Documents (стр. 17-29). [2] (Recent Trends in Analysis of Images, Social Networks and Texts; Том 1573). Springer. https://doi.org/10.1007/978-3-031-15168-2_2

Vancouver

Ivanov V, Sadovykh A, Naumchev A, Bagnato A, Yakovlev K. Extracting Software Requirements from Unstructured Documents. в Extracting Software Requirements from Unstructured Documents. Springer. 2022. стр. 17-29. 2. (Recent Trends in Analysis of Images, Social Networks and Texts). doi: 10.1007/978-3-031-15168-2_2

Author

Ivanov, Vladimir ; Sadovykh, Andrey ; Naumchev, Alexandr и др. / Extracting Software Requirements from Unstructured Documents. Extracting Software Requirements from Unstructured Documents. Springer, 2022. стр. 17-29 (Recent Trends in Analysis of Images, Social Networks and Texts).

BibTeX

@inbook{8f1a4a6651c0441c844361f797c98338,

title = "Extracting Software Requirements from Unstructured Documents",

abstract = "Requirements identification in textual documents or extraction is a tedious and error prone task that many researchers suggest automating. We manually annotated the PURE dataset and thus created a new one containing both requirements and non-requirements. Using this dataset, we fine-tuned the BERT model and compare the results with several baselines such as fastText and ELMo. In order to evaluate the model on semantically more complex documents we compare the PURE dataset results with experiments on Request For Information (RFI) documents. The RFIs often include software requirements, but in a less standardized way. The fine-tuned BERT showed promising results on PURE dataset on the binary sentence classification task. Comparing with previous and recent studies dealing with constrained inputs, our approach demonstrates high performance in terms of precision and recall metrics, while being agnostic to the unstructured textual input.",

keywords = "BERT, ELMo, FastText, Requirements elicitation, Sentence classification, Software requirements",

author = "Vladimir Ivanov and Andrey Sadovykh and Alexandr Naumchev and Alessandra Bagnato and Kirill Yakovlev",

year = "2022",

month = aug,

day = "30",

doi = "10.1007/978-3-031-15168-2_2",

language = "English",

isbn = "978-3-031-15167-5",

series = "Recent Trends in Analysis of Images, Social Networks and Texts",

publisher = "Springer",

pages = "17--29",

booktitle = "Extracting Software Requirements from Unstructured Documents",

address = "United States",

}

RIS

TY - CHAP

T1 - Extracting Software Requirements from Unstructured Documents

AU - Ivanov, Vladimir

AU - Sadovykh, Andrey

AU - Naumchev, Alexandr

AU - Bagnato, Alessandra

AU - Yakovlev, Kirill

PY - 2022/8/30

Y1 - 2022/8/30

N2 - Requirements identification in textual documents or extraction is a tedious and error prone task that many researchers suggest automating. We manually annotated the PURE dataset and thus created a new one containing both requirements and non-requirements. Using this dataset, we fine-tuned the BERT model and compare the results with several baselines such as fastText and ELMo. In order to evaluate the model on semantically more complex documents we compare the PURE dataset results with experiments on Request For Information (RFI) documents. The RFIs often include software requirements, but in a less standardized way. The fine-tuned BERT showed promising results on PURE dataset on the binary sentence classification task. Comparing with previous and recent studies dealing with constrained inputs, our approach demonstrates high performance in terms of precision and recall metrics, while being agnostic to the unstructured textual input.

AB - Requirements identification in textual documents or extraction is a tedious and error prone task that many researchers suggest automating. We manually annotated the PURE dataset and thus created a new one containing both requirements and non-requirements. Using this dataset, we fine-tuned the BERT model and compare the results with several baselines such as fastText and ELMo. In order to evaluate the model on semantically more complex documents we compare the PURE dataset results with experiments on Request For Information (RFI) documents. The RFIs often include software requirements, but in a less standardized way. The fine-tuned BERT showed promising results on PURE dataset on the binary sentence classification task. Comparing with previous and recent studies dealing with constrained inputs, our approach demonstrates high performance in terms of precision and recall metrics, while being agnostic to the unstructured textual input.

KW - BERT

KW - ELMo

KW - FastText

KW - Requirements elicitation

KW - Sentence classification

KW - Software requirements

UR - https://www.mendeley.com/catalogue/e7bd97ea-a8bd-3d98-b281-aeee3efbb657/

U2 - 10.1007/978-3-031-15168-2_2

DO - 10.1007/978-3-031-15168-2_2

M3 - Chapter

SN - 978-3-031-15167-5

T3 - Recent Trends in Analysis of Images, Social Networks and Texts

SP - 17

EP - 29

BT - Extracting Software Requirements from Unstructured Documents

PB - Springer

ER -

ID: 65524499