{"id":125337,"date":"2024-10-19T05:06:38","date_gmt":"2024-10-19T05:06:38","guid":{"rendered":"https:\/\/pdfstandards.shop\/product\/uncategorized\/ieee-3168-2024-2\/"},"modified":"2024-10-24T23:14:59","modified_gmt":"2024-10-24T23:14:59","slug":"ieee-3168-2024-2","status":"publish","type":"product","link":"https:\/\/pdfstandards.shop\/product\/publishers\/ieee\/ieee-3168-2024-2\/","title":{"rendered":"IEEE 3168-2024"},"content":{"rendered":"
New IEEE Standard – Active. Natural language processing (NLP) services that use machine learning are applied to a wide range of tasks and are widely deployed, usually through application programming interface (API) calls. The robustness of these services is challenged by well-known general corruptions and adversarial attacks. General corruptions include the inadvertent or random deletion, addition, or repetition of characters or words. Adversarial attacks generate adversarial characters, words, or sentences that cause the models underpinning the NLP services to produce incorrect results. This standard proposes a method for quantitatively evaluating the robustness of NLP services. The method specifies the cases against which the evaluation is to be performed and defines robustness metrics and their calculation. With the standard, service stakeholders, including service developers, service providers, and service users, can develop an understanding of the robustness of the services. The evaluation can be performed during various phases in the life cycle of an NLP service, such as the testing phase, the validation phase, and after deployment.<\/p>\n
PDF Pages<\/th>\n | PDF Title<\/th>\n<\/tr>\n | ||||||
---|---|---|---|---|---|---|---|
1<\/td>\n | IEEE Std 3168\u2122-2024 Front cover <\/td>\n<\/tr>\n | ||||||
2<\/td>\n | Title page <\/td>\n<\/tr>\n | ||||||
4<\/td>\n | Important Notices and Disclaimers Concerning IEEE Standards Documents <\/td>\n<\/tr>\n | ||||||
8<\/td>\n | Participants <\/td>\n<\/tr>\n | ||||||
9<\/td>\n | Introduction <\/td>\n<\/tr>\n | ||||||
10<\/td>\n | Contents <\/td>\n<\/tr>\n | ||||||
11<\/td>\n | 1.\u2002Overview 1.1\u2002Scope 1.2\u2002Purpose 1.3\u2002Word usage <\/td>\n<\/tr>\n | ||||||
12<\/td>\n | 2.\u2002Normative references 3.\u2002Definitions, acronyms, and abbreviations 3.1\u2002Definitions 3.2\u2002Acronyms and abbreviations <\/td>\n<\/tr>\n | ||||||
13<\/td>\n | 4.\u2002Evaluation target 5.\u2002Evaluation cases for NLP services 5.1\u2002Overview of evaluation cases <\/td>\n<\/tr>\n | ||||||
14<\/td>\n | 5.2\u2002General corruptions <\/td>\n<\/tr>\n | ||||||
15<\/td>\n | 5.3\u2002Adversarial attacks 6.\u2002Robustness metrics of NLP services 6.1\u2002Metrics overview <\/td>\n<\/tr>\n | ||||||
16<\/td>\n | 6.2\u2002Utility metrics 6.3\u2002Corruption resistant metrics 6.4\u2002Adversarial resistant metrics <\/td>\n<\/tr>\n | ||||||
17<\/td>\n | 6.5\u2002Quality metrics 6.6\u2002Metrics calculation for NLP services <\/td>\n<\/tr>\n | ||||||
24<\/td>\n | 7.\u2002Test cases 7.1\u2002Test cases for utility metrics <\/td>\n<\/tr>\n | ||||||
25<\/td>\n | 7.2\u2002Test cases for general corruption 7.3\u2002Test cases for adversarial attacks <\/td>\n<\/tr>\n | ||||||
27<\/td>\n | Annex\u00a0A (Informative) Defense against adversarial attacks <\/td>\n<\/tr>\n | ||||||
28<\/td>\n | Annex\u00a0B (Informative) Bibliography <\/td>\n<\/tr>\n | ||||||
29<\/td>\n | Back cover <\/td>\n<\/tr>\n<\/table>\n","protected":false},"excerpt":{"rendered":" IEEE Standard for Robustness Evaluation Test Methods for a Natural Language Processing Service That Uses Machine Learning (Published)<\/b><\/p>\n |