Software vulnerabilities are flaws in code that malicious actors can exploit. Language models pre-trained on source code, such as CodeBERT, GraphCodeBERT, and CodeT5, can detect these vulnerabilities, explain them, and even recommend patches, which makes them useful tools for organizations looking to strengthen their security posture.
AIBugHunter, a VSCode extension, already uses such models to support software security during development.
While ChatGPT and other large language models excel at code-related tasks, no comprehensive study has assessed their potential across the entire vulnerability workflow, from prediction and classification to severity estimation and repair.
Recently, cybersecurity researchers from Monash University, Clayton, Australia, explored ChatGPT's use in software vulnerability tasks, including vulnerability prediction, classification, severity estimation, and repair.
Previous studies have examined large language models for automated program repair, but not the latest ChatGPT versions.
The researchers analyzed ChatGPT's ability on four vulnerability tasks: function and line-level prediction, CWE-ID classification, severity estimation, and automated repair.
ChatGPT's 1.7 trillion parameters vastly exceed those of source-code-oriented models like CodeBERT. Because ChatGPT's weights are proprietary, fine-tuning it for vulnerability tasks is not possible, making prompt-based usage essential.
An example prompt for function and line-level vulnerability prediction (Source – Arxiv)
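A prompt of this kind can be assembled programmatically. The following minimal Python sketch illustrates the idea; the template wording, the `make_prediction_prompt` helper, and the example function are assumptions for illustration, not the paper's exact prompt.

```python
def make_prediction_prompt(function_source: str) -> str:
    # Hypothetical template asking for both a function-level verdict
    # and line-level localization, as in the study's two prediction tasks.
    return (
        "Analyze the following C/C++ function.\n"
        "1. Is the function vulnerable? Answer YES or NO.\n"
        "2. If YES, list the line numbers of the vulnerable lines.\n\n"
        f"```\n{function_source}\n```"
    )

# Example: a function with a classic unbounded-copy weakness.
prompt = make_prediction_prompt(
    "void copy(char *dst, char *src) {\n    strcpy(dst, src);\n}"
)
```

The prompt string would then be sent as a user message to the chat model; the reply must be parsed back into a label and a list of line numbers.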
The researchers evaluated ChatGPT (gpt-3.5-turbo and gpt-4) against code-specific models.
They compared it with AIBugHunter, CodeBERT, GraphCodeBERT, and VulExplainer on four vulnerability tasks using Big-Vul and CVEFixes datasets, addressing four research questions.
Here are all four research questions, along with their respective results:
(RQ1) How accurate is ChatGPT for function and line-level vulnerability predictions?
(RQ2) How accurate is ChatGPT for vulnerability type classification?
(RQ3) How accurate is ChatGPT for vulnerability severity estimation?
(RQ4) How accurate is ChatGPT for automated vulnerability repair?
Prompt for CWE-ID classification (Source – Arxiv)
ChatGPT failed to produce correct repair patches, whereas fine-tuned baselines repaired 7%-30% of vulnerabilities. BLEU and METEOR scores confirm that the baseline patches are closer to the ground-truth patches. This highlights the difficulty of automated vulnerability repair and suggests that ChatGPT requires domain-specific fine-tuning for the task.
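BLEU measures how much n-gram overlap a generated patch shares with the ground-truth patch. The following self-contained Python sketch implements a simplified sentence-level BLEU (modified n-gram precision, geometric mean, brevity penalty, no smoothing) to illustrate the idea; it is not the exact scorer used in the study.

```python
import math
from collections import Counter

def ngrams(tokens, n):
    # Multiset of all n-grams in the token sequence.
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def bleu(reference, candidate, max_n=4):
    # Modified n-gram precision for n = 1..max_n.
    precisions = []
    for n in range(1, max_n + 1):
        ref, cand = ngrams(reference, n), ngrams(candidate, n)
        overlap = sum((ref & cand).values())          # clipped matches
        precisions.append(overlap / max(sum(cand.values()), 1))
    if min(precisions) == 0:
        return 0.0
    geo_mean = math.exp(sum(math.log(p) for p in precisions) / max_n)
    # Brevity penalty discourages overly short candidates.
    bp = min(1.0, math.exp(1 - len(reference) / len(candidate)))
    return bp * geo_mean

# Example: comparing a generated patch to the ground-truth patch, token by token.
reference = "memcpy ( dst , src , n )".split()
candidate = "memcpy ( dst , src , len )".split()
score = bleu(reference, candidate)
```

A score near 1.0 means the generated patch is nearly identical to the ground truth, which is why the baselines' higher BLEU and METEOR scores indicate their patches are closer to the true fixes.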