FIO30-C. Exclude user input from format strings

Never call a formatted I/O function with a format string containing a tainted value. An attacker who can fully or partially control the contents of a format string can crash a vulnerable process, view the contents of the stack, view memory content, or write to an arbitrary memory location. Consequently, the attacker can execute arbitrary code with the permissions of the vulnerable process [Seacord 2013b]. Formatted output functions are particularly dangerous because many programmers are unaware of their capabilities. For example, formatted output functions can be used to write an integer value to a specified address using the %n conversion specifier.
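
The %n capability mentioned above can be seen in a benign setting. The following is a minimal sketch, not part of the rule itself; the function name chars_before_n() is hypothetical, and some C libraries restrict or disable %n, so it may not behave the same on every platform. It shows a formatted output function storing a character count through a pointer argument, which is the same mechanism an attacker abuses when the format string is under their control.

```c
#include <stdio.h>

/*
 * Hypothetical helper for illustration only.
 * Formats word followed by "!" into a scratch buffer and returns the
 * number of characters produced before the %n directive was reached,
 * or -1 on error or truncation.
 */
int chars_before_n(const char *word) {
  char buf[64];
  int written = -1;
  /* The format string is a constant; only the arguments vary. */
  int ret = snprintf(buf, sizeof(buf), "%s%n!", word, &written);
  if (ret < 0 || (size_t)ret >= sizeof(buf)) {
    return -1;
  }
  /* %n stored the running character count into written. */
  return written;
}
```

Here the write goes to a variable the programmer chose. When an attacker supplies the format string, both the %n directive and, in practice, the address it writes through can be attacker-influenced.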

Noncompliant Code Example

The incorrect_password() function in this noncompliant code example is called during identification and authentication to display an error message if the specified user is not found or the password is incorrect. The function accepts the name of the user as a string referenced by user. The user name is an example of untrusted data because it originates from an unauthenticated user. The function constructs an error message that is then output to stderr using the C Standard fprintf() function.

#include <stdio.h>
#include <stdlib.h>
#include <string.h>
 
void incorrect_password(const char *user) {
  int ret;
  /* User names are restricted to 256 or fewer characters */
  static const char msg_format[] = "%s cannot be authenticated.\n";
  size_t len = strlen(user) + sizeof(msg_format);
  char *msg = (char *)malloc(len);
  if (msg == NULL) {
    /* Handle error */
  }
  ret = snprintf(msg, len, msg_format, user);
  if (ret < 0) { 
    /* Handle error */ 
  } else if ((size_t)ret >= len) {
    /* Handle truncated output */ 
  }
  fprintf(stderr, msg);
  free(msg);
}

The incorrect_password() function calculates the size of the message, allocates dynamic storage, and then constructs the message in the allocated memory using the snprintf() function. The addition operations are not checked for integer overflow because the string referenced by user is known to have a length of 256 or less. Because the two characters of the %s conversion specification are replaced by the string referenced by user in the call to snprintf(), the resulting string needs 2 bytes fewer than are allocated. The snprintf() function is commonly used for messages that are displayed in multiple locations or messages that are difficult to build. However, the resulting code contains a format-string vulnerability because msg includes untrusted user input and is passed as the format-string argument in the call to fprintf().
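
To make the flaw concrete without invoking undefined behavior, the following sketch builds the message the same way incorrect_password() does and then inspects it instead of printing it. The helper names build_message() and count_percent() are hypothetical and exist only for this illustration.

```c
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/*
 * Hypothetical helper: builds the message as incorrect_password() does.
 * Returns a heap-allocated string the caller must free(), or NULL on error.
 */
char *build_message(const char *user) {
  static const char msg_format[] = "%s cannot be authenticated.\n";
  size_t len = strlen(user) + sizeof(msg_format);
  char *msg = (char *)malloc(len);
  if (msg == NULL) {
    return NULL;
  }
  int ret = snprintf(msg, len, msg_format, user);
  if (ret < 0 || (size_t)ret >= len) {
    free(msg);
    return NULL;
  }
  return msg;
}

/*
 * Hypothetical helper: counts '%' characters, each of which a later
 * printf-family call would treat as the start of a conversion
 * specification if the string were used as a format.
 */
size_t count_percent(const char *s) {
  size_t n = 0;
  for (; *s != '\0'; ++s) {
    if (*s == '%') {
      ++n;
    }
  }
  return n;
}
```

For a user name such as "%x%x%n", the resulting message still contains all three percent signs. Passing that message as the format argument to fprintf() would cause each one to be interpreted, even though no corresponding arguments were supplied.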

Compliant Solution (`fputs()`)

This compliant solution fixes the problem by replacing the fprintf() call with a call to fputs(), which outputs msg directly to stderr without evaluating its contents:

#include <stdio.h>
#include <stdlib.h>
#include <string.h>
 
void incorrect_password(const char *user) {
  int ret;
  /* User names are restricted to 256 or fewer characters */
  static const char msg_format[] = "%s cannot be authenticated.\n";
  size_t len = strlen(user) + sizeof(msg_format);
  char *msg = (char *)malloc(len);
  if (msg == NULL) {
    /* Handle error */
  }
  ret = snprintf(msg, len, msg_format, user);
  if (ret < 0) { 
    /* Handle error */ 
  } else if ((size_t)ret >= len) {
    /* Handle truncated output */ 
  }
  fputs(msg, stderr);
  free(msg);
}
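
If a formatted output function must be kept, for example because surrounding code expects one, a related approach is to pass the prebuilt message as an argument to a constant "%s" format. The sketch below assumes a hypothetical helper named print_verbatim(); it is one possible variation, not the form this rule prescribes.

```c
#include <stdio.h>

/*
 * Hypothetical helper: writes msg to out exactly as given.
 * The format string is the constant "%s", so any '%' characters
 * inside msg are output literally rather than interpreted.
 * Returns the value returned by fprintf().
 */
int print_verbatim(FILE *out, const char *msg) {
  return fprintf(out, "%s", msg);
}
```

A call such as print_verbatim(stderr, msg) then behaves like fputs(msg, stderr) with respect to the message contents.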

Compliant Solution (`fprintf()`)

This compliant solution passes the untrusted user input as one of the variadic arguments to fprintf() and not as part of the format string, eliminating the possibility of a format-string vulnerability:

#include <stdio.h>
 
void incorrect_password(const char *user) {
  static const char msg_format[] = "%s cannot be authenticated.\n";
  fprintf(stderr, msg_format, user);
}

Noncompliant Code Example (POSIX)

This noncompliant code example is similar to the first noncompliant code example but uses the POSIX function syslog() [IEEE Std 1003.1:2013] instead of the fprintf() function. The syslog() function is also susceptible to format-string vulnerabilities.

#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <syslog.h>
 
void incorrect_password(const char *user) {
  int ret;
  /* User names are restricted to 256 or fewer characters */
  static const char msg_format[] = "%s cannot be authenticated.\n";
  size_t len = strlen(user) + sizeof(msg_format);
  char *msg = (char *)malloc(len);
  if (msg == NULL) {
    /* Handle error */
  }
  ret = snprintf(msg, len, msg_format, user);
  if (ret < 0) { 
    /* Handle error */ 
  } else if ((size_t)ret >= len) {
    /* Handle truncated output */ 
  }
  syslog(LOG_INFO, msg);
  free(msg);
}

The syslog() function first appeared in BSD 4.2 and is supported by Linux and other modern UNIX implementations. It is not available on Windows systems.

Compliant Solution (POSIX)

This compliant solution passes the untrusted user input as one of the variadic arguments to syslog() instead of including it in the format string:

#include <syslog.h>
 
void incorrect_password(const char *user) {
  static const char msg_format[] = "%s cannot be authenticated.\n";
  syslog(LOG_INFO, msg_format, user);
}

Risk Assessment

Failing to exclude user input from format strings may allow an attacker to crash a vulnerable process, view the contents of the stack, view memory content, or write to an arbitrary memory location and consequently execute arbitrary code with the permissions of the vulnerable process.

Rule	Severity	Likelihood	Remediation Cost	Priority	Level
FIO30-C	High	Likely	Medium	P18	L1

Automated Detection

Tool	Version	Checker	Description
Astrée	24.04		Supported via stubbing/taint analysis
Axivion Bauhaus Suite	7.2.0	CertC-FIO30	Partially implemented
CodeSonar	8.1p0	IO.INJ.FMT, MISC.FMT	Format string injection; Format string
Compass/ROSE
Coverity	2017.07	TAINTED_STRING	Implemented
GCC	4.3.5		Can detect violations of this rule when the `-Wformat-security` flag is used
Helix QAC	2024.1	DF4916, DF4917, DF4918
Klocwork	2024.1	SV.FMTSTR.GENERIC, SV.TAINTED.FMTSTR
LDRA tool suite	9.7.1	86 D	Partially Implemented
Parasoft C/C++test	2023.1	CERT_C-FIO30-a, CERT_C-FIO30-b, CERT_C-FIO30-c	Avoid calling functions printf/wprintf with only one argument other than string constant; Avoid using functions fprintf/fwprintf with only two parameters, when second parameter is a variable; Never use unfiltered data from an untrusted user as the format parameter
PC-lint Plus	1.4	592	Partially supported: reports non-literal format strings
Polyspace Bug Finder	R2023b	CERT C: Rule FIO30-C	Checks for tainted string format (rule partially covered)
PVS-Studio	7.30	V618
Splint	3.1.1

Related Vulnerabilities

Ettercap and Samba provide two examples of format-string vulnerabilities that resulted from violations of this rule.

In Ettercap v.NG-0.7.2, the ncurses user interface suffers from a format-string defect. The curses_msg() function in ec_curses.c calls wdg_scroll_print(), which takes a format string and its parameters and passes it to vw_printw(). The curses_msg() function uses one of its parameters as the format string. This input can include user data, allowing for a format-string vulnerability.
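
The pattern behind this kind of defect is a printf-style wrapper whose callers forward data as the format. The following sketch is not Ettercap's code; ui_msg() and show_user_text() are hypothetical names, and the wrapper writes into a caller-supplied buffer so that it stays self-contained. It shows a variadic wrapper annotated with the GCC/Clang format attribute, which lets the compiler check call sites, and a caller that passes untrusted text as an argument to a constant "%s" format.

```c
#include <stdarg.h>
#include <stdio.h>

/* The format attribute is a GCC/Clang extension; it is omitted elsewhere. */
#if defined(__GNUC__)
#define PRINTF_LIKE(fmt_idx, arg_idx) \
  __attribute__((format(printf, fmt_idx, arg_idx)))
#else
#define PRINTF_LIKE(fmt_idx, arg_idx)
#endif

/* Hypothetical printf-style wrapper: fmt must be trusted by the caller. */
int ui_msg(char *out, size_t out_size, const char *fmt, ...)
    PRINTF_LIKE(3, 4);

int ui_msg(char *out, size_t out_size, const char *fmt, ...) {
  va_list ap;
  va_start(ap, fmt);
  int ret = vsnprintf(out, out_size, fmt, ap);
  va_end(ap);
  return ret;
}

/*
 * Hypothetical caller: untrusted text is passed as an argument,
 * never as the format string itself.
 */
int show_user_text(char *out, size_t out_size, const char *untrusted) {
  return ui_msg(out, out_size, "%s", untrusted);
}
```

With the attribute in place, a call such as ui_msg(buf, sizeof(buf), untrusted) can be flagged by GCC when the -Wformat-security option is enabled.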

The Samba AFS ACL mapping VFS plug-in fails to properly sanitize user-controlled file names that are used in a format specifier supplied to snprintf(). This security flaw becomes exploitable when a user can write to a share that uses Samba's afsacl.so library for setting Windows NT access control lists on files residing on an AFS file system.

Search for vulnerabilities resulting from the violation of this rule on the CERT website.

Related Guidelines

Taxonomy	Taxonomy item	Relationship
CERT Oracle Secure Coding Standard for Java	IDS06-J. Exclude unsanitized user input from format strings	Prior to 2018-01-12: CERT: Unspecified Relationship
CERT Perl Secure Coding Standard	IDS30-PL. Exclude user input from format strings	Prior to 2018-01-12: CERT: Unspecified Relationship
ISO/IEC TR 24772:2013	Injection [RST]	Prior to 2018-01-12: CERT: Unspecified Relationship
ISO/IEC TS 17961:2013	Including tainted or out-of-domain input in a format string [usrfmt]	Prior to 2018-01-12: CERT: Unspecified Relationship
CWE 2.11	CWE-134, Uncontrolled Format String	2017-05-16: CERT: Exact
CWE 2.11	CWE-20, Improper Input Validation	2017-05-17: CERT: Rule subset of CWE

Bibliography

[IEEE Std 1003.1:2013]	XSH, System Interfaces, `syslog`
[Seacord 2013b]	Chapter 6, "Formatted Output"
[Viega 2005]	Section 5.2.23, "Format String Problem"
