Reference

Opening reference

Preparing the article, table of contents, and metadata. 본문, 목차, 메타데이터를 준비하고 있습니다.

Loading category cards

Preparing references and filters for this topic. 이 주제의 레퍼런스와 필터를 준비하고 있습니다.

C구조체와 문자열

`isdigit`, `isalpha`, `isspace`, `toupper`, `tolower`는 입력 검증과 문자 정규화에서 자주 쓰입니다. 특히 `unsigned char` 캐스트가 왜 필요한지 중심으로 정리합니다.

마지막 수정 2026년 4월 6일

ctype.h 함수는 문자 분류와 변환 함수이지만, 실제 핵심은 unsigned char 전제를 지키며 입력을 안전하게 분류하는 데 있습니다.

if (isdigit((unsigned char)c)) { }
if (isspace((unsigned char)c)) { }
ch = (char)tolower((unsigned char)ch);

ctype.h 함수는 보통 두 종류로 나뉩니다.

반환값은 bool처럼 보여도 정확히 1이 아니라 0/비0로 읽어야 합니다.

int c = getchar();

if (c != EOF && isdigit((unsigned char)c)) {
    puts("digit");
}

문자열 전체를 정규화할 때는 이런 패턴이 자주 쓰입니다.

void to_lower_str(char *s) {
    for (; *s != '\0'; s++) {
        *s = (char)tolower((unsigned char)*s);
    }
}

이 카드의 핵심은 (unsigned char) 캐스트입니다. ctype.h 함수는 unsigned char 범위 값이나 EOF를 기대하므로, signed char가 음수로 해석되는 환경에서 그대로 넘기면 정의되지 않은 동작이 될 수 있습니다.

char ch = '\xE9';

/* if (isalpha(ch)) { } */          // 위험할 수 있음
if (isalpha((unsigned char)ch)) { } // 안전한 쪽

ctype.h 함수는 작고 익숙해서 대충 써도 될 것처럼 보이지만, signed char 환경에서의 UB와 == 1 같은 잘못된 반환값 비교 습관 때문에 미묘한 버그가 나오기 쉽습니다. 입력 검증 코드일수록 캐스트를 빼먹지 않는 편이 좋습니다.

1 sources