Notepad [DOS Charset Edition] — Legacy DOS Charset Toolkit
What it is
- A lightweight text editor focused on accurately displaying and editing files encoded with legacy DOS/OEM code pages (e.g., CP437, CP850, CP866).
Key features
- Code page selection: Open and save files using multiple DOS code pages.
- Accurate glyph rendering: Maps OEM characters (box-drawing, accented letters, special symbols) to visible glyphs rather than replacing them with �.
- Automatic detection: Heuristics to guess the most likely DOS code page on open.
- Conversion tools: Convert between DOS code pages and Unicode (UTF-8/UTF-16) preserving character semantics.
- Line ending support: Correctly handles CR/LF and mixed line endings common in DOS-era files.
- Byte-level view: Hex/byte inspector to examine raw byte values and manual edit.
- Search & replace: Byte-aware search supporting code-page-specific patterns.
- Font options: Bundled monospaced fonts optimized for OEM glyphs or ability to use system bitmap fonts.
- Portable mode: Single executable with no installer for use on removable media.
Use cases
- Restoring or reading old documentation and configuration files from DOS-era systems.
- Editing code or scripts originally written with OEM encodings.
- Converting archival text collections to Unicode for modern use.
- Viewing legacy game text, logs, or BBS files that rely on box-drawing characters.
Limitations
- Not intended as a full IDE — minimal syntax highlighting.
- Some rare OEM glyphs may depend on available fonts; exact visual match to vintage systems can vary.
- Automatic detection may occasionally guess incorrectly for very short files.
Getting started
- Open a file and select the detected code page or choose one manually (CP437, CP850, CP866, etc.).
- Use the byte-level view to verify unusual characters.
- Convert to UTF-8 if you need modern compatibility; keep a backup of the original bytes.
Compatibility
- Windows-focused (native font and OEM support), but can run on other OSes with bundled fonts and rendering fallback.