
[]
We’ve compiled a list of the most used and useful APIs that are built in to the standard Node.js runtime. For each module you’ll find simple english explanations and examples to help you understand.
This guide has been adapted from my course Node.js: Novice to Ninja. Check it out there to follow comprehensive course to build your own multi-user real time chat application. It also includes quizzes, videos, code to run your own docker containers.
When building your first Node.js application it’s helpful to know what utilities and APIs node offers out of the box to help with common use cases and development needs.
Useful Node.js APIs
- Process: Retrieve information on environment variables, args, CPU usage and reporting.
- OS: Retrieve OS and system related information that Node is running on: CPUs, Operating system version, home directories, etc.
- Util: A collection of useful and common methods that help with decoding text, type checking and comparing objects.
- URL: Easily create and parse URLs.
- File System API: Interact with the file system to create, read, update, and delete files, directories, and permissions.
- Events: For emitting and subscribing to events in Node.js. Works similarly to client-side event listeners.
- Streams: Used to process large amounts of data in smaller and more manageable chunks to avoid memory issues.
- Worker Threads: Used to separate the execution of functions on separate threads to avoid bottleneck. Useful for CPU-intensive JavaScript operations.
- Child Processes: Allows you to run sub-processes that you can monitor and terminate as necessary.
- Clusters: Allow you to fork any number of identical processes across cores to handle the load more efficiently.
Process
The process object provides information about your Node.js application as well as control methods. Use it to get information like environment variables, and CPU and Memory usage. process is available globally: you can use it without import, although the Node.js documentation recommends you explicitly reference it:
import process from ‘process’;
- process.argv returns an array where the first two items are the Node.js executable path and the script name. The item at index 2 is the first argument passed.
- process.env: returns an object containing environment name/value pairs—such as process.env.NODE_ENV.
- process.cwd(): returns the current working directory.
- process.platform: returns a string identifying the operating system: ‘aix’, ‘darwin’ (macOS), ‘freebsd’, ‘linux’, ‘openbsd’, ‘sunos’, or ‘win32’ (Windows).
- process.uptime(): returns the number of seconds the Node.js process has been running.
- process.cpuUsage(): returns the user and system CPU time usage of the current process—such as { user: 12345, system: 9876 }. Pass the object back to the method to get a relative reading.
- process.memoryUsage(): returns an object describing memory usage in bytes.
- process.version: returns the Node.js version string—such as 18.0.0.
- process.report: generates a diagnostic report.
- process.exit(code): exits the current application. Use an exit code of 0 to indicate success or an appropriate error code where necessary.
OS
The os API has similarities to process (see the “Process” section above), but it can also return information about the Operating System Node.js is running in. This provides information such as what OS version, CPUs and up time.
- os.cpus(): returns an array of objects with information about each logical CPU core. The “Clusters” section below references os.cpus() to fork the process. On a 16-core CPU, you’d have 16 instances of your Node.js application running to improve performance.
- os.hostname(): the OS host name.
- os.version(): a string identifying the OS kernel version.
- os.homedir(): the full path of the user’s home directory.
- os.tmpdir(): the full path of the operating system’s default temporary file directory.
- os.uptime(): the number of seconds the OS has been running.
Util
The util module provides an assortment of useful JavaScript methods. One of the most useful is util.promisify(function), which takes an error-first callback style function and returns a promise-based function. The Util module can also help with common patterns like decoding text, type checking, and inspecting objects.
import util from ‘util’; util.types.isDate( new Date() ); util.types.isMap( new Map() ); util.types.isRegExp( /abc/ ); util.types.isAsyncFunction( async () => {} );
URL
URL is another global object that lets you safely create, parse, and modify web URLs. It’s really useful for quickly extracting protocols, ports, parameters and hashes from URLs without resorting to regex. For example:
{ href: ‘https://example.org:8000/path/?abc=123#target’, origin: ‘https://example.org:8000’, protocol: ‘https:’, username: ”, password: ”, host: ‘example.org:8000’, hostname: ‘example.org’, port: ‘8000’, pathname: ‘/path/’, search: ‘?abc=123’, searchParams: URLSearchParams { ‘abc’ => ‘123’ }, hash: ‘#target’ }
You can view and change any property. For example:
myURL.port = 8001; console.log( myURL.href );
You can then use the URLSearchParams API to modify query string values. For example:
myURL.searchParams.delete(‘abc’); myURL.searchParams.append(‘xyz’, 987); console.log( myURL.search );
There are also methods for converting file system paths to URLs and back again.
The dns module provides name resolution functions so you can look up the IP address, name server, TXT records, and other domain information.
File System API
The fs API can create, read, update, and delete files, directories, and permissions. Recent releases of the Node.js runtime provide promise-based functions in fs/promises, which make it easier to manage asynchronous file operations.
You’ll often use fs in conjunction with path to resolve file names on different operating systems.
The following example module returns information about a file system object using the stat and access methods:
import { constants as fsConstants } from ‘fs’; import { access, stat } from ‘fs/promises’; export async function getFileInfo(file) { const fileInfo = {}; try { const info = await stat(file); fileInfo.isFile = info.isFile(); fileInfo.isDir = info.isDirectory(); } catch (e) { return { new: true }; } try { await access(file, fsConstants.R_OK); fileInfo.canRead = true; } catch (e) {} try { await access(file, fsConstants.W_OK); fileInfo.canWrite = true; } catch (e) {} return fileInfo; }
When passed a filename, the function returns an object with information about that file. For example:
{ isFile: true, isDir: false, canRead: true, canWrite: true }
The main filecompress.js script uses path.resolve() to resolve input and output filenames passed on the command line into absolute file paths, then fetches information using getFileInfo() above:
#!/usr/bin/env node import path from ‘path’; import { readFile, writeFile } from ‘fs/promises’; import { getFileInfo } from ‘./lib/fileinfo.js’; let input = path.resolve(process.argv[2] || ”), output = path.resolve(process.argv[3] || ”), [ inputInfo, outputInfo ] = await Promise.all([ getFileInfo(input), getFileInfo(output) ]), error = [];
The code validates the paths and terminates with error messages if necessary:
if (outputInfo.isDir && outputInfo.canWrite && inputInfo.isFile) { output = path.resolve(output, path.basename(input)); } if (!inputInfo.isFile || !inputInfo.canRead) error.push(`cannot read input file ${ input }`); if (input === output) error.push(‘input and output files cannot be the same’); if (error.length) { console.log(‘Usage: ./filecompress.js [input file] [output file|dir]’); console.error(‘n ‘ + error.join(‘n ‘)); process.exit(1); }
The whole file is then read into a string named content using readFile():
console.log(`processing ${ input }`); let content; try { content = await readFile(input, { encoding: ‘utf8’ }); } catch (e) { console.log(e); process.exit(1); } let lengthOrig = content.length; console.log(`file size ${ lengthOrig }`);
JavaScript regular expressions then remove comments and whitespace:
content = content .replace(/ns+/g, ‘n’) .replace(///.*?n/g, ”) .replace(/s+/g, ‘ ‘) .replace(//*.*?*//g, ”) .replace(//g, ”) .replace(/s*([<>(){}}[]])s*/g, ‘$1’) .trim(); let lengthNew = content.length;
The resulting string is output to a file using writeFile(), and a status message shows the saving:
let lengthNew = content.length; console.log(`outputting ${output}`); console.log(`file size ${ lengthNew } – saved ${ Math.round((lengthOrig – lengthNew) / lengthOrig * 100) }%`); try { content = await writeFile(output, content); } catch (e) { console.log(e); process.exit(1); }
Run the project code with an example HTML file:
node filecompress.js ./test/example.html ./test/output.html
Events
You often need to execute multiple functions when something occurs. For example, a user registers on your app, so the code must add their details to a database, start a new logged-in session, and send a welcome email. The Events module :
async function userRegister(name, email, password) { try { await dbAddUser(name, email, password); await new UserSession(email); await emailRegister(name, email); } catch (e) { } }
This series of function calls is tightly coupled to user registration. Further activities incur further function calls. For example:
try { await dbAddUser(name, email, password); await new UserSession(email); await emailRegister(name, email); await crmRegister(name, email); await emailSales(name, email); }
You could have dozens of calls managed in this single, ever-growing code block.
The Node.js Events API provides an alternative way to structure the code using a publish–subscribe pattern. The userRegister() function can emit an event—perhaps named newuser —after the user’s database record is created.
Any number of event handler functions can subscribe and react to newuser events; there’s no need to change the userRegister() function. Each handler runs independently of the others, so they could execute in any order.
Events in Client-side JavaScript
Events and handler functions are frequently used in client-side JavaScript—for example, to run a function when the user clicks an element:
document.getElementById(‘myelement’).addEventListener(‘click’, e => { console.dir(e); });
In most situations, you’re attaching handlers for user or browser events, although you can raise your own custom events. Event handling in Node.js is conceptually similar, but the API is different.
Objects that emit events must be instances of the Node.js EventEmitter class. These have an emit() method to raise new events and an on() method for attaching handlers.
The event example project provides a class that triggers a tick event on predefined intervals. The ./lib/ticker.js module exports a default class that extends EventEmitter:
import EventEmitter from ‘events’; import { setInterval, clearInterval } from ‘timers’; export default class extends EventEmitter {
Its constructor must call the parent constructor. It then passes the delay argument to a start() method:
constructor(delay) { super(); this.start(delay); }
The start() method checks delay is valid, resets the current timer if necessary, and sets the new delay property:
start(delay) { if (!delay || delay == this.delay) return; if (this.interval) { clearInterval(this.interval); } this.delay = delay;
It then starts a new interval timer that runs the emit() method with the event name “tick”. Subscribers to this event receive an object with the delay value and number of seconds since the Node.js application started:C
this.interval = setInterval(() => { this.emit(‘tick’, { delay: this.delay, time: performance.now() }); }, this.delay); } }
The main event.js entry script imports the module and sets a delay period of one second (1000 milliseconds):Copy
import Ticker from ‘./lib/ticker.js’; const ticker = new Ticker(1000);
It attaches handler functions triggered every time a tick event occurs:
ticker.on(‘tick’, e => { console.log(‘handler 1 tick!’, e); }); ticker.on(‘tick’, e => { console.log(‘handler 2 tick!’, e); });
A third handler triggers on the first tick event only using the once() method:
ticker.once(‘tick’, e => { console.log(‘handler 3 tick!’, e); });
Finally, the current number of listeners is output:
console.log(`listeners: ${ ticker.listenerCount(‘tick’) }`);
Run the project code with node event.js.
The output shows handler 3 triggering once, while handler 1 and 2 run on every tick until the app is terminated.
Streams
The file system example code above (in the “File System” section) reads a whole file into memory before outputting the minified result. What if the file was larger than the RAM available? The Node.js application would fail with an “out of memory” error.
The solution is streaming. This processes incoming data in smaller, more manageable chunks. A stream can be:
- readable: from a file, a HTTP request, a TCP socket, stdin, etc.
- writable: to a file, a HTTP response, TCP socket, stdout, etc.
- duplex: a stream that’s both readable and writable
- transform: a duplex stream that transforms data
Each chunk of data is returned as a Buffer object, which represents a fixed-length sequence of bytes. You may need to convert this to a string or another appropriate type for processing.
The example code has a filestream project which uses a transform stream to address the file size problem in the filecompress project. As before, it accepts and validates input and output filenames before declaring a Compress class, which extends Transform:
import { createReadStream, createWriteStream } from ‘fs’; import { Transform } from ‘stream’; class Compress extends Transform { constructor(opts) { super(opts); this.chunks = 0; this.lengthOrig = 0; this.lengthNew = 0; } _transform(chunk, encoding, callback) { const data = chunk.toString(), content = data .replace(/ns+/g, ‘n’) .replace(///.*?n/g, ”) .replace(/s+/g, ‘ ‘) .replace(//*.*?*//g, ”) .replace(//g, ”) .replace(/s*([<>(){}}[]])s*/g, ‘$1’) .trim(); this.chunks++; this.lengthOrig += data.length; this.lengthNew += content.length; this.push( content ); callback(); } }
The _transform method is called when a new chunk of data is ready. It’s received as a Buffer object that’s converted to a string, minified, and output using the push() method. A callback() function is called once chunk processing is complete.
The application initiates file read and write streams and instantiates a new compress object:
const readStream = createReadStream(input), writeStream = createWriteStream(output), compress = new Compress(); console.log(`processing ${ input }`)
The incoming file read stream has .pipe() methods defined, which feed the incoming data through a series of functions that may (or may not) alter the contents. The data is piped through the compress transform before that output is piped to the writeable file. A final on(‘finish’) event handler function executes once the stream has ended:
readStream.pipe(compress).pipe(writeStream).on(‘finish’, () => { console.log(`file size ${ compress.lengthOrig }`); console.log(`output ${ output }`); console.log(`chunks readStream.pipe(compress).pipe(writeStream).on(‘finish’, () => { console.log(`file size ${ compress.lengthOrig }`); console.log(`output ${ output }`); console.log(`chunks ${ compress.chunks }`); console.log(`file size ${ compress.lengthNew } – saved ${ Math.round((compress.lengthOrig – compress.lengthNew) / compress.lengthOrig * 100) }%`); });
Run the project code with an example HTML file of any size:
node filestream.js ./test/example.html ./test/output.html
This is a small demonstration of Node.js streams. Stream handling is a complex topic, and you may not use them often. In some cases, a module such as Express uses streaming under the hood but abstracts the complexity from you.
You should also be aware of data chunking challenges. A chunk could be any size and split the incoming data in inconvenient ways. Consider minifying this code:
Two chunks could arrive in sequence:
Processing each chunk independently results in the following invalid minified script:
The solution is to pre-parse each chunk and split it into whole sections that can be processed. In some cases, chunks (or parts of chunks) will be added to the start of the next chunk.
Minification is best applied to whole lines, although an extra complication occurs because and /* */ comments can span more than one line. Here’s a possible algorithm for each incoming chunk:
- Append any data saved from the previous chunk to the start of the new chunk.
- Remove any whole and /* to */ sections from the chunk.
- Split the remaining chunk into two parts, where part2 starts with the first