A hash function is any function that can be used to map data of arbitrary size to fixed-size values, although some hash functions support variable-length output.^[1] The values returned by a hash function are called hash values, hash codes, hash digests, digests, or simply hashes.^[2] The values are usually used to index a fixed-size table called a hash table. Use of a hash function to index a hash table is called hashing or scatter storage addressing.
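
As a minimal sketch of this mapping, the following Python code reduces byte strings of any length to a 32-bit value and then to a slot in a fixed-size table. The 32-bit FNV-1a function is used only as one well-known illustrative choice; the helper name `table_index` and the table size of 16 are assumptions made for this example.

```python
# Published parameters of the 32-bit FNV-1a hash.
FNV_OFFSET_BASIS_32 = 0x811C9DC5
FNV_PRIME_32 = 0x01000193


def fnv1a_32(data: bytes) -> int:
    """Map a byte string of any length to a 32-bit integer (FNV-1a)."""
    h = FNV_OFFSET_BASIS_32
    for byte in data:
        h ^= byte                            # mix in the next input byte
        h = (h * FNV_PRIME_32) & 0xFFFFFFFF  # multiply, keep the low 32 bits
    return h


def table_index(key: str, table_size: int) -> int:
    """Hypothetical helper: reduce the hash value to a slot in a fixed-size table."""
    return fnv1a_32(key.encode("utf-8")) % table_size


if __name__ == "__main__":
    # Keys of very different lengths all yield a 32-bit value and a slot in 0..15.
    for key in ("a", "hash", "a much longer key " * 10):
        value = fnv1a_32(key.encode("utf-8"))
        print(f"{len(key):4d} chars -> 0x{value:08x} -> slot {table_index(key, 16)}")
```

Reducing the hash value modulo the table size is the simplest way to obtain an index; other reduction schemes exist and may be preferable for particular table sizes.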

Hash functions and their associated hash tables are used in data storage and retrieval applications to access data in a small and nearly constant time per retrieval. They require an amount of storage space only fractionally greater than the total space required for the data or records themselves. Hashing is efficient in both computation and storage space: it avoids the non-constant access time of ordered and unordered lists and structured trees, and the often exponential storage requirements of direct access to state spaces of large or variable-length keys.

Use of hash functions relies on statistical properties of key and function interaction: worst-case behaviour is intolerably bad but rare, and average-case behaviour can be nearly optimal (minimal collision).^[3]^: 527
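
The gap between worst-case and average-case behaviour can be illustrated with a small experiment. The sketch below places the same 1000 keys into 1024 buckets using two hypothetical hash functions: `length_hash`, a deliberately poor choice that interacts badly with these keys, and `mixed_hash`, which borrows BLAKE2b from Python's `hashlib` purely as a convenient well-mixed stand-in. The key format, table size, and function names are all assumptions made for this illustration.

```python
import hashlib
from collections import Counter


def longest_chain(keys, hash_fn, table_size):
    """Return the number of keys in the fullest bucket."""
    buckets = Counter(hash_fn(k) % table_size for k in keys)
    return max(buckets.values())


def length_hash(key: str) -> int:
    """Deliberately poor hash: every key of the same length collides."""
    return len(key)


def mixed_hash(key: str) -> int:
    """Well-mixed 64-bit hash (BLAKE2b is used here only for illustration)."""
    digest = hashlib.blake2b(key.encode("utf-8"), digest_size=8).digest()
    return int.from_bytes(digest, "big")


# 1000 keys that all have the same length (7 characters).
keys = [f"key{i:04d}" for i in range(1000)]

if __name__ == "__main__":
    # length_hash sends every key to one bucket, so lookups degrade to a linear scan.
    print("length_hash longest chain:", longest_chain(keys, length_hash, 1024))
    # mixed_hash spreads the keys out, so chains stay short.
    print("mixed_hash longest chain: ", longest_chain(keys, mixed_hash, 1024))
```

With `length_hash`, all 1000 keys share a single bucket, which is the rare but intolerable worst case. With a well-mixed function the fullest bucket holds only a handful of keys.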

Hash functions are related to (and often confused with) checksums, check digits, fingerprints, lossy compression, randomization functions, error-correcting codes, and ciphers. Although the concepts overlap to some extent, each one has its own uses and requirements and is designed and optimized differently. The hash function differs from these concepts mainly in terms of data integrity. Hash tables may use non-cryptographic hash functions, while cryptographic hash functions are used in cybersecurity to secure sensitive data such as passwords.
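
To make the distinction concrete, the sketch below applies three functions from the Python standard library to the same message: `zlib.crc32` (a checksum), `hashlib.sha256` (a cryptographic hash with fixed-size output), and `hashlib.shake_128` (a cryptographic function whose output length is chosen by the caller). The wrapper `summarize` and the sample message are assumptions made for this example, and the sketch is not a recommendation for how to store passwords.

```python
import hashlib
import zlib


def summarize(message: bytes) -> dict:
    """Hypothetical helper: apply a checksum and two cryptographic functions."""
    return {
        # 32-bit checksum, designed to detect accidental corruption.
        "crc32": zlib.crc32(message),
        # Cryptographic hash with a fixed 256-bit (32-byte) digest.
        "sha256": hashlib.sha256(message).hexdigest(),
        # Extendable-output function: the caller picks the digest length in bytes.
        "shake128_8": hashlib.shake_128(message).hexdigest(8),
        "shake128_32": hashlib.shake_128(message).hexdigest(32),
    }


if __name__ == "__main__":
    for name, value in summarize(b"The quick brown fox").items():
        print(f"{name:12s} {value}")
```

The checksum and the SHA-256 digest always have the same size regardless of input length, whereas the SHAKE128 output grows or shrinks on request.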

Overview

Hash tables

Specialized uses

Properties

Uniformity

Testing and measurement

Efficiency

Universality

Applicability

Deterministic

Defined range

Variable range

Variable range with minimal movement (dynamic hash function)

Data normalization

Hashing integer data types

Identity hash function

Trivial hash function

Folding

Mid-squares

Division hashing

Algebraic coding

Unique permutation hashing

Multiplicative hashing

Fibonacci hashing

Zobrist hashing

Customised hash function

Hashing variable-length data

Middle and ends

Character folding

Word length folding

Radix conversion hashing

Rolling hash

Fuzzy hash

Perceptual hash

Analysis

History

See also

Notes

References

External links