You do not have permission to edit this page, for the following reasons:

This IP address has been blocked from editing Wikipedia.
This does not affect your ability to read Wikipedia pages.
Most people who see this message have done nothing wrong. Some kinds of blocks restrict editing from specific service providers or telecom companies in response to recent abuse or vandalism, and can sometimes affect other users who are unrelated to that abuse. Review the information below for assistance if you do not believe that you have done anything wrong.

The IP address or range 44.223.0.0/16 has been blocked by ‪ST47‬ for the following reason(s):

The IP address that you are currently using has been blocked because it is believed to be a web host provider. To prevent abuse, web hosts may be blocked from editing Wikipedia.
You will not be able to edit Wikipedia using a web host provider because it hides your IP address, much like a proxy or VPN.
We recommend that you attempt to use another connection to edit. For example, if you use a proxy or VPN to connect to the internet, turn it off when editing Wikipedia. If you edit using a mobile connection, try using a Wi-Fi connection, and vice versa. If you are using a corporate internet connection, switch to a different Wi-Fi network. If you have a Wikipedia account, please log in.
If you do not have any other way to edit Wikipedia, you will need to request an IP block exemption.

How to appeal if you are confident that your connection does not use a web host's IP address:
If you are confident that you are not using a web host, you may appeal this block by adding the following text on your talk page: ((unblock|reason=Caught by a web host block but this host or IP is not a web host. My IP address is _______. Place any further information here. ~~~~)). You must fill in the blank with your IP address for this block to be investigated. Your IP address can be determined here. Alternatively, if you wish to keep your IP address private you can use the unblock ticket request system. There are several reasons you might be editing using the IP address of a web host (such as if you are using VPN software or a business network); please use this method of appeal only if you think your IP address is in fact not a web host.

Administrators: The IP block exemption user right should only be applied to allow users to edit using web host in exceptional circumstances, and they should usually be directed to the functionaries team via email. If you intend to give the IPBE user right, a CheckUser needs to take a look at the account. This can be requested most easily at SPI Quick Checkuser Requests. Unblocking an IP or IP range with this template is highly discouraged without at least contacting the blocking administrator.

This block will expire on 23:54, 31 July 2024. Your current IP address is 44.223.80.188.

Even when blocked, you will usually still be able to edit your user talk page, as well as email administrators and other editors.

For information on how to proceed, please read the FAQ for blocked users and the guideline on block appeals. The guide to appealing blocks may also be helpful.

Other useful links: Blocking policy · Help:I have been blocked
This IP address range has been globally blocked.
This does not affect your ability to read Wikipedia pages.
Most people who see this message have done nothing wrong. Some kinds of blocks restrict editing from specific service providers or telecom companies in response to recent abuse or vandalism, and can sometimes affect other users who are unrelated to that abuse. Review the information below for assistance if you do not believe that you have done anything wrong.

This block affects editing on all Wikimedia wikis.
The IP address or range 44.223.0.0/16 has been globally blocked by ‪AntiCompositeNumber‬ for the following reason(s):

Open proxy/Webhost: Visit the FAQ if you are affected

This block will expire on 13:57, 18 November 2024. Your current IP address is 44.223.80.188.

Even while globally blocked, you will usually still be able to edit pages on Meta-Wiki.

If you believe you were blocked by mistake, you can find additional information and instructions in the No open proxies global policy. Otherwise, to discuss the block please post a request for review on Meta-Wiki. You could also send an email to the stewards VRT queue at stewards@wikimedia.org including all above details.

Other useful links: Global blocks · Help:I have been blocked

You can view and copy the source of this page:

((Short description|Process of converting physical media into digital media))
((Multiple issues|
((More citations needed|date=January 2016))
((Original research|date=January 2016))
))

[[Image:Scribe Book Scanner.jpg|thumb|[[Internet Archive]] Scribe book scanner in 2011]]
[[File:Internet Archive book scanner 1.jpg|thumb|Internet Archive book scanner]]
'''Book scanning''' or '''book digitization''' (also: '''magazine scanning''' or '''magazine digitization''') is the process of converting physical [[book]]s and [[magazine]]s into [[digital media]] such as [[digital image|images]], [[electronic text]], or [[e-book|electronic books]] (e-books) by using an [[image scanner]].<ref name="hurix">((cite web |url=https://www.hurix.com/digitizing-books-at-scale/ |title=6 Factors to Consider while Digitizing Books at Scale |date=July 22, 2019 |website=hurixdigital |access-date=October 17, 2022 |archive-url=https://web.archive.org/web/20220117230715/https://www.hurix.com/digitizing-books-at-scale/ |archive-date=January 17, 2022))</ref> Large scale book scanning projects have made many books available online.<ref name="kitaboo">((cite web |url=https://kitaboo.com/digitization-for-book-publishers/ |title=An 8-Step Guide to Digitization for Book Publishers |last=Harman |first=Mike |date=March 23, 2021 |website=Kitaboo |access-date=October 17, 2022 |archive-url=https://web.archive.org/web/20220122214549/https://kitaboo.com/digitization-for-book-publishers/ |archive-date=January 22, 2022))</ref>

Digital books can be easily distributed, reproduced, and [[screen reading|read on-screen]]. Common file formats are [[DjVu]], [[Portable Document Format]] (PDF), and [[Tag Image File Format]] (TIFF). To convert the raw images [[optical character recognition]] (OCR)<ref name="hurix" /> is used to turn book pages into a digital text format like [[ASCII]] or other similar format, which reduces the file size and allows the text to be reformatted, searched, or processed by other applications.<ref name="hurix" />

Image scanners may be manual or automated. In an ordinary commercial image scanner, the book is placed on a flat glass plate (or platen), and a light and optical array moves across the book underneath the glass. In manual book scanners, the glass plate extends to the edge of the scanner, making it easier to line up the book's spine.<ref name="hurix" /><ref name="kitaboo" />

A problem with scanning bound books is that when a book that is not very thin is laid flat, the part of the page close to the spine (the gutter) is significantly curved, distorting the text in that part of the scan. One solution is to separate the book into separate pages by cutting or unbinding. A non-destructive method is to hold the book in a V-shaped holder and photograph it, rather than lay it flat and scan it. The curvature in the gutter is much less pronounced this way.<ref>((Cite web |title=A Scanner for books with text VERY close to the gutter |author=JThomas |website=DIY Book Scanner |date=April 2012 |url= https://diybookscanner.org/forum/viewtopic.php?t=2549))</ref> Pages may be turned by hand or by automated paper transport devices. Transparent plastic or glass sheets are usually pressed against the page to flatten it.

After scanning, software adjusts the document images by lining it up, cropping it, picture-editing it, and converting it to text and final e-book form. Human proofreaders usually check the output for errors.

Scanning at ((nowrap|118 dots/centimeter)) (((nowrap|300 [[dots per inch|dpi]]))) is adequate for conversion to digital text output, but for archival reproduction of rare, elaborate or illustrated books, much higher resolution is used.((citation needed|date=March 2013)) High-end scanners capable of thousands of pages per hour can cost thousands of dollars, but [[do-it-yourself]] (DIY), manual book scanners capable of 1200 pages per hour have been built for US$300.<ref name="instructables">((cite web|url=http://www.instructables.com/id/DIY-High-Speed-Book-Scanner-from-Trash-and-Cheap-C/|title=DIY High-Speed Book Scanner from Trash and Cheap Cameras|publisher=instructables.com|access-date=19 January 2014))</ref>

==Commercial book scanners==
[[File:V-shaped-cradle - en.svg|thumb|Sketch of a V-shaped book scanner from Atiz]]
[[File:book scanner.svg|thumb|Sketch of a typical manual book scanner]]
Commercial book scanners are not like normal [[image scanner|scanners]]; these book scanners are usually a high quality [[digital camera]] with light sources on either side of the camera mounted on some sort of frame to provide easy access for a person or machine to flip through the pages of the book. Some models involve V-shaped book cradles, which provide support for book spines and also center book position automatically.

The advantage of this type of scanner is that it is very fast, compared to the productivity of overhead scanners.

==Large-scale projects==
((More citations needed section|date=January 2016))
Projects like [[Project Gutenberg]] (est. 1971),<ref>((cite web |url=https://www.openculture.com/2019/09/libraries-archivists-are-digitizing-480000-books.html |title=Libraries & Archivists Are Digitizing 480,000 Books Published in 20th Century That Are Secretly in the Public Domain |date=September 27, 2019 |website=Open Culture |access-date=October 19, 2022 |archive-url=https://web.archive.org/web/20191002033016/https://www.openculture.com/2019/09/libraries-archivists-are-digitizing-480000-books.html |archive-date=October 2, 2019))</ref> [[Million Book Project]] (est. circa 2001), [[Google Books]] (est. 2004), and the [[Open Content Alliance]] (est. 2005) scan books on a large scale.<ref name="monday">((cite journal |url=https://firstmonday.org/ojs/index.php/fm/article/view/2101/2037 |title=Mass book digitization: The deeper story of Google Books and the Open Content Alliance |last=Leetaru |first=Kalev |journal=First Monday |year=2008 |doi=10.5210/fm.v13i10.2101 |access-date=October 19, 2022 |doi-access= free))</ref><ref name="educause">((cite web |url=https://er.educause.edu/articles/2017/3/transforming-our-libraries-from-analog-to-digital-a-2020-vision |title=Transforming Our Libraries from Analog to Digital: A 2020 Vision |last=Kahle |first=Brewster |date=March 13, 2017 |website=Educause |access-date=October 19, 2022 |archive-url=https://web.archive.org/web/20170315224032/https://er.educause.edu/articles/2017/3/transforming-our-libraries-from-analog-to-digital-a-2020-vision |archive-date=March 15, 2017))</ref>

One of the main challenges to this is the sheer volume of books that must be scanned. In 2010 the total number of works appearing as books in human history was estimated to be around 130&nbsp;million.<ref>((cite web|last=Taycher |first=Leonid |url=http://googleblog.blogspot.co.at/2010/08/you-can-count-number-of-books-in-world.html |title=As of Aug 5, 2010, google estimates that there are 129,864,880 different books in the world |publisher=Googleblog.blogspot.co.at |date=2010-08-05 |access-date=2014-08-08))</ref> All of these must be scanned and then made searchable online for the public to use as a [[universal library]]. Currently, there are three main ways that large organizations are relying on: outsourcing, scanning in-house using commercial book scanners, and scanning in-house using robotic scanning solutions.

As for outsourcing, books are often shipped to be scanned by low-cost sources to [[India]] or [[China]]. Alternatively, due to convenience, safety and technology improvement, many organizations choose to scan in-house by using either overhead scanners which are time-consuming, or digital camera-based scanning machines which are substantially faster and is a method employed by Internet Archive as well as Google.<ref name="educause" /><ref name="effort" /> Traditional methods have included cutting off the book's spine and scanning the pages in a [[image scanner|scanner]] with automatic page-feeding capability, with subsequent rebinding of the loose pages.

Once the page is scanned, the [[data]] is either entered manually or via OCR, another major cost of the book scanning projects.((According to whom|date=January 2016))

Due to [[copyright]] issues, most scanned books are those that are out of copyright; however, Google Books is known to scan books still protected under copyright unless the [[publisher]] specifically prohibits this.<ref name="monday" /><ref name="educause" /><ref name="effort">((cite web |url=https://www.edsurge.com/news/2017-08-10-what-happened-to-google-s-effort-to-scan-millions-of-university-library-books |title=What Happened to Google's Effort to Scan Millions of University Library Books? |last=Howard |first=Jennifer |date=August 10, 2017 |website=EdSurge |access-date=October 17, 2022 |archive-url=https://web.archive.org/web/20220105135731/https://www.edsurge.com/news/2017-08-10-what-happened-to-google-s-effort-to-scan-millions-of-university-library-books |archive-date=January 5, 2022))</ref><ref>((cite web |url=https://www.theatlantic.com/technology/archive/2017/04/the-tragedy-of-google-books/523320/ |title=Torching the Modern-Day Library of Alexandria |last=Somers |first=James |date=April 20, 2017 |website=The Atlantic |access-date=October 19, 2022 |archive-url=https://web.archive.org/web/20170420190006/https://www.theatlantic.com/technology/archive/2017/04/the-tragedy-of-google-books/523320/ |archive-date=April 20, 2017))</ref>

===Collaborative projects===
There are many collaborative digitization projects throughout the United States.  Two of the earliest projects were the Collaborative Digitization Project in Colorado and [[NC ECHO]] – North Carolina Exploring Cultural Heritage Online,<ref>((cite web|url=http://www.ncecho.org/|title=North Carolina ECHO : Exploring Cultural Heritage Online|work=ncecho.org))</ref> based at the [[State Library of North Carolina]].

These projects establish and publish best practices for digitization and work with regional partners to digitize cultural heritage materials. Additional criteria for best practices have more recently been established in the UK, Australia and the European Union.<ref>((cite journal |url=http://www.ariadne.ac.uk/issue43/awre-rvw/ |title=Digital Libraries: Principles and Practice in a Global Environment |last=Awre |first=Chris |date=April 30, 2005 |journal=Ariadne |issue=43 |access-date=October 19, 2022 |archive-url=https://web.archive.org/web/20220405041401/http://www.ariadne.ac.uk/issue/43/awre-rvw/ |archive-date=April 5, 2022))</ref> [[Wisconsin Heritage Online]]<ref>((cite web|url=http://wisconsinheritage.org/|title=Recollection Wisconsin|date=29 November 2006))</ref> is a collaborative digitization project modeled after the Colorado Collaborative Digitization Project. Wisconsin uses a [[wiki]]<ref>((cite web|url=http://wisheritage.pbworks.com/|title=Wisconsin Heritage Online [licensed for non-commercial use only] / FrontPage|work=pbworks.com))</ref> to build and distribute collaborative documentation. Georgia's collaborative digitization program, the Digital Library of Georgia,<ref>((cite web|url=http://dlg.galileo.usg.edu/|title=Welcome to the Digital Library of Georgia|work=usg.edu))</ref> presents a seamless virtual library on the state's history and life, including more than a hundred digital collections from 60 institutions and 100 agencies of government. The [[Digital Library of Georgia]] is a [[Georgia Library Learning Online|GALILEO]]<ref>((cite web|url=http://www.galileo.usg.edu|title=GALILEO|work=usg.edu))</ref> initiative based at the University of Georgia Libraries.

In the twentieth century, the [[Hill Museum and Manuscript Library]] photographed books in Ethiopia that were subsequently destroyed amidst political violence in 1975. The library has since worked to photograph manuscripts in Middle Eastern countries.<ref>((cite news|title=Codices decoded|publisher=The Economist|date=18 December 2010|page=151))</ref>

In South Asia, the Nanakshahi trust is digitizing manuscripts of [[Gurmukhī alphabet|Gurmukhī script]].

In Australia, there have been many collaborative projects between the [[National Library of Australia]] and universities to improve the repository infrastructure that digitized information would be stored in.<ref>Libraries in the twenty-first century: Charting new directions in information services. Edited by Stuart Ferguson, 2007, pg 84</ref> Some of these projects include, the ARROW (Australian Research Repositories Online to the World) project and the APSR (Australian Partnership for Sustainable Repository) project.

==Destructive scanning methods==
((Unreferenced|section|date=January 2016))
For book scanning on a low budget, the least expensive way to scan a book or magazine is to cut off the binding. This converts the book or magazine into a sheaf of separate sheets which can be loaded into a standard [[automatic document feeder]] (ADF) and scanned using inexpensive and common scanning technology. The method is not suitable for rare or valuable books. There are two technical difficulties with this process, first with the cutting and second with the scanning.

===Unbinding===
More precise and less destructive than cutting pages is to unbind by hand using suitable tools. This technique has been successfully employed for tens of thousands of pages of archival original paper scanned for the Riazanov Library digital archive project from newspapers and magazines and pamphlets, varying from 50 to 100 years old and more, and often composed of fragile, brittle paper. Although the monetary value for some collectors (and for most sellers of this sort of material) is destroyed by unbinding, it in many cases actually greatly assists preservation of the pages, making them more accessible to researchers<ref name="hurix" /> and less likely to be damaged when subsequently examined. A disadvantage is that unbound stacks of pages are "fluffed up", and therefore more exposed to oxygen in the air, which may in some cases speed deterioration. This can be addressed by putting weights on the pages after they are unbound, and storage in appropriate containers.<ref name="hurix" />

Hand unbinding will preserve text that runs into the gutters of bindings, and most critically allows more easy and complete high quality scans to be made of two-page-wide material, such as center cartoons, graphic art, and photos in magazines. The digital archive of ''The Liberator'' 1918-1924 on [[Marxists Internet Archive]] demonstrates the quality of two-page-wide graphic art scans made possible by careful hand unbinding, then scanning.

Unbinding techniques vary with the binding technology, from simply removing a few staples, to unbending and removing nails, to meticulously grinding down layers of glue on the spine of a book to precisely the right point, followed by laborious removal of the string used to hold the book together.

With some newspapers (such as ''Labor Action'' 1950-1952) there are columns on the center of facing pages that run across the pages. Chopping off part of the spine of a bound volume of such papers will lose part of this text. Even the Greenwood Reprint of this publication failed to preserve the text content of those center columns, cutting off significant amounts of text there. Only when bound volumes of the original newspaper were meticulously unbound, and the opened pairs of center pages were scanned as a single page on a flat bed scanner was the center column content made digitally available. Alternatively, one can present the two facing center pages as three scans: one of each individual page, and one of a page sized area situated over the center of the two pages.

===Cutting===
One way of cutting a stack of 500 to 1,000 pages in one pass is to use a guillotine [[paper cutter]], a large steel table with a paper [[vise]] that screws down onto the stack and firmly secures it before cutting.<ref name="kitaboo" /> A large sharpened steel blade which moves straight down cuts the entire length of each sheet in one operation. A lever on the blade permits several hundred pounds of force to be applied to the blade for a quick one-pass cut.

A clean cut through a thick stack of paper cannot be made with a traditional inexpensive sickle-shaped hinged [[paper cutter]]. These cutters are only intended for a few sheets, with up to ten sheets being the practical cutting limit. A large stack of paper applies torsional forces on the hinge, pulling the blade away from the cutting edge on the table. The cut becomes more inaccurate as the cut moves away from the hinge, and the force required to hold the blade against the cutting edge increases as the cut moves away from the hinge.

The guillotine cutting process dulls the blade over time, requiring that it be resharpened. [[Coated paper]] such as slick magazine paper dulls the blade more quickly than plain book paper, due to the [[kaolinite]] [[clay]] coating. Additionally, removing the binding of an entire hardcover book causes excessive wear due to cutting through the cover's stiff backing material. Instead the outer cover can be removed and only interior pages need be cut.

An alternate method of unbinding books is to use a table saw.  While this method is potentially dangerous and does not leave as smooth an edge as the guillotine paper cutter method, it is more readily available to the average person.  The ideal method is to clamp the book between two thick boards using heavy machine screws to provide the clamping force.  The entire wood and book package is fed through the table saw using the rip fence as a guide.  A sharp fine carbide tooth blade is ideal for generating an acceptable cut.  The quality of the cut depends on the blade, feed rate, type of paper, paper coating, and binding material.

===Scanning===
[[File:A Real Page-Turner.jpg|thumb|Turning the pages in between taking scans|left]]
Once the paper is liberated from the spine, it can be scanned one sheet at a time using a [[flatbed scanner]] or [[automatic document feeder]] (ADF).

Pages with a decorative riffled edging or curving in an arc due to a non-flat binding can be difficult to scan using an ADF, as they are designed to scan pages of uniform shape and size, and variably sized or shaped pages can lead to improper scanning. The riffled edges or curved edge can be guillotined off to render the outer edges flat and smooth before the binding is cut.

The coated paper of magazines and bound textbooks can make them difficult for the rollers in an ADF to pick up and guide along the paper path. An ADF which uses a series of rollers and channels to flip sheets over may jam or misfeed when fed coated paper. Generally there are fewer problems by using as straight of a paper path as is possible, with few bends and curves. The clay can also rub off the paper over time and coat sticky pickup rollers, causing them to loosely grip the paper. The ADF rollers may need periodic cleaning to prevent this slipping.

Magazines can pose a bulk-scanning challenge due to small nonuniform sheets of paper in the stack, such as magazine subscription cards and fold out pages. These need to be removed before the bulk scan begins, and are either scanned separately if they include worthwhile content, or are simply left out of the scan process.

==Non-destructive scanning==

[[File:MyBookScanner,June12,2011.JPG|thumb|An example of a DIY non-destructive book scanner/digitizer, with the book downwards design, allowing gravity to flatten pages]]
Software driven machines and robots have been developed to scan books without the need of unbinding them in order to preserve both the contents of the document and create a digital image archive of its current state. This recent trend has been due in part to ever improving imaging technologies that allow a high quality digital archive image to be captured with little or no damage to a rare or fragile book in a reasonably short period of time.

The first fully automated book scanner was the DL (Digitizing Line) scanner, manufactured by 4DigitalBooks in Switzerland. The first known installation was at Stanford University in 2001.<ref>((cite web|last=Davies|first=John|title=4DigitalBooks launches digital book scanner|url=http://www.printweek.com/print-week/news/1107446/4digitalbooks-launches-digital-book-scanner|publisher=PrintWeek))</ref><ref>((cite web|title=Stanford University Libraries (SUL) Robotic Book Scanner|url=https://web.stanford.edu/dept/SUL/library/prod//depts/diroff/DLStatement.html|publisher=Stanford University Libraries (SUL)))</ref> The scanner received a Dow Jones Runner-Up award under Business Applications Category in 2001.<ref>((cite web|title=Technology Innovation Awards: Winners 2001|url=http://www.dowjones.com/innovation/ei_winners_2001.html|publisher=Dow Jones|access-date=2017-08-07|archive-url=https://web.archive.org/web/20150923220139/http://www.dowjones.com/innovation/ei_winners_2001.html|archive-date=2015-09-23|url-status=dead))</ref>
[[File:ET Series Book Scanner.png|thumb|right|Non-destructive book scanner with Curve Flattening Technology]]

[[File:Robotický knižní scanner.webm|Video of the robotic book scanner DL mini|thumb|left]]

In 2007 the company [[TREVENTUS]] presented an automated book scanner with a book opening angle for scanning of 60°. Which is an improvement in the area of conservation of the books during scanning. The company was awarded with the European Union "ICT Grand Prize 2007",<ref>((Cite web|url=http://europa.eu/rapid/press-release_IP-07-339_en.htm|title=European Commission - PRESS RELEASES - Press release - British, Swedish and Austrian entrepreneurs win the EU's "Nobel prize" for ICT|website=europa.eu|access-date=2019-06-04))</ref><ref>((cite web|title=Treventus ICT Grand price 2007|url=http://www.treventus.com/company.html|publisher=Treventus))</ref> for its development of the ScanRobot. This technology was also used in a mass digitization project from the Bavarian State Library<ref>((cite web|title=Bavarian State Library VD16 project|url=http://www.treventus.com/downloads/customer_report_BSB_20091109.pdf|publisher=Treventus|access-date=2019-06-04|archive-url=https://web.archive.org/web/20160708160245/http://www.treventus.com/downloads/customer_report_BSB_20091109.pdf|archive-date=2016-07-08|url-status=dead))</ref> where 8,900 books from the 16th century were digitized in 18 months using three of these v-shape scanners.

[[File:ScanRobot.jpg|thumb|ScanRobot automated scanner with 60° opening angle]]

Indus International, Inc, based in [[West Salem, Wisconsin]], produces scanners which were bought by some US entities for services such as [[interlibrary loan]].<ref>((cite web|access-date=2020-05-21|title=Meet the Library's New Scanner|url=https://blogs.hope.edu/library/library-highlights/meet-the-librarys-new-scanner/|date=2012-09-06|author=Hope College))</ref>

Most high-end commercial robotic scanners use air and [[suction]] technology, while some use newer approaches such as bionic fingers for turning pages. Some scanners take advantage of [[Ultrasonic sensor|ultrasonic]] or [[photoelectric sensor]]s to detect dual pages and prevent skipping of pages.<ref name="hurix" /><ref name="kitaboo" /> With reports of machines being able to scan up to 2,900 pages per hour,<ref>((cite web|last=Rapp|first=David|title=Product Watch: Library Scanners|url=http://lj.libraryjournal.com/2011/06/industry-news/product-watch-library-scanners/#|publisher=Library Journal|access-date=11 May 2014))</ref> robotic book scanners are specifically designed for large-scale digitization projects.<ref name="hurix" />

Google's patent 7508978 shows an [[infrared]] camera technology which allows detection and automatic adjustment of the three-dimensional shape of the page.<ref>((cite patent |inventor1-last=Lefevere |inventor1-first=Francois-Marie |inventor2-last=Saric |inventor2-first=Marin |title=Detection of grooves in scanned images |issue-date=March 24, 2009 |fdate=September 13, 2004 |country-code=US |patent-number=7508978 |assign1=Google))</ref><ref>''[https://www.npr.org/blogs/library/2009/04/the_granting_of_patent_7508978.html The Secret Of Google's Book Scanning Machine Revealed]'', by Maureen Clements, April 30, 2009.</ref> Researchers from the University of Tokyo have an experimental non-destructive book scanner<ref>((cite web|last=Guizzo |first=Erico |url=https://spectrum.ieee.org/automaton/robotics/robotics-software/book-flipping-scanning |title="Superfast Scanner Lets You Digitize Book By Flipping Pages", IEEE Spectrum, March 17, 2010 |publisher=Spectrum.ieee.org |date=2010-03-17 |access-date=2014-08-08))</ref> that includes a 3D surface scanner to allow images of a curved page to be straightened in software. Thus the book or magazine can be scanned as quickly as the operator can flip through the pages, about 200 [[pages per minute]].

There are techniques to minimise and to correct for distortion in the page gutter.<ref>((Cite report|url=https://www.tinaja.com/glib/gutter01.pdf|title=Some Possible Book Scanning "Gutter Math"|last=Lancaster|first=Don|date=December 2009|publisher=Synergetics))</ref>

==See also==

*[[Digital library]]
*[[Institutional repository]]
*[[Optical character recognition]]
*[[Planetary scanner]]
*[[Europeana]]

==References==
((Reflist|30em))

==External links==
((commons category|Book scanners))
*[http://www.diybookscanner.org/ Do It Yourself book scanner device forum]
*[https://code.google.com/archive/p/linear-book-scanner/ Google Open Source Linear Book Scanner]
*[https://www.youtube.com/watch?v=RdLcrNeWjIs Stanford University video] shows some book scanning
*[http://www.k2.t.u-tokyo.ac.jp/vision/BFS-Auto/ University of Tokyo] high speed scanner

((Books))

[[Category:Book terminology]]
[[Category:Digital libraries]]
[[Category:Publishing]]

Pages transcluded onto the current version of this page (help):

Book scanning (edit)
Template:According to whom (view source) (template editor protected)
Template:Ambox (view source) (template editor protected)
Template:Books (view source) (semi-protected)
Template:Category handler (view source) (protected)
Template:Citation/make link (view source) (protected)
Template:Citation/styles.css (view source) (template editor protected)
Template:Citation needed (view source) (protected)
Template:Cite journal (view source) (protected)
Template:Cite news (view source) (protected)
Template:Cite patent (view source) (template editor protected)
Template:Cite patent/authors (view source) (extended confirmed protected)
Template:Cite patent/core (view source) (template editor protected)
Template:Cite report (view source) (template editor protected)
Template:Cite web (view source) (protected)
Template:Commons category (view source) (template editor protected)
Template:Delink (view source) (protected)
Template:Find sources mainspace (view source) (template editor protected)
Template:Fix (view source) (protected)
Template:Fix-span (view source) (template editor protected)
Template:Fix/category (view source) (protected)
Template:Hlist/styles.css (view source) (protected)
Template:Icon (view source) (template editor protected)
Template:Main other (view source) (protected)
Template:More citations needed (view source) (template editor protected)
Template:More citations needed section (view source) (template editor protected)
Template:Multiple issues (view source) (template editor protected)
Template:Multiple issues/styles.css (view source) (template editor protected)
Template:Navbox (view source) (template editor protected)
Template:Nowrap (view source) (protected)
Template:Original research (view source) (template editor protected)
Template:Pagetype (view source) (protected)
Template:Plainlist/styles.css (view source) (protected)
Template:Reflist (view source) (protected)
Template:Reflist/styles.css (view source) (protected)
Template:Replace (view source) (protected)
Template:SDcat (view source) (protected)
Template:Short description (view source) (protected)
Template:Short description/lowercasecheck (view source) (protected)
Template:Side box (view source) (template editor protected)
Template:Sister project (view source) (template editor protected)
Template:Sp (view source) (template editor protected)
Template:Unreferenced (view source) (template editor protected)
Module:Arguments (view source) (protected)
Module:Category handler (view source) (protected)
Module:Category handler/blacklist (view source) (protected)
Module:Category handler/config (view source) (protected)
Module:Category handler/data (view source) (protected)
Module:Category handler/shared (view source) (protected)
Module:Check for unknown parameters (view source) (protected)
Module:Citation/CS1 (view source) (protected)
Module:Citation/CS1/COinS (view source) (protected)
Module:Citation/CS1/Configuration (view source) (protected)
Module:Citation/CS1/Date validation (view source) (protected)
Module:Citation/CS1/Identifiers (view source) (protected)
Module:Citation/CS1/Utilities (view source) (protected)
Module:Citation/CS1/Whitelist (view source) (protected)
Module:Citation/CS1/styles.css (view source) (protected)
Module:DecodeEncode (view source) (template editor protected)
Module:Delink (view source) (protected)
Module:Disambiguation/templates (view source) (protected)
Module:Find sources (view source) (template editor protected)
Module:Find sources/config (view source) (template editor protected)
Module:Find sources/links (view source) (template editor protected)
Module:Find sources/templates/Find sources mainspace (view source) (template editor protected)
Module:Icon (view source) (template editor protected)
Module:Icon/data (view source) (template editor protected)
Module:Message box (view source) (protected)
Module:Message box/ambox.css (view source) (protected)
Module:Message box/configuration (view source) (protected)
Module:Namespace detect/config (view source) (protected)
Module:Namespace detect/data (view source) (protected)
Module:Navbar (view source) (protected)
Module:Navbar/configuration (view source) (protected)
Module:Navbar/styles.css (view source) (protected)
Module:Navbox (view source) (template editor protected)
Module:Navbox/configuration (view source) (template editor protected)
Module:Navbox/styles.css (view source) (template editor protected)
Module:Pagetype (view source) (protected)
Module:Pagetype/config (view source) (protected)
Module:Pagetype/disambiguation (view source) (protected)
Module:Pagetype/rfd (view source) (template editor protected)
Module:Pagetype/setindex (view source) (protected)
Module:Pagetype/softredirect (view source) (protected)
Module:SDcat (view source) (protected)
Module:Side box (view source) (template editor protected)
Module:Side box/styles.css (view source) (template editor protected)
Module:String (view source) (protected)
Module:Unsubst (view source) (protected)
Module:WikidataIB (view source) (template editor protected)
Module:WikidataIB/nolinks (view source) (template editor protected)
Module:WikidataIB/titleformats (view source) (template editor protected)
Module:Wikitext Parsing (view source) (protected)
Module:Yesno (view source) (protected)

Return to Book scanning.

Retrieved from "https://en.wikipedia.org/wiki/Book_scanning"