##Adobe File Version: 1.000
#=======================================================================
# FTP file name: README.TXT
#
# Contents: Background information on Unicode mapping tables
# for Mac OS text encodings
#
# Copyright: (c) 1995-1999 by Apple Computer, Inc., all rights
# reserved.
#
# Contact: [email protected]
#
# Changes:
#
# b02 1999-Sep-22 Update information on Cyrillic. Update
# contact e-mail address.
# n07 1998-Feb-05 Rewrite to provide additional information
# relevant to using the accompanying mapping
# tables, and to delete some extraneous
# information. Delete Bulgarian (no special
# encoding, uses standard Cyrillic), add
# Farsi, Devanagari, Gurmukhi, Gujarati,
# Celtic, Gaelic, Inuit, Tibetan.
# n04 1995-Nov-15 Update info for Hebrew and Thai
# n03 1995-Apr-15 First version (after fixing some typos).
#
##################
0. Preliminaries
----------------
For maximum interchangeability, this file and the accompanying Mac OS
mapping tables use only ASCII characters. They are intended to be
displayed in a monospaced font.
Apple, the Apple logo, Mac, and Macintosh are trademarks of Apple
Computer, Inc., registered in the United States and other countries.
QuickDraw and TrueType are trademarks of Apple Computer, Inc. Unicode is
a trademark of Unicode Inc. PostScript is a trademark of Adobe Systems
Inc., which may be registered in certain jurisdictions. IBM is a
registered trademark of International Business Machines Corporation. ITC
Zapf Dingbats is a registered trademark of the International Typeface
Corporation. For the sake of brevity, throughout this document and the
accompanying tables, "Macintosh" can be used to refer to Macintosh
computers and "Unicode" can be used to refer to the Unicode standard.
Apple Computer, Inc. ("Apple") makes no warranty or representation,
either express or implied, with respect to this document and the
accompanying tables, their quality, accuracy, or fitness for a
particular purpose. In no event will Apple be liable for direct,
indirect, special, incidental, or consequential damages resulting from
any defect or inaccuracy in this document or the accompanying tables.
1. Introduction
---------------
This document summarizes some Unicode mapping considerations that are
relevant for the accompanying mapping tables. It also provides an
overview of Mac OS encodings.
These mapping tables and character lists are subject to change.
The latest tables should be available from the following:
<ftp://ftp.unicode.org/Public/MAPPINGS/VENDORS/APPLE/>
<ftp://dev.apple.com/devworld/Technical_Documentation/Misc._Standards/>
2. Round-trip fidelity and overview of mapping techniques
---------------------------------------------------------
For a particular set of national and international standards, Unicode
provides round-trip fidelity: Text in one of those encodings can be
mapped to Unicode and back again, yielding the original characters.
Characters which are distinct in one of these source standards have
a distinct counterpart in Unicode. Note that this counterpart might not
be a single Unicode character; as is pointed out in "The Unicode
Standard, Version 2.0" (page 2-10), "sometimes a single code value in
another standard corresponds to a sequence of code values in the Unicode
Standard, or vice versa."
However, Unicode does not attempt to provide round-trip fidelity for
most vendor standards. Nevertheless, Apple and other platform vendors
may need to provide such round-trip fidelity for their current encodings
(this can be important in file systems, for example). In order to do
this, Apple makes use of some Unicode characters in the corporate-use
zone (the upper end of the private use area).
Corporate-zone characters must be used with care. Indiscriminate use of
such characters can result in text which is not easily interchanged with
other systems, since these characters have no standard meaning outside a
particular platform. The mappings provided here are intended to minimize
the use of private use characters, or to use them in such a way that
basic text content will not be lost if the corporate zone characters are
dropped when text is transferred to another system.
The tables provided here have three goals, in the following order of
importance:
1. Provide 100% round-trip mapping from a Mac OS encoding to Unicode
and back (even if the mappings here are converted to maximal
decompositions, see below).
2. Map characters in a Mac OS encoding into the Unicode characters
that best represent the interpretation and usage of the Mac OS
characters.
3. When mapping text in a Mac OS encoding to Unicode using the tables,
the resulting Unicode text should be as interchangeable as possible.
To satisfy these goals, the mappings use a variety of techniques. First
we attempt to achieve round-trip mappings using any standard Unicode
feature at our disposal, without resorting to corporate-zone characters.
This can includes the following techniques:
- Use of all Unicode characters defined in Unicode 2.1, including
compatibility characters.
- Mapping a single character in a Mac OS encoding to a sequence of
standard Unicode characters, or vice versa. This requires grouping
characters into appropriate chunks for lookup before mapping them
(this mainly applies to sequences of Unicode characters).
- Using Unicode direction overrides to force direction attributes when
mapping to Unicode. This requires resolution of Unicode character
direction, and use of this information, when mapping from Unicode back
to certain Mac OS encodings.
The requirements imposed on Unicode handling are necessary for other,
non-transcoding operations in a full Unicode implementation anyway, so
requiring them for transcoding should not impose much of a burden.
Next, if round-trip fidelity cannot be achieved using the above
techniques, we attempt to use corporate-zone characters only as
"transcoding hints" (more on this below). These are combined with one or
more standard Unicode characters to mark them as special for
transcoding, but have no other function and can be deleted with no loss
of basic text content (only of round-trip fidelity).
Finally, if a character in a Mac OS encoding is unrelated to any Unicode
or Unicode sequence, we may map it to a single corporate-zone Unicode
code point.
These techniques are described in more detail in the following sections.
Some clients of these tables may have a different set of goals. For
example, some clients may prefer to avoid compatibility characters,
perhaps sacrificing round-trip fidelity if necessary. In most cases it
is fairly easy to construct other types of mappings from the mappings
given here. In particular, the mappings here have been designed so that
if they are converted to maximal decomposition mappings (by recursive
application of the canonical decompositions in the Unicode database),
the resulting mappings will still provide 100% roundtrip fidelity.
There is one more round-trip issue that should be mentioned. If a
Unicode character or sequence can be mapped at all into a particular
Mac encoding, then the reverse mapping back to Unicode should yield
the original Unicode character or sequence (except for possible
differences in direction overrides or other Unicode characters in the
"Other, Format" category). The tables here also provide this. For a
related issue, see the next section.
3. Mapping tolerance: Strict and loose
--------------------------------------
In many character sets, a single character may have
AbbyyFR15直装版【内含安装和使用教程】

Abbyy FineReader 15是一款强大的光学字符识别(OCR)软件,专为专业人士设计,用于将扫描的文档、图像和PDF转换成可编辑、可搜索的文本。它结合了先进的图像处理和文本识别技术,使得从纸质文档或数字图像中提取文字变得极其便捷。在“AbbyyFR15直装版【内含安装和使用教程】”这个压缩包中,用户可以找到完整的安装程序以及教程,方便快捷地开始使用这款软件。
让我们来了解一下Abbyy FineReader 15的主要功能:
1. OCR识别:该软件的核心功能是其高精度的OCR技术,能够准确识别多种语言的文字,包括中文、英文、日文、韩文等。无论是扫描的文档还是JPEG、PNG等图像格式,都可以被转换为可编辑的Word、Excel、PowerPoint或纯文本文件。
2. PDF编辑:除了识别文字,Abbyy FineReader 15还允许用户直接在PDF文件上进行编辑。你可以添加、删除文本,更改字体和字号,甚至插入图片。对于混合内容的PDF,如包含文本和图像的文件,也可以轻松编辑。
3. PDF转换:用户可以将扫描的PDF或图像PDF转换为可编辑的Word文档,或者选择更专业的布局保持转换,以保留原始文档的版面样式。此外,还可以转换为Excel、HTML、EPUB、eBook等多种格式。
4. 自动化工作流程:对于大量文档处理的需求,Abbyy FineReader 15提供了自动化的工作流程功能。通过设置预定义的任务,可以批量处理文件,大大提高了效率。
5. 文档比较:该软件还可以比较两份文档之间的差异,无论是文本文件还是PDF,都能快速找出并标记出不同之处。
6. 数据提取:在表格或发票等结构化文档中,可以自动识别并提取关键数据,方便导入到数据库或其他系统中。
7. 多语言支持:支持超过200种语言的识别,涵盖了全球大部分地区。
安装教程通常会指导用户如何正确下载、安装和激活Abbyy FineReader 15,确保软件的正常运行。使用教程则会详细解释各项功能的操作步骤,帮助用户快速上手。在学习使用过程中,用户应仔细阅读这些教程,以便充分利用软件的所有功能。
Abbyy FineReader 15是一款功能强大的OCR工具,无论是个人还是企业用户,都能通过它提高工作效率,简化文档处理流程。通过这个直装版,用户可以避免复杂的注册过程,直接体验到软件的强大性能。只需按照压缩包中的安装教程操作,就能轻松开启OCR之旅。

Gisbro
- 粉丝: 372
最新资源
- 电气自动化技术专业教学团队推荐表.doc
- 2023年公共关系学网络终考题库2.doc
- 移动通信技术的发展.doc
- 计算机网络技术专业培养计划.doc
- 商业计划书(上海润金软件有限公司交易助理项目).doc
- 医学统计学第十六章--Logistic回归分析.ppt
- 基于PLC的自动摆饼机控制系统的设计及实现(顾小强).ppt
- 粤教版网络技术应用教材与教学研讨市公开课一等奖百校联赛特等奖课件.pptx
- 互联网金融个体网络借贷资金存管业务规范.docx
- 解读云计算与云数据存储发展趋势技术研究.doc
- 综合布线建设方案.doc
- 基于C51单片机的数字时钟课程设计C语言,带闹钟.doc
- 谭浩强C语言第13章.ppt
- 大学生网络利用调查报告.doc
- 2023年学员做试卷中小学教师融合教育知识网络竞赛.docx
- 互联网项目商业计划书模板.doc