PDF Reference, Second Edition

PDF Reference

second edition

Adobe Portable Document Format

Version 1.3

Adobe Systems Incorporated

ADDISON–WESLEY

Boston • San Francisco • New York • Toronto • Montreal

London • Munich • Paris • Madrid

Capetown • Sydney • Tokyo • Singapore • Mexico City

Library of Congress Cataloging-in-Publication Data

Adobe portable document format, version 1.3 / Adobe Systems Incorporated. — 2nd ed.

p. cm.

Includes bibliographical references and index.

ISBN 0-201-61588-6

1. Text processing (Computer science). 2. Adobe Acrobat. 3. Portable document software. I. Adobe Systems.

QA76.76.T49 A36 2000

005.7′2—dc21

00-040581

NOTICE: All information contained herein is the property of Adobe Systems Incorporated. No part of this publication (whether in hardcopy or electronic form) may be reproduced or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording, or otherwise, without the prior written consent of the publisher.

PostScript is a registered trademark of Adobe Systems Incorporated. All instances of the name PostScript in the text are references to the PostScript language as defined by Adobe Systems Incorporated unless otherwise stated. The name PostScript also is used as a product trademark for Adobe Systems’ implementation of the PostScript language interpreter. Except as otherwise stated, any mention of a “PostScript output device,” “PostScript printer,” “PostScript software,” or similar item refers to a product that contains PostScript technology created or licensed by Adobe Systems Incorporated, not to one that purports to be merely compatible.

Adobe, the Adobe logo, Acrobat, the Acrobat logo, Adobe Garamond, Aldus, Distiller, Extreme, FrameMaker, Illustrator, InDesign, Minion, Myriad, PageMaker, Photoshop, Poetica, and PostScript are trademarks of Adobe Systems Incorporated. Apple, Mac, Macintosh, QuickDraw, and TrueType are trademarks of Apple Computer, Inc., registered in the United States and other countries. ITC Zapf Dingbats is a registered trademark of International Typeface Corporation. Helvetica and Times are registered trademarks of Linotype-Hell AG and/or its subsidiaries. Microsoft and Windows are either registered trademarks or trademarks of Microsoft Corporation in the United States and/or other countries. Times New Roman is a trademark of The Monotype Corporation registered in the U.S. Patent and Trademark Office and may be registered in certain other jurisdictions. Ryumin Light is a trademark of Morisawa & Co., Ltd. UNIX is a registered trademark of The Open Group. PANTONE is a registered trademark and Hexachrome is a trademark of Pantone, Inc. QuarkXPress is a trademark of Quark, Inc. and/or certain of the Quark Affiliated Companies, Reg. U.S. Pat. & Tm. Off. and in many other countries. Unicode is a registered trademark of Unicode, Inc. All other trademarks are the property of their respective owners.

This publication and the information herein are furnished AS IS, are subject to change without notice, and should not be construed as a commitment by Adobe Systems Incorporated. Adobe Systems Incorporated assumes no responsibility or liability for any errors or inaccuracies, makes no warranty of any kind (express, implied, or statutory) with respect to this publication, and expressly disclaims any and all warranties of merchantability, fitness for particular purposes, and noninfringement of third-party rights.

1 2 3 4 5 6 7 8 9-MA-0403020100

First printing, July 2000

Contents

Preface

Chapter 1: Introduction

1.1 About This Book 1

1.2 Introduction to PDF 1.3 Features 3

1.3 Related Publications 4

1.4 Copyright Permission 5

Chapter 2: Overview

2.1 Imaging Model 8

2.2 Other General Properties 12

2.3 Using PDF 17

2.4 PDF and the PostScript Language 19

Chapter 3: Syntax

3.1 Lexical Conventions 22

3.2 Objects 25

3.3 Details of Filtered Streams 41

3.4 File Structure 55

3.5 Encryption 64

3.6 Document Structure 71

3.7 Content Streams and Resources 82

3.8 Common Data Structures 86

3.9 Functions 95

3.10 File Specifications 107

Chapter 4: Graphics 119

4.1 Graphics Objects 120

4.2 Coordinate Systems 124

4.3 Graphics State 134

4.4 Path Construction and Painting 147

4.5 Color Spaces 157

4.6 Patterns 200

4.7 External Objects 243

4.8 Images 244

4.9 Form XObjects 263

4.10 PostScript XObjects 267

Chapter 5: Fonts 269

5.1 Organization and Use of Fonts 269

5.2 Text State Parameters and Operators 278

5.3 Text Objects 285

5.4 Introduction to Font Data Structures 291

5.5 Simple Fonts 293

5.6 Composite Fonts 310

5.7 Font Descriptors 330

5.8 Embedded Font Programs 339

5.9 ToUnicode CMaps 342

Chapter 6: Rendering 347

6.1 CIE-Based Color to Device Color 348

6.2 Conversions among Device Color Spaces 350

6.3 Transfer Functions 354

6.4 Halftones 356

6.5 Scan Conversion Details 377

Chapter 7: Interactive Features 383

7.1 Viewer Preferences 383

7.2 Document-Level Navigation 384

7.3 Page-Level Navigation 391

7.4 Annotations 398

7.5 Actions 420

7.6 Interactive Forms 434

7.7 Sounds 468

7.8 Movies 470

Chapter 8: Document Interchange 473

8.1 Procedure Sets 473

8.2 Document Information Dictionary 474

8.3 File Identifiers 476

8.4 Application Data 477

8.5 Web Capture 507

8.6 Prepress Support 524

Appendix A: Operator Summary 539

Appendix B: Operators in Type 4 Functions 543

B.1 Arithmetic Operators 543

B.2 Relational, Boolean, and Bitwise Operators 544

B.3 Conditional Operators 544

B.4 Stack Operators 544

Appendix C: Implementation Limits 545

C.1 General Implementation Limits 546

C.2 Implementation Limits Affecting Web Capture 547

Appendix D: Character Sets and Encodings 549

D.1 Latin Character Set and Encodings 551

D.2 Expert Set and MacExpertEncoding 555

D.3 Symbol Set and Encoding 558

D.4 ZapfDingbats Set and Encoding 561

Appendix E: PDF Name Registry 563

Appendix F: Linearized PDF 565

F.1 Background and Assumptions 567

F.2 Linearized PDF Document Structure 569

F.3 Hint Tables 581

F.4 Access Strategies 591

Appendix G: Example PDF Files 597

G.1 Minimal PDF File 597

G.2 Simple Text String Example 600

G.3 Simple Graphics Example 602

G.4 Page Tree Example 605

G.5 Outline Tree Example 610

G.6 Updating Example 614

Appendix H: Compatibility and Implementation Notes 623

H.1 PDF Version Numbers 624

H.2 Dictionary Keys 625

H.3 Implementation Notes 625

Bibliography 643

Index 649

Figures

2.1 Creating PDF files using PDF Writer 18

2.2 Creating PDF files using Acrobat Distiller 19

3.1 PDF components 22

3.2 Initial structure of a PDF file 55

3.3 Structure of an updated PDF file 63

3.4 Structure of a PDF document 72

3.5 Inheritance of attributes 81

3.6 Mapping with the Decode array 101

4.1 Graphics objects 122

4.2 Device space 125

4.3 User space 127

4.4 Relationships among coordinate systems 129

4.5 Effects of coordinate transformations 130

4.6 Effect of transformation order 131

4.7 Miter length 141

4.8 Cubic Bézier curve generated by the operator 151

4.9 Cubic Bézier curves generated by the and operators 151

4.10 Nonzero winding number rule 155

4.11 Even-odd rule 156

4.12 Color specification 158

4.13 Color rendering 159

4.14 Component transformations in a CIE-based ABC color space 166

4.15 Component transformations in a CIE-based color space 167

4.16 Quadtone image using Indexed DeviceN 193

4.17 Output from Example 4.19 208

4.18 Output from Example 4.20 212

4.19 Radial shading 224

4.20 Starting a new triangle in a free-form Gouraud-shaded triangle mesh 227

4.21 Connecting triangles in a free-form Gouraud-shaded triangle mesh 228

4.22 Varying the value of the edge flag to create different shapes 229

4.23 Lattice-form triangular meshes 230

4.24 Coordinate mapping from a unit square to a four-sided Coons patch 233

4.25 Painted area and boundary of a Coons patch 234

4.26 Color values and edge flags in Coons patch meshes 236

4.27 Edge connections in a Coons patch mesh 237

4.28 Control points in a tensor-product mesh 239

4.29 Typical sampled image 244

4.30 Source image coordinate system 247

4.31 Mapping the source image 248

5.1 Glyphs painted in 50% gray 273

5.2 Glyph outlines treated as a stroked path 274

5.3 Graphics clipped by a glyph path 275

5.4 Glyph metrics 276

5.5 Metrics for horizontal and vertical writing modes 278

5.6 Character spacing in horizontal writing 281

5.7 Word spacing in horizontal writing 281

5.8 Horizontal scaling 282

5.9 Leading 282

5.10 Text rise 285

5.11 Operation of operator in horizontal writing 289

5.12 Output from Example 5.9 304

5.13 Characteristics represented in the Flags entry of a font descriptor 333

6.1 Various halftoning effects 363

6.2 Halftone cell with a nonzero angle 369

6.3 Angled halftone cell divided into two squares 370

6.4 Halftone cell and two squares tiled across device space 370

6.5 Tiling of device space in a type 16 halftone 372

6.6 Flatness tolerance 378

6.7 Rasterization without stroke adjustment 381

7.1 Presentation timing 397

7.2 Open annotation 399

7.3 Coordinate adjustment with the NoRotate flag 404

7.4 Square and circle annotations 413

7.5 QuadPoints specification 414

7.6 FDF file structure 461

8.1 Simple Web Capture file structure 510

8.2 Complex Web Capture file structure 511

8.3 Page boundaries 526

8.4 Trapping example 529

G.1 Visual representation of Example G.3 603

G.2 Page tree for 62-page document 605

G.3 Document outline as displayed in Example G.5 610

G.4 Document outline as displayed in Example G.6 612

Tables

3.1 White-space characters 24

3.2 Escape sequences in literal strings 28

3.3 Examples of literal names using the # character 31

3.4 Entries common to all stream dictionaries 35

3.5 Standard filters 38

3.6 Typical LZW encoding sequence 45

3.7 Optional parameters for LZWDecode and FlateDecode filters 47

3.8 Predictor values 48

3.9 Optional parameters for the CCITTFaxDecode filter 51

3.10 Optional parameter for the DCTDecode filter 53

3.11 Entries in the trailer dictionary 61

3.12 Entries common to all encryption dictionaries 64

3.13 Additional encryption dictionary entries for the standard security

handler 68

3.14 User password access privileges 68

3.15 Entries in the catalog dictionary 73

3.16 Required entries in a page tree node 76

3.17 Entries in a page object 77

3.18 Entries in the name dictionary 81

3.19 Compatibility operators 84

3.20 Entries in a resource dictionary 85

3.21 PDF data types 87

3.22 Entries in a name tree node dictionary 91

3.23 Example of a name tree 92

3.24 Entries in a number tree node dictionary 95

3.25 Entries common to all function dictionaries 97

3.26 Additional entries specific to a type 0 function dictionary 98

3.27 Additional entries specific to a type 2 function dictionary 102

3.28 Additional entries specific to a type 3 function dictionary 103

3.29 Operators in type 4 functions 105

3.30 Examples of file specifications 110

3.31 Entries in a file specification dictionary 111

3.32 Additional entries in an embedded file stream dictionary 113

3.33 Entries in an embedded file parameter dictionary 113

3.34 Entries in a Macintosh-specific file information dictionary 114

4.1 Operator categories 123

4.2 Device-independent parameters of the graphics state 135

4.3 Device-dependent parameters of the graphics state 136

4.4 Line cap styles 139

4.5 Line join styles 140

4.6 Examples of line dash patterns 142

4.7 Graphics state operators 142

4.8 Entries in a graphics state parameter dictionary 144

4.9 Path construction operators 149

4.10 Path-painting operators 152

4.11 Clipping path operators 156

4.12 Color space families 161

4.13 Entries in a CalGray color space dictionary 168

4.14 Entries in a CalRGB color space dictionary 169

4.15 Entries in a Lab color space dictionary 172

4.16 Entries in an ICC profile stream dictionary 174

4.17 ICC profile types 175

4.18 Ranges for typical ICC color spaces 175

4.19 Rendering intents 180

4.20 Entry in a DeviceN color space attributes dictionary 189

4.21 Color operators 198

4.22 Entries in a type 1 pattern dictionary 203

4.23 Entries in a type 2 pattern dictionary 213

4.24 Shading operator 214

4.25 Entries common to all shading dictionaries 216

4.26 Additional entries specific to a type 1 shading dictionary 219

4.27 Additional entries specific to a type 2 shading dictionary 220

4.28 Additional entries specific to a type 3 shading dictionary 222

4.29 Additional entries specific to a type 4 shading dictionary 226

4.30 Additional entries specific to a type 5 shading dictionary 231

4.31 Additional entries specific to a type 6 shading dictionary 235

4.32 Data values in a Coons patch mesh 238

4.33 Data values in a tensor-product patch mesh 242

4.34 XObject operator 243

4.35 Entries in an image dictionary 249

4.36 Default Decode arrays 254

4.37 Entries in an alternate image dictionary 255

4.38 In-line image operators 260

4.39 Entries in an in-line image object 261

4.40 Additional abbreviations in an in-line image object 261

4.41 Entries in a type 1 form dictionary 264

4.42 Entries in a PostScript XObject dictionary 267

5.1 Text state parameters 279

5.2 Text state operators 280

5.3 Text rendering modes 284

5.4 Text object operators 286

5.5 Text-positioning operators 287

5.6 Text-showing operators 289

5.7 Font types 292

5.8 Entries in a Type 1 font dictionary 294

5.9 Entries in a Type 3 font dictionary 300

5.10 Type 3 font operators 303

5.11 Entries in an encoding dictionary 307

5.12 Entries in a CIDSystemInfo dictionary 314

5.13 Entries in a CIDFont dictionary 314

5.14 Predefined CJK CMap names 320

5.15 Entries in a CMap dictionary 322

5.16 Entries in a Type 0 font dictionary 327

5.17 Entries common to all font descriptors 330

5.18 Font flags 332

5.19 Additional font descriptor entries for CIDFonts 335

5.20 Character classes in CJK fonts 337

5.21 Embedded font organization for various font types 339

5.22 Additional entries in a FontFile stream dictionary 340

6.1 Predefined spot functions 359

6.2 PDF halftone types 365

6.3 Entries in a type 1 halftone dictionary 366

6.4 Entries in a stream dictionary for a type 6 halftone 368

6.5 Entries in a stream dictionary for a type 10 halftone 371

6.6 Entries in a stream dictionary for a type 16 halftone 373

6.7 Entries in a type 5 halftone dictionary 374

7.1 Entries in a viewer preferences dictionary 383

7.2 Destination syntax 386

7.3 Entries in an outline dictionary 388

7.4 Entries in an outline item dictionary 389

7.5 Entries in a page label dictionary 393

7.6 Entries in a thread dictionary 394

7.7 Entries in a bead dictionary 394

7.8 Entries in a transition dictionary 396

7.9 Entries common to all annotation dictionaries 400

7.10 Annotation flags 402

7.11 Entries in a border style dictionary 405

7.12 Entries in an appearance dictionary 406

7.13 Annotation types 408

7.14 Additional entries specific to a text annotation 409

7.15 Additional entries specific to a link annotation 410

7.16 Additional entries specific to a free text annotation 411

7.17 Additional entries specific to a line annotation 412

7.18 Additional entries specific to a square or circle annotation 413

7.19 Additional entries specific to markup annotations 414

7.20 Additional entries specific to a rubber stamp annotation 415

7.21 Additional entries specific to an ink annotation 416

7.22 Additional entries specific to a pop-up annotation 416

7.23 Additional entries specific to a file attachment annotation 417

7.24 Additional entries specific to a sound annotation 418

7.25 Additional entries specific to a movie annotation 418

7.26 Additional entries specific to a widget annotation 419

7.27 Entries common to all action dictionaries 421

7.28 Entries in an additional-actions dictionary 422

7.29 Action types 424

7.30 Additional entries specific to a go-to action 425

7.31 Additional entries specific to a remote go-to action 426

7.32 Additional entries specific to a launch action 426

7.33 Windows-specific launch parameters 427

7.34 Additional entries specific to a thread action 428

7.35 Additional entries specific to a URI action 429

7.36 Entry in a URI dictionary 430

7.37 Additional entries specific to a sound action 430

7.38 Additional entries specific to a movie action 431

7.39 Additional entries specific to a hide action 432

7.40 Named actions 433

7.41 Additional entries specific to named actions 433

7.42 Entries in the interactive form dictionary 435

7.43 Signature flags 436

7.44 Entries common to all field dictionaries 437

7.45 Field flags common to all field types 438

7.46 Additional entries common to all fields containing variable text 439

7.47 Entries in an appearance characteristics dictionary 442

7.48 Field flags specific to button fields 444

7.49 Field flags specific to text fields 448

7.50 Additional entry specific to a text field 448

7.51 Field flags specific to choice fields 450

7.52 Additional entries specific to a choice field 450

7.53 Entries in a signature dictionary 452

7.54 Additional entries specific to a submit-form action 454

7.55 Flags for submit-form actions 455

7.56 Additional entries specific to a reset-form action 457

7.57 Flag for reset-form actions 458

7.58 Additional entries specific to an import-data action 458

7.59 Additional entries specific to a JavaScript action 459

7.60 Entry in an FDF trailer dictionary 463

7.61 Entry in an FDF catalog dictionary 463

7.62 Entries in an FDF dictionary 463

7.63 Entries in an FDF field dictionary 464

7.64 Entries in an icon fit dictionary 466

7.65 Entries in an FDF page dictionary 467

7.66 Entries in an FDF template dictionary 467

7.67 Entries in an FDF named page reference dictionary 468

7.68 Additional entry for annotation dictionaries in an FDF file 468

7.69 Additional entries specific to a sound object 469

7.70 Entries in a movie dictionary 471

7.71 Entries in a movie activation dictionary 471

8.1 Predefined procedure sets 474

8.2 Entries in a document information dictionary 475

8.3 Entries in a page-piece dictionary 478

8.4 Entries in an application data dictionary 478

8.5 Marked-content operators 480

8.6 Entries in the structure tree root 486

8.7 Entries in a structure element dictionary 487

8.8 Entries in a marked-content reference dictionary 490

8.9 Entries in an object reference dictionary 494

8.10 Additional dictionary entry for structure element access 497

8.11 Entry common to all attribute objects 500

8.12 Entries in a Web Capture information dictionary 508

8.13 Entries common to all content sets 515

8.14 Additional entries specific to a page set 516

8.15 Additional entries specific to an image set 517

8.16 Entries in a source information dictionary 518

8.17 Entries in a URL alias dictionary 519

8.18 Entries in a command dictionary 520

8.19 Web Capture command flags 521

8.20 Entries in a command settings dictionary 522

8.21 Entries in a separation dictionary 528

8.22 Additional entries specific to a trap network annotation 531

8.23 Additional entries specific to a trap network appearance stream 532

8.24 Entry in an OPI version dictionary 534

8.25 Entries in a version 1.3 OPI dictionary 534

8.26 Entries in a version 2.0 OPI dictionary 537

A.1 PDF content stream operators 539

C.1 Architectural limits 546

D.1 Latin-text encodings 550

F.1 Linearization parameters 572

F.2 Standard hint tables 576

F.3 Page offset hint table, header section 583

F.4 Page offset hint table, per-page entry 584

F.5 Shared objects hint table, header section 586

F.6 Shared objects hint table, shared object group entry 587

F.7 Thumbnails hint table, header section 588

F.8 Thumbnails hint table, per-page entry 589

F.9 Generic hint table 590

F.10 Interactive form or structure hint table 590

G.1 Objects in minimal example 598

G.2 Objects in simple text string example 600

G.3 Objects in simple graphics example 602

G.4 Object use after adding four text annotations 615

G.5 Object use after deleting two text annotations 618

G.6 Object use after adding three text annotations 620

H.1 Abbreviations for standard filter names 627

H.2 Acrobat behavior with unknown filters 627

Preface

The origins of the Portable Document Format and the Adobe Acrobat product family date to early 1990. At that time, the PostScript page description language was rapidly becoming the worldwide standard for the production of the printed page. PDF builds on the PostScript page description language by layering a document structure and interactive navigation features on PostScript’s underlying imaging model, providing a convenient, efficient mechanism enabling documents to be reliably viewed and printed anywhere.

The PDF specification was first published at the same time the first Acrobat products were introduced in 1993. Since then, updated versions of the specification have been and continue to be available from Adobe via the World Wide Web. This book is the first version of the specification that is completely self-contained, including the precise documentation of the underlying imaging model from PostScript along with the PDF-specific features that are combined in version 1.3 of the PDF standard.

Over the past seven years, aided by the explosive growth of the Internet, PDF has become the de facto standard for the electronic exchange of documents. Well over 100 million copies of the Acrobat Reader application have been distributed around the world, facilitating efficient electronic access to and sharing of information. In addition, PDF is now the industry standard for the intermediate representation of printed material in electronic prepress systems for conventional printing applications. As major corporations, government agencies, and educational institutions streamline their operations by replacing paper-based workflow with electronic exchange of information, the impact and opportunity for the application of PDF will continue to grow at a rapid pace.

Adobe offers a collection of PDF-based applications, the Adobe Acrobat products, that provide a broad range of capabilities for its customers. Adobe Acrobat provides the basic tools to create and enhance documents prepared by essentially any software product on the popular operating system platforms. The Acrobat Reader, available free of charge for downloading from myriad Web sites (including Adobe.com), is frequently bundled with consumer products to provide paperless documentation that customers can view on-line or print to paper. Acrobat Capture converts paper documents into PDF format, using state-of-the-art character recognition combined with a highly compressed representation of graphics, enabling the conversion of legacy information into electronic form. A significant number of third-party developers and systems integrators offer customized enhancements and extensions to the core family of products.

The emergence of PDF as a de facto standard for electronic information exchange is the result of concerted effort by many individuals in both the private and public sectors. Without the dedication of Adobe employees, our industry partners, and our customers, the widespread acceptance of PDF could not have been achieved. We thank all of you for your continuing support and creative contributions to the success of PDF.

Chuck Geschke and John Warnock

March 2000

CHAPTER 1

Introduction

This book describes the Adobe Portable Document Format (PDF), the native file format of the Adobe Acrobat family of products. The goal of these products is to enable users to exchange and view electronic documents easily and reliably, independently of the environment in which they were created. PDF relies on the imaging model of the PostScript page description language to describe text and graphics in a device-independent and resolution-independent manner. To improve performance for interactive viewing, PDF defines a more structured format than that used by most PostScript language programs. PDF also includes objects, such as annotations and hypertext links, that are not part of the page itself but are useful for interactive viewing and document interchange.

1.1 About This Book

This book provides a description of the PDF file format and is intended primarily for application developers wishing to develop PDF generator applications that create PDF files directly. It also contains enough information to allow developers to write PDF consumer applications that read existing PDF files and interpret or modify their contents.

Although the PDF specification is independent of any particular software implementation, some PDF features are best explained by describing the way they are processed by a typical application program. In such cases, this book uses the Adobe Acrobat family of PDF viewer applications as its model. (The prototypical viewer is the fully capable Acrobat product, not the limited Acrobat Reader product.) Similarly, Appendix C discusses some implementation limits in the Acrobat viewer applications, even though these limits are not part of the file format itself. To provide guidance to implementors of PDF generator and consumer applications, implementation notes in Appendix H describe the behavior of Acrobat viewer applications when they encounter newer features they do not understand.

• Chapter 2, “Overview,” briefly introduces the overall architecture of PDF and the design considerations behind it, compares it with the PostScript language, and describes the underlying imaging model that they share.

• Chapter 3, “Syntax,” presents the syntax of PDF at the object, file, and document level. It sets the stage for subsequent chapters, which describe how that information is interpreted as page descriptions, interactive navigational aids, and application-level logical structure.

• Chapter 4, “Graphics,” describes the graphics operators used to describe the appearance of pages in a PDF document.

• Chapter 5, “Fonts,” discusses PDF’s special facilities for presenting text in the form of character shapes, or glyphs, defined by fonts.

• Chapter 6, “Rendering,” considers how device-independent content descriptions are matched to the characteristics of a particular output device.

• Chapter 7, “Interactive Features,” describes those features of PDF that allow a user to interact with a document on the screen, using the mouse and keyboard.

• Chapter 8, “Document Interchange,” shows how PDF documents can incorporate higher-level information that is useful for the interchange of documents among applications.

• Appendix A, “Operator Summary,” lists all the operators used in describing the visual content of a PDF document.

• Appendix B, “Operators in Type 4 Functions,” summarizes the PostScript operators that can be used in PostScript calculator functions, which contain code written in a small subset of the PostScript language.

• Appendix C, “Implementation Limits,” describes typical size and quantity limits imposed by the Acrobat viewer applications.

• Appendix D, “Character Sets and Encodings,” lists the character sets and encodings that are assumed to be predefined in any PDF viewer application.

• Appendix E, “PDF Name Registry,” discusses a registry, maintained for developers by Adobe Systems, that contains private names and formats used by PDF producers or Acrobat plug-in extensions.

• Appendix F, “Linearized PDF,” describes a special form of PDF file organization designed to work efficiently in network environments.

• Appendix G, “Example PDF Files,” presents several examples showing the structure of actual PDF files, ranging from one containing a minimal one-page document to one showing how the structure of a PDF file evolves over the course of several revisions.

• Appendix H, “Compatibility and Implementation Notes,” provides details on the behavior of Acrobat viewer applications and describes how viewer applications should handle PDF files containing features that they do not recognize.

The book concludes with a Bibliography and an Index.

The enclosed CD-ROM contains the entire text of this book in PDF form.

1.2 Introduction to PDF 1.3 Features

This second edition of the PDF Reference describes version 1.3 of the Portable Document Format. (See implementation note 1 in Appendix H.) Throughout the book, information specific to particular versions of PDF is marked with indicators of the form (PDF 1.0), (PDF 1.1), (PDF 1.2), or (PDF 1.3). Features so marked may be new in the indicated version or may have been substantially redefined in that version. Features designated (PDF 1.0) have generally been superseded in later versions; unless otherwise stated, features identified as specific to other versions are understood to be available in later versions as well. PDF viewer applications designed for a specific PDF version generally ignore newer features that they do not recognize.
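
The version rule just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not part of the PDF specification: the function names `parse_version_marker` and `viewer_recognizes` are hypothetical, and the sketch assumes that a viewer recognizes exactly those features whose marker is no later than the PDF version the viewer was designed for. It does not model the caveat that features marked (PDF 1.0) have generally been superseded.

```python
from typing import Tuple


def parse_version_marker(marker: str) -> Tuple[int, int]:
    """Parse a marker such as "(PDF 1.2)" or a bare "1.2" into (major, minor).

    Hypothetical helper for illustration; the marker form is the one
    used in the text, e.g. (PDF 1.0) through (PDF 1.3).
    """
    text = marker.strip().strip("()")
    if text.upper().startswith("PDF"):
        text = text[3:].strip()
    major, minor = text.split(".")
    return int(major), int(minor)


def viewer_recognizes(feature_marker: str, viewer_version: str) -> bool:
    """Return True if a feature with this marker is no newer than the viewer.

    Assumption: a feature introduced in one version remains available in
    later versions, so a simple tuple comparison is enough.
    """
    return parse_version_marker(feature_marker) <= parse_version_marker(viewer_version)


# Features named in this section as new in PDF 1.3.
features = {
    "Smooth shading": "(PDF 1.3)",
    "Digital signatures": "(PDF 1.3)",
}

# A viewer designed for PDF 1.2 would be expected to ignore both.
ignored = [name for name, marker in features.items()
           if not viewer_recognizes(marker, "1.2")]
```

Comparing versions as integer tuples rather than as strings or floats keeps the comparison correct if a minor number ever reaches two digits.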

PDF 1.3 adds support for the new features of the Adobe imaging model embodied in PostScript LanguageLevel 3, as well as other new features, including the following:

• Data structures for efficiently mapping strings and numbers to PDF objects (Sections 3.8.4, “Name Trees,” and 3.8.5, “Number Trees”)

• New types of functions (Section 3.9, “Functions”)

• Embedding of files of any type within a PDF document (Section 3.10.3, “Embedded File Streams”)

• New color spaces: ICCBased and DeviceN (“ICCBased Color Spaces” on page 173 and “DeviceN Color Spaces” on page 186)

• Smooth shading (Section 4.6.3, “Shading Patterns”)

• Alternate representations for a single image (“Alternate Images” on page 255)

• Masked images (Section 4.8.5, “Masked Images”)

• Additional support for CIDFonts (Section 5.6, “Composite Fonts”)

• Enhanced page numbering (Section 7.3.1, “Page Labels”)

• Many new annotation types (Section 7.4.5, “Annotation Types”)

• Digital signatures (“Signature Fields” on page 451)

• Support for JavaScript (“JavaScript Actions” on page 458)

• A facility for representing the logical structure of a document independently of its graphic structure (Section 8.4.3, “Logical Structure”)

• A facility for capturing information from the World Wide Web and converting it to PDF form (Section 8.5, “Web Capture”)

• Information useful in prepress production workflows (Section 8.6, “Prepress Support”)

1.3 Related Publications

PDF and the PostScript page description language share the same underlying Adobe imaging model. A document can be converted straightforwardly between PDF and the PostScript language; the two representations produce the same output when printed. However, PostScript includes a general-purpose programming language framework not present in PDF. The PostScript Language Reference is the comprehensive reference for the PostScript language and its imaging model.

PDF and PostScript support several standard formats for font programs, including Adobe Type 1, CFF (Compact Font Format), TrueType, and CID-keyed fonts. The PDF manifestations of these fonts are documented in this book. However, the specifications for the font files themselves are published separately, because they are highly specialized and are of interest to a different user community. A variety of Adobe publications are available on the subject of font formats, most notably the following:

• Adobe Type 1 Font Format and Adobe Technical Note #5015, Type 1 Font Format Supplement

• Adobe Technical Note #5176, The Compact Font Format Specification

• Adobe Technical Note #5177, The Type 2 Charstring Format

• Adobe Technical Note #5014, Adobe CMap and CID Font Files Specification

See the Bibliography for additional publications related to PDF and the contents

of this book.

1.4 Copyright Permission

The general idea of using an interchange format for electronic documents is in

the public domain. Anyone is free to devise a set of unique data structures and

operators that deﬁne an interchange format for electronic documents. However,

Adobe Systems Incorporated owns the copyright for the particular data struc-

tures and operators and the written speciﬁcation constituting the interchange

format called the Portable Document Format. Thus, these elements of the Port-

able Document Format may not be copied without Adobe’s permission.

Adobe will enforce its copyright. Adobe’s intention is to maintain the integrity of

the Portable Document Format standard. This enables the public to distinguish

between the Portable Document Format and other interchange formats for elec-

tronic documents. However, Adobe desires to promote the use of the Portable

Document Format for information interchange among diverse products and

applications. Accordingly, Adobe gives copyright permission to anyone to:

• Prepare ﬁles whose content conforms to the Portable Document Format

• Write drivers and applications that produce output represented in the Portable

Document Format

• Write software that accepts input in the form of the Portable Document

Format and displays, prints, or otherwise interprets the contents

• Copy Adobe’s copyrighted list of data structures and operators, as well as the

example code and PostScript language function deﬁnitions in the written


speciﬁcation, to the extent necessary to use the Portable Document Format for

the purposes above

The only condition of such copyright permission is that anyone who uses the

copyrighted list of data structures and operators in this way must include an ap-

propriate copyright notice. This limited right to use the copyrighted list of data

structures and operators does not include the right to copy this book, other copy-

righted material from Adobe, or the software in any of Adobe’s products that use

the Portable Document Format, in whole or in part, nor does it include the right

to use any Adobe patents (except as may be permitted by an ofﬁcial Adobe Patent

Clariﬁcation Notice).

CHAPTER 2

Overview

THE ADOBE PORTABLE DOCUMENT FORMAT (PDF) is a ﬁle format for rep-

resenting documents in a manner independent of the application software, hard-

ware, and operating system used to create them and of the output device on

which they are to be displayed or printed. A PDF document consists of a collec-

tion of objects that together describe the appearance of one or more pages, possi-

bly accompanied by additional interactive elements and higher-level application

data. A PDF ﬁle contains the objects making up a PDF document along with asso-

ciated structural information, all represented as a single self-contained sequence

of bytes.

A document’s pages (and other visual elements) may contain any combination of

text, graphics, and images. A page’s appearance is described by a PDF content

stream, which contains a sequence of graphics objects to be painted on the page.

This appearance is fully speciﬁed; all layout and formatting decisions have al-

ready been made by the application that generated the content stream.

In addition to describing the static appearance of pages, a PDF document may

contain interactive elements that are possible only in an electronic representa-

tion. PDF supports annotations of many kinds for such things as text notes,

hypertext links, markup, ﬁle attachments, sounds, and movies. A document can

deﬁne its own user interface; keyboard and mouse input can trigger actions that

are speciﬁed by PDF objects. The document can contain interactive form ﬁelds to

be ﬁlled in by the user, and can import the values of these ﬁelds from or export

them to other applications.

Finally, a PDF document can contain higher-level information that is useful for

interchange among applications. In addition to specifying appearance, a docu-

ment’s content can include identiﬁcation and structural information that allows

it to be searched, edited, or extracted for reuse elsewhere. PDF is particularly well


suited for representing a document as it moves through successive stages of a pre-

press production workﬂow.

2.1 Imaging Model

At the heart of PDF is its ability to describe the appearance of sophisticated

graphics and typography. This is achieved through the use of the Adobe imaging

model, the same high-level, device-independent representation used in the Post-

Script page description language.

Although application programs could theoretically describe any page as a full-

resolution pixel array, the resulting ﬁle would be bulky, device-dependent, and

impractical for high-resolution devices. A high-level imaging model enables

applications to describe the appearance of pages containing text, graphical

shapes, and sampled images in terms of abstract graphical elements rather than

directly in terms of device pixels. Such a description is economical and device-

independent, and can be used to produce high-quality output on a broad range

of printers and displays.

2.1.1 Page Description Languages

Among its other roles, PDF serves as a page description language: a language for

describing the graphical appearance of pages with respect to an imaging model.

An application program produces output through a two-stage process:

1. The application generates a device-independent description of the desired

output in the page description language.

2. A program controlling a speciﬁc output device interprets the description and

renders it on that device.

The two stages may be executed in different places and at different times; the page

description language serves as an interchange standard for the compact, device-

independent transmission and storage of printable or displayable documents.


2.1.2 Adobe Imaging Model

The Adobe imaging model is a simple and uniﬁed view of two-dimensional

graphics borrowed from the graphic arts. In this model, “paint” is placed on a

page in selected areas.

• The painted ﬁgures may be in the form of character shapes (glyphs), geometric

shapes, lines, or sampled images such as digital representations of photographs.

• The paint may be in color or in black, white, or any shade of gray; it may also

take the form of a repeating pattern (PDF 1.2) or a smooth transition between

colors (PDF 1.3).

• Any of these elements may be clipped to appear within other shapes as they are

placed onto the page.

A page’s content stream contains operands and operators describing a sequence of

graphics objects. A PDF viewer application maintains an implicit current page

that accumulates the marks made by the painting operators. Initially, the current

page is completely blank. For each graphics object encountered in the content

stream, the viewer places marks on the current page, which completely obscure

any previous marks they may overlay (subject to the effects of the overprint

parameter in the graphics state; see Section 4.5.6, “Overprint Control”). This

method is known as a painting model: no matter what color a mark has—white,

black, gray, or color—it is placed on the current page as if it were applied with

opaque paint. Once the page has been completely composed, the accumulated

marks are rendered on the output medium and the current page is cleared to

blank again.
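The opaque painting model described above can be illustrated with a small sketch. The grid representation and the names blank_page and paint_rect are assumptions made for illustration only; they are not part of PDF or of any viewer application.

```python
# Hypothetical model of the "current page": a grid in which each cell holds the
# color of the most recent mark placed there (None means still blank).
def blank_page(width, height):
    return [[None] * width for _ in range(height)]

def paint_rect(page, x0, y0, x1, y1, color):
    """Place opaque paint on cells with x0 <= x < x1 and y0 <= y < y1."""
    for y in range(max(y0, 0), min(y1, len(page))):
        for x in range(max(x0, 0), min(x1, len(page[0]))):
            page[y][x] = color  # a later mark fully replaces an earlier one

page = blank_page(6, 4)
paint_rect(page, 0, 0, 4, 3, "black")
paint_rect(page, 2, 1, 6, 4, "white")  # white is paint too, not an eraser
```

Where the two rectangles overlap, only the white mark remains, because the second mark obscures the first. Cells that neither rectangle touches stay blank.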

The principal graphics objects (among others) are as follows:

• A path object consists of a sequence of connected and disconnected points,

lines, and curves that together describe shapes and their positions. It is built up

through the sequential application of path construction operators, each of which

appends one or more new elements. The path object is ended by a path-painting operator, which paints the path on the page in some way. The principal path-painting operators are S (stroke), which paints a line along the path, and f (fill), which paints the interior of the path.

• A text object consists of one or more glyph shapes representing characters of

text. The glyph shapes for the characters are described in a separate data struc-

ture called a font. Like path objects, text objects can be stroked or ﬁlled.


• An image object is a rectangular array of sample values, each representing a

color at a particular position within the rectangle. Image objects are typically

used to represent photographs.
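As a rough illustration of a path object, the sketch below assembles the text of a simple polygon path followed by a painting operator. The operators m and l correspond to moveto and lineto (see Section 2.4), and S and f are the painting operators named above. The helper name path_stream and the sample coordinates are assumptions; the complete operator syntax is given in later chapters.

```python
def path_stream(points, paint_op):
    """Return content-stream text for a path through `points`.

    Each operator follows its operands: "x y m" begins the path,
    "x y l" appends a line segment, and S or f ends the path by painting it.
    """
    if paint_op not in ("S", "f"):
        raise ValueError("expected S (stroke) or f (fill)")
    (x0, y0), rest = points[0], points[1:]
    lines = [f"{x0} {y0} m"]
    lines += [f"{x} {y} l" for x, y in rest]
    lines.append(paint_op)
    return "\n".join(lines)

triangle = path_stream([(100, 100), (200, 100), (150, 180)], "f")
```

Here triangle holds four lines of text: one m, two l, and the final f that fills the interior.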

The painting operators require various parameters, some explicit and others im-

plicit. Implicit parameters include the current color, current line width, current

font (typeface and size), and many others. Together, these implicit parameters

make up the graphics state. There are operators for setting the value of each im-

plicit parameter in the graphics state; painting operators use the values currently

in effect at the time they are invoked.

One additional implicit parameter in the graphics state modiﬁes the results of

painting graphics objects. The current clipping path outlines the area of the cur-

rent page within which paint can be placed. Although painting operators may

attempt to place marks anywhere on the current page, only those marks falling

within the current clipping path will affect the page; those falling outside it will

not. Initially, the current clipping path encompasses the entire imageable area of

the page. It can temporarily be reduced to the shape deﬁned by a path or text ob-

ject, or to the intersection of multiple such shapes. Marks placed by subsequent

painting operators will then be conﬁned within that boundary.
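The effect of the current clipping path can be sketched as a set intersection. In this simplified model (an assumption, not the representation used by any real viewer) a shape is a set of cell coordinates, and the page is a dictionary keyed by cell.

```python
def paint(page, cells, color, clip=None):
    """Mark `cells` (a set of (x, y) pairs) on `page`, a dict keyed by (x, y).

    `clip` stands in for the current clipping path: None means the whole
    imageable area; otherwise only cells inside it receive paint.
    """
    allowed = cells if clip is None else cells & clip
    for cell in allowed:
        page[cell] = color

def square(x0, y0, size):
    """Hypothetical helper: the cells of a size-by-size square at (x0, y0)."""
    return {(x, y) for x in range(x0, x0 + size) for y in range(y0, y0 + size)}

page = {}
clip = square(0, 0, 4) & square(2, 2, 4)    # intersection of two shapes
paint(page, square(0, 0, 6), "gray", clip)  # tries to paint a 6 x 6 area
```

Although the painting call covers 36 cells, only the four cells inside the clipping region are affected.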

2.1.3 Raster Output Devices

Much of the power of the Adobe imaging model derives from its ability to deal

with the general class of raster output devices. These encompass such technologies

as laser, dot-matrix, and ink-jet printers, digital imagesetters, and raster-scan

displays. The deﬁning property of a raster output device is that a printed or

displayed image consists of a rectangular array, or raster, of dots called pixels

(picture elements) that can be addressed individually. On a typical bilevel output

device, each pixel can be made either black or white. On some devices, pixels can

be set to intermediate shades of gray or to some color. The ability to set the colors

of individual pixels makes it possible to generate printed or displayed output that

can include text, arbitrary graphical shapes, and reproductions of sampled

images.

The resolution of a raster output device measures the number of pixels per unit of

distance along the two linear dimensions. Resolution is typically—but not neces-

sarily—the same horizontally and vertically. Manufacturers’ decisions on device


technology and price/performance tradeoffs create characteristic ranges of reso-

lution:

• Computer displays have relatively low resolution, typically 75 to 110 pixels per

inch.

• Dot-matrix printers generally range from 100 to 250 pixels per inch.

• Ink-jet and laser-scanned xerographic printing technologies achieve medium-

level resolutions of 300 to 1400 pixels per inch.

• Photographic technology permits high resolutions of 2400 pixels per inch or

more.

Higher resolution yields better quality and ﬁdelity of the resulting output, but is

achieved at greater cost. As the technology improves and computing costs de-

crease, products evolve to higher resolutions.

2.1.4 Scan Conversion

An abstract graphical element (such as a line, a circle, a character glyph, or a

sampled image) is rendered on a raster output device by a process known as scan

conversion. Given a mathematical description of the graphical element, this pro-

cess determines which pixels to adjust and what values to assign to those pixels to

achieve the most faithful rendition possible at the available device resolution.

The pixels on a page can be represented by a two-dimensional array of pixel

values in computer memory. For an output device whose pixels can only be black

or white, a single bit sufﬁces to represent each pixel. For a device that can repro-

duce gray levels or colors, multiple bits per pixel are required.
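To give a sense of scale, the following sketch estimates the memory needed for a full-page pixel array. The 8.5-by-11-inch page size and the padding of each row to a whole byte are assumptions chosen for the example.

```python
def raster_bytes(width_in, height_in, dpi, bits_per_pixel):
    """Bytes needed for a full-page pixel array, each row padded to a whole byte."""
    cols = round(width_in * dpi)
    rows = round(height_in * dpi)
    row_bytes = (cols * bits_per_pixel + 7) // 8
    return rows * row_bytes

bilevel_300 = raster_bytes(8.5, 11, 300, 1)  # 2550 x 3300 pixels, 1 bit each
gray_300 = raster_bytes(8.5, 11, 300, 8)     # same page, 8 bits per pixel
```

Under these assumptions a bilevel page at 300 pixels per inch needs about 1 megabyte, and an 8-bit grayscale page needs about 8.4 megabytes.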

Note: Although the ultimate representation of a printed or displayed page is logically

a complete array of pixels, its actual representation in computer memory need not

consist of one memory cell per pixel. Some implementations use other representa-

tions, such as display lists. The Adobe imaging model has been carefully designed not

to depend on any particular representation of raster memory.

For each graphical element that is to appear on the page, the scan converter sets

the values of the corresponding pixels. When the interpretation of the page de-

scription is complete, the pixel values in memory represent the appearance of the


page. At this point, a raster output process can render this representation (make it

visible) on a printed page or display screen.

Scan-converting a graphical shape, such as a rectangle or circle, entails determin-

ing which device pixels lie inside the shape and setting their values appropriately

(for example, to black). Because the edges of a shape do not always fall precisely

on the boundaries between pixels, some policy is required for deciding how to set

the pixels along the edges. Scan-converting a glyph representing a text character

is conceptually the same as scan-converting an arbitrary graphical shape; how-

ever, character glyphs are much more sensitive to legibility requirements and

must meet more rigid objective and subjective measures of quality.
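One possible edge policy is to set exactly those pixels whose centers fall inside the shape. The sketch below applies that rule to an axis-aligned rectangle; the rule itself is an assumption made for illustration and is not necessarily the policy a given implementation uses.

```python
import math

def scan_convert_rect(x0, y0, x1, y1):
    """Return the pixels (col, row) whose centers lie inside the rectangle.

    Pixel (c, r) covers the unit square starting at (c, r) in device space,
    so its center is (c + 0.5, r + 0.5). A center counts as inside when
    x0 <= center_x < x1 and y0 <= center_y < y1.
    """
    cols = range(math.ceil(x0 - 0.5), math.ceil(x1 - 0.5))
    rows = range(math.ceil(y0 - 0.5), math.ceil(y1 - 0.5))
    return {(c, r) for c in cols for r in rows}

wide = scan_convert_rect(1.2, 0.0, 3.6, 1.4)
tiny = scan_convert_rect(0.6, 0.6, 0.9, 0.9)
```

The second call returns no pixels at all: a shape smaller than a pixel can vanish under this policy, which is one reason glyph rendering needs more careful treatment.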

Rendering grayscale elements on a bilevel device is accomplished by a technique

known as halftoning. The array of pixels is divided into small clusters according to

some pattern (called the halftone screen). Within each cluster, some pixels are set

to black and some to white in proportion to the level of gray desired at that loca-

tion on the page. When viewed from a sufﬁcient distance, the individual dots be-

come imperceptible and the perceived result is a shade of gray. This enables a

bilevel raster output device to reproduce shades of gray and to approximate natu-

ral images such as photographs. Some color devices use a similar technique.
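Halftoning can be sketched with a small repeating threshold cell. The 2-by-2 cell and its threshold values below are assumptions; actual halftone screens are larger and depend on the device.

```python
# Assumed 2 x 2 threshold cell, repeated across the page.
THRESHOLDS = [[0.125, 0.625],
              [0.875, 0.375]]

def halftone(gray, width, height):
    """Render a uniform gray level (0.0 = black, 1.0 = white) as 0/1 pixels.

    A pixel is set to white (1) when the gray level exceeds the threshold
    at its position in the repeating cell, and to black (0) otherwise.
    """
    return [[1 if gray > THRESHOLDS[y % 2][x % 2] else 0 for x in range(width)]
            for y in range(height)]

def white_fraction(pixels):
    """Fraction of pixels set to white."""
    total = sum(len(row) for row in pixels)
    return sum(map(sum, pixels)) / total
```

For a 50 percent gray, half of the pixels in each cluster are set to white, so the proportion of white pixels matches the requested gray level.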

2.2 Other General Properties

This section describes other notable general properties of PDF, aside from its im-

aging model.

2.2.1 Portability

PDF ﬁles are represented as sequences of 8-bit binary bytes. A PDF ﬁle is de-

signed to be portable across all platforms and operating systems. The binary rep-

resentation is intended to be generated, transported, and consumed directly,

without translation between native character sets, end-of-line representations, or

other conventions used on various platforms.

Any PDF ﬁle can also be represented in a form that uses only 7-bit ASCII (Ameri-

can Standard Code for Information Interchange) character codes. This is useful

for the purpose of exposition, as in this book. However, this representation is not

recommended for actual use, since it is less efﬁcient than the normal binary rep-

resentation. Regardless of which representation is used, PDF ﬁles must be trans-


ported and stored as binary ﬁles, not as text ﬁles; inadvertent changes, such as

conversion between text end-of-line conventions, will damage the ﬁle and may

render it unusable.
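The following sketch shows why end-of-line conversion is harmful. The sample bytes are illustrative only and do not form a valid PDF file; the function text_mode_damage is a hypothetical stand-in for a transfer performed in text mode.

```python
def text_mode_damage(data: bytes) -> bytes:
    """Mimic a transfer that rewrites every LF as CR LF."""
    return data.replace(b"\n", b"\r\n")

# Illustrative bytes: the middle line holds binary data that happens to
# contain the byte value 0x0A (line feed).
original = b"%PDF-1.3\nstream\n\x00\x0a\xff\nendstream\n"
damaged = text_mode_damage(original)
```

The conversion inserts a byte at every line feed, including the one inside the binary data, so the data is altered and everything after the first change shifts to a different byte position.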

2.2.2 Compression

To reduce ﬁle size, PDF supports a number of industry-standard compression ﬁl-

ters:

• JPEG compression of color and grayscale images

• CCITT Group 3, CCITT Group 4, and run-length compression of mono-

chrome images

• LZW (Lempel-Ziv-Welch) and Flate compression (PDF 1.2) of text, graphics,

and images

Using JPEG compression, color and grayscale images can be compressed by a fac-

tor of 10 or more. Effective compression of monochrome images depends on the

compression ﬁlter used and the properties of the image, but reductions of 2:1 to

8:1 are common. LZW or Flate compression of the content streams describing all

other text and graphics in the document results in compression ratios of approxi-

mately 2:1. All of these compression ﬁlters produce binary data, which can then

be further converted to ASCII base-85 encoding if a 7-bit ASCII representation is

desired.
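The two-step arrangement described above (binary compression, then optional conversion to ASCII) can be tried with Python's standard library. This is a sketch under stated assumptions: zlib produces Flate-compressed data, and base64.a85encode produces a base-85 text encoding whose framing details may differ from those PDF requires. The sample content is invented and deliberately repetitive.

```python
import base64
import zlib

content = b"0 0 m 100 0 l 100 100 l 0 100 l f\n" * 50  # repetitive sample data

flate = zlib.compress(content)     # binary Flate-compressed data
ascii85 = base64.a85encode(flate)  # base-85 text form of the binary data

# Decoding in the reverse order recovers the original bytes exactly.
restored = zlib.decompress(base64.a85decode(ascii85))
```

The compression ratio achieved on real content streams depends on their contents; this repetitive sample compresses far better than typical data would.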

2.2.3 Font Management

Managing fonts is a fundamental challenge in document interchange. Generally,

the receiver of a document must have the same fonts that were originally used to

create it. If a different font is substituted, its character set, glyph shapes, and met-

rics may differ from those in the original font. This can produce unexpected and

undesirable results, such as lines of text extending into margins or overlapping

with graphics.


PDF provides various means for dealing with font management:

• The original font programs can be embedded in the PDF ﬁle. PDF supports

various font formats, including Type 1, TrueType, and CID-keyed fonts. This

ensures the most predictable and dependable results.

• To conserve space, a font subset can be embedded, containing just the glyph

descriptions for those characters that are actually used in the document. Also,

Type 1 fonts can be represented in a special compact format.

• PDF prescribes a set of 14 standard fonts that can be used without prior deﬁni-

tion. These include four faces of each of three Latin text typefaces (Courier,

Helvetica, and Times) and two symbolic fonts (Symbol and ITC Zapf Dingbats). These fonts, or suitable substitute fonts with the same metrics, are

guaranteed to be available in all PDF viewer applications.

• A PDF ﬁle can refer by name to fonts that are not embedded in the PDF ﬁle. In

this case, a viewer application will use those fonts if they are available in the

viewer’s environment. This approach suffers from the uncertainties noted

above.

• A PDF ﬁle contains a font descriptor for each font that it uses (other than the

standard 14). The font descriptor includes font metrics and style information,

enabling a viewer application to select or synthesize a suitable substitute font if

necessary. Although the glyphs’ shapes will differ from those intended, their

placement will be accurate.

Font management is primarily concerned with producing the correct appearance

of text—that is, the shape and placement of glyphs. However, it is sometimes nec-

essary for a PDF application to extract the meaning of the text, represented in

some standard information encoding such as Unicode. In some cases, this information can be deduced from the encoding used to represent the text in the PDF file. Otherwise, the PDF creator application should specify the mapping explicitly by including a special object, the ToUnicode CMap.

2.2.4 Single-Pass File Generation

Because of system limitations and efﬁciency considerations, it may be necessary

or desirable for an application program to generate a PDF ﬁle in a single pass. For

example, the program may have limited memory available or be unable to open

temporary ﬁles. For this reason, PDF supports single-pass generation of ﬁles.

Although some PDF objects must specify their length in bytes, a mechanism is


provided allowing the length to follow the object itself in the PDF ﬁle. In addi-

tion, information such as the number of pages in the document can be written

into the ﬁle after all pages have been generated.
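The mechanism for letting a length follow its object can be sketched as follows. The syntax shown (a stream dictionary whose Length entry refers to a separate object written afterward) anticipates material from Chapter 3; the object numbers 4 and 5 and the function name are assumptions chosen for the example.

```python
import io

def write_stream_single_pass(out, chunks):
    """Write a stream object whose length is stored in a later object.

    `chunks` is any iterable of bytes produced on the fly, so the total
    length is not known when the stream dictionary is written.
    """
    out.write(b"4 0 obj\n<< /Length 5 0 R >>\nstream\n")
    length = 0
    for chunk in chunks:
        out.write(chunk)
        length += len(chunk)
    out.write(b"\nendstream\nendobj\n")
    # Only now is the length known; it goes into its own object.
    out.write(b"5 0 obj\n%d\nendobj\n" % length)
    return length

buf = io.BytesIO()
n = write_stream_single_pass(buf, [b"0 0 m ", b"100 100 l ", b"S"])
```

The writer never seeks backward in the output, which is what makes generation in a single pass possible.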

A PDF ﬁle that is generated in a single pass is generally not ordered for most efﬁ-

cient viewing, particularly when accessing the contents of the ﬁle over a network.

When generating a PDF ﬁle that is intended to be viewed many times, it is worth-

while to perform a second pass to optimize the order in which objects occur in

the ﬁle. PDF speciﬁes a particular ﬁle organization, Linearized PDF, which is doc-

umented in Appendix F. Other optimizations are also possible, such as detecting

duplicated sequences of graphics objects and collapsing them to a single shared

sequence that is speciﬁed only once.

2.2.5 Random Access

A PDF ﬁle should be thought of as a ﬂattened representation of a data structure

consisting of a collection of objects that can refer to each other in any arbitrary

way. The order of the objects’ occurrence in the PDF ﬁle has no semantic signiﬁ-

cance. In general, a viewing application should process a PDF ﬁle by following

references from object to object, rather than by processing objects sequentially.

This is particularly important for interactive document viewing, or for any appli-

cation in which pages or other objects in the PDF ﬁle are accessed out of

sequence.

To support such random access to individual objects, every PDF ﬁle contains a

cross-reference table that can be used to locate and directly access pages and other

important objects within the ﬁle. The cross-reference table is stored at the end of

the ﬁle, allowing applications that generate PDF ﬁles in a single pass to store it

easily and applications that read PDF ﬁles to locate it easily. Using the cross-

reference table makes the time needed to locate a page or other object nearly in-

dependent of the length of the document. This allows PDF documents containing

hundreds or thousands of pages to be accessed efﬁciently.
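The idea behind the cross-reference table can be sketched as a mapping from object numbers to byte offsets. The dictionary below is a simplified stand-in; the actual table format is defined in Section 3.4, and the object contents shown are placeholders.

```python
def build_file(objects):
    """Concatenate object bodies and record each one's byte offset.

    `objects` maps an object number to its serialized bytes. The returned
    `offsets` dict plays the role of the cross-reference table.
    """
    data = bytearray()
    offsets = {}
    for number, body in objects.items():
        offsets[number] = len(data)
        data += body
    return bytes(data), offsets

def read_object(data, offsets, number, length):
    """Fetch one object directly by offset, without scanning the file."""
    start = offsets[number]
    return data[start:start + length]

objects = {1: b"<<catalog>>", 2: b"<<pages>>", 3: b"<<page>>"}
data, offsets = build_file(objects)
```

Looking up an object costs one table access regardless of how many objects precede it in the file.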


2.2.6 Security

PDF has two security features that can be used, separately or together, in any doc-

ument:

• The document can be encrypted so that only authorized users can access it.

There is separate authorization for the owner of the document and for all other

users; the users’ access can be selectively restricted to allow only certain opera-

tions, such as viewing, printing, or editing.

• The document can be digitally signed to certify its authenticity. The signature

may take many forms, including a document digest that has been encrypted

with a public/private key, a biometric signature such as a ﬁngerprint, and oth-

ers. Any subsequent changes to a signed PDF ﬁle will invalidate the signature.
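The way a digest detects later changes can be shown with a short sketch. SHA-256 is used here only because it is available in Python's standard library; it is an assumption, not the algorithm prescribed by PDF, and the encryption of the digest with a key is omitted.

```python
import hashlib

def digest(data: bytes) -> str:
    """Return a fixed-length fingerprint of the given bytes."""
    return hashlib.sha256(data).hexdigest()

def still_valid(data: bytes, recorded: str) -> bool:
    """A signature check in miniature: does the data match the recorded digest?"""
    return digest(data) == recorded

signed = b"document bytes as signed"
recorded = digest(signed)
```

Changing even a single byte of the signed data yields a different digest, so the comparison fails.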

2.2.7 Incremental Update

Applications may allow users to modify PDF documents. Users should not have

to wait for the entire ﬁle—which can contain hundreds of pages or more—to be

rewritten each time modiﬁcations to the document are saved. PDF allows modiﬁ-

cations to be appended to a ﬁle, leaving the original data intact. The addendum

appended when a ﬁle is incrementally updated contains only those objects that

were actually added or modiﬁed, and includes an update to the cross-reference

table. Incremental update allows an application to save modiﬁcations to a PDF

document in an amount of time proportional to the size of the modiﬁcation in-

stead of the size of the ﬁle.

In addition, because the original contents of the document are still present in the

ﬁle, it is possible to undo saved changes by deleting one or more addenda. The

ability to recover the exact contents of an original document is critical when digi-

tal signatures have been applied and subsequently need to be veriﬁed.
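Incremental update and the undoing of saved changes can be modeled in a few lines. The class below is a toy model under stated assumptions: the file is a byte string, each save appends an addendum, and the class and method names are hypothetical.

```python
class IncrementalFile:
    """Toy model: a byte string plus the length it had after each save."""

    def __init__(self, original: bytes):
        self.data = original
        self.save_points = [len(original)]

    def save_changes(self, addendum: bytes) -> int:
        """Append only the changed objects; return the number of bytes written."""
        self.data += addendum
        self.save_points.append(len(self.data))
        return len(addendum)

    def undo_last_save(self) -> None:
        """Drop the most recent addendum, restoring the previous bytes exactly."""
        if len(self.save_points) > 1:
            self.save_points.pop()
            self.data = self.data[:self.save_points[-1]]
```

The work done by save_changes depends only on the size of the addendum, and undo_last_save recovers the earlier contents byte for byte, which is the property that signature verification relies on.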

2.2.8 Extensibility

PDF is designed to be extensible. Not only can new features be added, but appli-

cations based on earlier versions of PDF can behave reasonably when they en-

counter newer features that they do not understand. Appendix H describes how a

PDF viewer application should behave in such cases.


Additionally, PDF provides means for applications to store their own private in-

formation in a PDF ﬁle. This information can be recovered when the ﬁle is im-

ported by the same application, but is ignored by other applications. This allows

PDF to serve as an application’s native ﬁle format while allowing its documents

to be viewed and printed by other applications. Application-specific data can be

stored either as marked content annotating the graphics objects in a PDF content

stream or as entirely separate objects unconnected with the PDF content.

2.3 Using PDF

PDF ﬁles may be produced either directly by application programs or indirectly

by conversion from other ﬁle formats or imaging models. As PDF documents and

applications that process them become more prevalent, new ways of creating and

using PDF will be invented. One of the goals of this book is to make the ﬁle for-

mat accessible so that application developers can expand on the ideas behind

PDF and the applications that initially support it.

Many applications can produce PDF ﬁles directly, and some can import them as

well. This is the most desirable approach, since it gives the application access to

the full capabilities of PDF, including the imaging model and the interactive and

document interchange features. Alternatively, existing applications that do not

generate PDF directly can still be used to produce PDF output by indirect meth-

ods. There are two principal ways of doing this:

• The application describes its printable output by making calls to an application

programming interface (API), such as GDI in Microsoft Windows or QuickDraw in the Apple Mac OS. A software component called a printer driver intercepts these calls and interprets them to generate output in the form of a PDF file.

• The application produces printable output directly in some other ﬁle format,

such as PostScript, PCL, HPGL, or DVI, which is then converted into PDF by a

separate translation program.

Note, however, that while these indirect strategies are often the easiest way to ob-

tain PDF output from an existing application, the resulting PDF ﬁles may not

make the best use of the high-level Adobe imaging model. This is because the in-

formation embodied in the application’s API calls or in the intermediate output

ﬁle often describes the desired results at too low a level; any higher-level informa-


tion maintained by the original application has been lost and is not available to

the printer driver or translator.

Figures 2.1 and 2.2 show how Adobe Acrobat products support these indirect ap-

proaches. PDF Writer (Figure 2.1), available on the Windows and Mac OS plat-

forms, acts as a printer driver, intercepting graphics and text operations

generated by a running application program through the operating system’s API.

Instead of converting these operations into printer commands and transmitting

them directly to a printer, however, PDF Writer converts them to equivalent PDF

operators and embeds them in a PDF ﬁle. The result is a platform-independent

ﬁle that can be viewed and printed by a PDF viewer application, such as Adobe

Acrobat, running on any supported platform—even a different platform from

the one on which the ﬁle was originally generated.

FIGURE 2.1 Creating PDF files using PDF Writer (diagram: a Macintosh application using QuickDraw, or a Windows application using GDI, passes its output to PDF Writer, which produces a PDF file that can be viewed in Acrobat)


Instead of describing their printable output via API calls, some applications produce PostScript page descriptions directly, either because of limitations in the QuickDraw or GDI imaging models or because the applications run on platforms such as DOS or UNIX, where there is no system-level printer driver. PostScript files generated by such applications can be converted into PDF files using the Acrobat Distiller application (see Figure 2.2). Because PostScript and PDF share the same Adobe imaging model, Acrobat Distiller can preserve the exact graphical content of the PostScript file in the translation to PDF. Additionally, Distiller supports a PostScript language extension, called pdfmark, that allows the producing application to embed instructions in the PostScript file for creating hypertext links, logical structure, and other interactive and document interchange features of PDF. Again, the resulting PDF file can be viewed with a viewer application, such as Adobe Acrobat, on any supported platform.

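FIGURE 2.2 Creating PDF files using Acrobat Distiller (diagram: a PostScript page description is converted by Acrobat Distiller into a PDF file that can be viewed in Acrobat Exchange or Reader)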

2.4 PDF and the PostScript Language

The PDF operators for setting the graphics state and painting graphics objects are

similar to the corresponding operators in the PostScript language. Unlike Post-

Script, however, PDF is not a full-scale programming language; it trades reduced


ﬂexibility for improved efﬁciency and predictability. PDF therefore differs from

PostScript in the following signiﬁcant ways:

• PDF enforces a strictly deﬁned ﬁle structure that allows an application to access

parts of a document in arbitrary order.

• To simplify the processing of content streams, PDF does not include common

programming language features such as procedures, variables, and control con-

structs.

• PDF ﬁles contain information such as font metrics to ensure viewing ﬁdelity.

• A PDF ﬁle may contain additional information that is not directly connected

with the imaging model, such as hypertext links for interactive viewing and

logical structure information for document interchange.

Because of these differences, a PDF ﬁle generally cannot be transmitted directly

to a PostScript output device for printing (although a few such devices do also

support PDF directly). An application printing a PDF document to a PostScript

device must carry out the following steps:

1. Insert procedure sets containing PostScript procedure deﬁnitions to implement

the PDF operators.

2. Extract the content for each page. Each content stream is essentially the script

portion of a traditional PostScript program using very speciﬁc procedures,

such as m for moveto and l for lineto.

3. Decode compressed text, graphics, and image data as necessary. The compres-

sion ﬁlters used in PDF are compatible with those used in PostScript; they may

or may not be supported, depending on the LanguageLevel of the target out-

put device.

4. Insert any needed resources, such as fonts, into the PostScript ﬁle. These can

be either the original fonts themselves or suitable substitute fonts based on the

font metrics in the PDF ﬁle. Fonts may need to be converted to a format that

the PostScript interpreter recognizes, such as Type 1 or Type 42.

5. Put the information in the correct order. The result is a traditional PostScript

program that fully represents the visual aspects of the document but no longer

contains PDF elements such as hypertext links, annotations, and bookmarks.

6. Send the PostScript program to the output device.
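Steps 1 and 2 above can be sketched for the two operators mentioned in the text. The mapping below covers only m and l and is an assumption made for illustration; an actual procedure set defines every PDF operator and is considerably more involved.

```python
# Assumed mapping for the two operators named in step 2.
PROCSET = {"m": "moveto", "l": "lineto"}

def to_postscript(content_stream: str) -> str:
    """Prefix a page's content stream with PostScript definitions of its operators."""
    prolog = [f"/{op} {{ {ps} }} bind def" for op, ps in PROCSET.items()]
    return "\n".join(prolog + [content_stream])

program = to_postscript("10 10 m\n90 90 l")
```

The content stream itself is passed through unchanged; once the short operator names are defined as procedures, a PostScript interpreter can execute it as the script portion of the program.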

CHAPTER 3

Syntax

THIS CHAPTER COVERS everything about the syntax of PDF at the object, ﬁle,

and document level. It sets the stage for subsequent chapters, which describe how

the contents of a PDF ﬁle are interpreted as page descriptions, interactive naviga-

tional aids, and application-level logical structure.

PDF syntax is best understood by thinking of it in four parts, as shown in

Figure 3.1:

• Objects. A PDF document is a data structure composed from a small set of basic

types of data object. Section 3.1, “Lexical Conventions,” describes the character

set used to write objects and other syntactic elements. Section 3.2, “Objects,”

describes the syntax and essential properties of the objects themselves.

Section 3.3, “Details of Filtered Streams,” provides complete details of the most

complex data type, the stream object.

• File structure. The PDF ﬁle structure determines how objects are stored in a

PDF ﬁle, how they are accessed, and how they are updated. This structure is in-

dependent of the semantics of the objects. Section 3.4, “File Structure,” de-

scribes the ﬁle structure. Section 3.5, “Encryption,” describes a ﬁle-level

mechanism for protecting a document’s contents from unauthorized access.

• Document structure. The PDF document structure speciﬁes how the basic ob-

ject types are used to represent components of a PDF document: pages, fonts,

annotations, and so forth. Section 3.6, “Document Structure,” describes the

overall document structure; later chapters address the detailed semantics of the

components.

• Content streams. A PDF content stream contains a sequence of instructions de-

scribing the appearance of a page or other graphical entity. These instructions,

while also represented as objects, are conceptually distinct from the objects that

represent the document structure and are described separately. Section 3.7,

“Content Streams and Resources,” discusses PDF content streams and their as-

sociated resources.

FIGURE 3.1 PDF components

In addition, this chapter describes some data structures, built from basic objects,

that are so widely used that they can almost be considered basic object types in

their own right. These objects are covered in Sections 3.8, “Common Data Struc-

tures”; 3.9, “Functions”; and 3.10, “File Speciﬁcations.”

PDF’s object and ﬁle syntax is also used as the basis for other ﬁle formats. These

include the Forms Data Format (FDF), described in Section 7.6.6, “Forms Data

Format,” and the Portable Job Ticket Format (PJTF), described in Adobe Techni-

cal Note #5620, Portable Job Ticket Format.

3.1 Lexical Conventions

At the most fundamental level, a PDF ﬁle is a sequence of 8-bit bytes. These bytes

can be grouped into tokens according to the syntax rules described below. One or

more tokens are then assembled to form higher-level syntactic entities, prin-

cipally objects, which are the basic data values from which a PDF document is

constructed.

PDF can be entirely represented using byte values corresponding to the visible

printable subset of the ASCII character set, plus characters that appear as “white

space,” such as space, tab, carriage return, and line feed characters. ASCII is the

American Standard Code for Information Interchange, a widely used convention

for encoding a speciﬁc set of 128 characters as binary numbers. However, a PDF

ﬁle is not restricted to the ASCII character set; it can contain arbitrary 8-bit bytes,

subject to the following considerations:

• The tokens that delimit objects and that describe the structure of a PDF ﬁle are

all written in the ASCII character set, as are all the reserved words and the

names used as keys in standard dictionaries.

• The data values of certain types of object—strings and streams—can be but

need not be written entirely in ASCII. For the purpose of exposition (as in this

book), ASCII representation is preferred. However, in actual practice, data that

is naturally binary, such as sampled images, is represented directly in binary for

the sake of compactness and efﬁciency.

• A PDF ﬁle containing binary data must be transported and stored by means

that preserve all bytes of the ﬁle faithfully; that is, as a binary ﬁle rather than a

text ﬁle. Such a ﬁle is not portable to environments that impose reserved char-

acter codes, maximum line lengths, end-of-line conventions, or other restric-

tions.

Note: In this chapter, the term character is synonymous with byte and merely refers

to a particular 8-bit value. This is entirely independent of any logical meaning that

the value may have when it is treated as data in speciﬁc contexts, such as represent-

ing human-readable text or selecting a glyph from a font.

3.1.1 Character Set

The PDF character set is divided into three classes, called regular, delimiter, and

white-space characters. This classiﬁcation determines the grouping of characters

into tokens, except within strings, streams, and comments; different rules apply

in those contexts.

White-space characters (see Table 3.1) separate syntactic constructs such as names

and numbers from each other. All white-space characters are equivalent, except

in comments, strings, and streams. In all other contexts, PDF treats any sequence

of consecutive white-space characters as if there were just one.

TABLE 3.1 White-space characters

DECIMAL   HEXADECIMAL   OCTAL   NAME
0         00            000     Null (NUL)
9         09            011     Tab (HT)
10        0A            012     Line feed (LF)
12        0C            014     Form feed (FF)
13        0D            015     Carriage return (CR)
32        20            040     Space (SP)

The carriage return (CR) and line feed (LF) characters, also called newline charac-

ters, are treated as end-of-line (EOL) markers. The combination of a carriage

return followed immediately by a line feed is treated as one EOL marker. For the

most part, EOL markers are treated the same as any other white-space characters.

However, there are certain instances in which an EOL marker is required or rec-

ommended—that is, the following token must appear at the beginning of a line.

Note: The examples in this book illustrate a recommended convention for arranging

tokens into lines. However, the examples’ use of white space for indentation is purely

for clarity of exposition and is not recommended for practical use.

The delimiter characters (, ), <, >, [, ], {, }, /, and % are special. They delimit
syntactic entities such as strings, arrays, names, and comments. Any of these characters
terminates the entity preceding it and is not included in the entity.

All characters besides the white-space characters and delimiters are referred to as

regular characters. These include 8-bit binary characters that are outside the

ASCII character set. A sequence of consecutive regular characters comprises a

single token.

Note: PDF is case-sensitive; corresponding uppercase and lowercase letters are con-

sidered distinct.
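
The three character classes can be sketched in a few lines of Python. This is only an illustration of the classification described above, not part of the PDF specification; the names WHITE_SPACE, DELIMITERS, and char_class are hypothetical.

```python
# Byte values taken from Table 3.1 and from the list of delimiter characters.
WHITE_SPACE = frozenset(b"\x00\t\n\x0c\r ")
DELIMITERS = frozenset(b"()<>[]{}/%")


def char_class(byte: int) -> str:
    """Classify one byte value (0-255) as white-space, delimiter, or regular."""
    if not 0 <= byte <= 255:
        raise ValueError("expected a byte value in the range 0-255")
    if byte in WHITE_SPACE:
        return "white-space"
    if byte in DELIMITERS:
        return "delimiter"
    # Everything else, including bytes outside the ASCII range, is regular.
    return "regular"
```

Note that this classification applies only outside strings, streams, and comments, where different rules are in effect.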

3.1.2 Comments

Any occurrence of the percent sign character (%) outside a string or stream intro-

duces a comment. The comment consists of all characters between the percent

sign and the end of the line, including regular, delimiter, space, and tab charac-

ters. PDF ignores comments, treating them as if they were single white-space

characters. That is, a comment separates the token preceding it from the one fol-

lowing; thus the PDF fragment

abc% comment {/%) blah blah blah

123

is syntactically equivalent to just the tokens abc and 123.

Comments (other than the %PDF-1.3 and %%EOF comments described in Section 3.4, “File
Structure”) have no semantics. They are not necessarily preserved by applications that edit
PDF files (see implementation note 2 in Appendix H). In particular, there is no PDF
equivalent of the PostScript document structuring conventions (DSC).
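
The way a comment separates tokens can be illustrated with a deliberately simplified tokenizer. The sketch below is an assumption-laden illustration, not a conforming lexer: it does not recognize strings or streams (where % does not start a comment), and it emits every delimiter other than % as a single-byte token. The name split_tokens is hypothetical.

```python
from typing import List

WHITE_SPACE = b"\x00\t\n\x0c\r "
DELIMITERS = b"()<>[]{}/%"


def split_tokens(data: bytes) -> List[bytes]:
    """Split bytes into tokens, skipping white space and comments (simplified)."""
    tokens = []
    i, n = 0, len(data)
    while i < n:
        byte = data[i]
        if byte in WHITE_SPACE:
            i += 1
        elif byte == ord("%"):
            # A comment runs up to, but does not include, the end-of-line marker.
            while i < n and data[i] not in b"\r\n":
                i += 1
        elif byte in DELIMITERS:
            # Simplification: each delimiter becomes its own one-byte token.
            tokens.append(data[i:i + 1])
            i += 1
        else:
            # A run of consecutive regular characters forms a single token.
            start = i
            while i < n and data[i] not in WHITE_SPACE and data[i] not in DELIMITERS:
                i += 1
            tokens.append(data[start:i])
    return tokens
```

Applied to the fragment shown earlier in this section, the sketch yields only the two tokens abc and 123.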

3.2 Objects

PDF supports eight basic types of object:

• Boolean values

• Integer and real numbers

• Strings

• Names

• Arrays

• Dictionaries

• Streams

• The null object

Objects may be labeled so that they can be referred to by other objects. A labeled

object is called an indirect object.

The following sections describe each object type, as well as how to create and

refer to indirect objects.

3.2.1 Boolean Objects

PDF provides boolean objects with values true and false. The keywords true and false
represent these values. Boolean objects can be used as the values of array elements and
dictionary entries, and can also occur in PostScript calculator functions as the results of
boolean and relational operators and as operands to the conditional operators if and ifelse
(see Section 3.9.4, “Type 4 (PostScript Calculator) Functions”).

3.2.2 Numeric Objects

PDF provides two types of numeric object: integer and real. Integer objects rep-

resent mathematical integers within a certain interval centered at 0. Real objects

approximate mathematical real numbers, but with limited range and precision;

they are typically represented in ﬁxed-point, rather than ﬂoating-point, form.

The range and precision of numbers are limited by the internal representations

used in the machine on which the PDF viewer application is running;

Appendix C gives these limits for typical implementations.

An integer is written as one or more decimal digits optionally preceded by a sign:

123 43445 +17 -98 0

The value is interpreted as a signed decimal integer and is converted to an integer

object. If it exceeds the implementation limit for integers, it is converted to a real

object.

A real value is written as one or more decimal digits with an optional sign and a

leading, trailing, or embedded period (decimal point):

34.5 -3.62 +123.6 4. -.002 0.0

The value is interpreted as a real number and is converted to a real object. If it

exceeds the implementation limit for real numbers, an error occurs.

Note: PDF does not support the PostScript syntax for numbers with nondecimal radices (such
as 16#FFFE) or in exponential format (such as 6.02E23).

Throughout this book, the term number refers to an object whose type may be either integer
or real. Wherever a real number is expected, an integer may be used instead and will be
automatically converted to an equivalent real value. For example, it is not necessary to
write the number 1.0 in real format; the integer 1 will suffice.
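
The numeric syntax can be summarized by a small Python sketch. It is an illustration only: the function name parse_number is hypothetical, and because Python integers do not overflow, the rule that an oversized integer is converted to a real object is not modeled.

```python
import re
from typing import Union

# One or more decimal digits with an optional sign.
_INTEGER = re.compile(r"[+-]?[0-9]+")
# Digits with a leading, trailing, or embedded period and an optional sign.
_REAL = re.compile(r"[+-]?(?:[0-9]+\.[0-9]*|\.[0-9]+)")


def parse_number(token: bytes) -> Union[int, float]:
    """Convert a numeric token to an int (integer object) or float (real object)."""
    text = token.decode("ascii")
    if _INTEGER.fullmatch(text):
        return int(text)
    if _REAL.fullmatch(text):
        return float(text)
    # Radix notation (16#FFFE) and exponents (6.02E23) are not PDF numbers.
    raise ValueError(f"not a PDF number: {text!r}")
```

For instance, the token 4. produces a real value, while 16#FFFE and 6.02E23 are rejected.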

3.2.3 String Objects

A string object consists of a series of bytes—unsigned integer values in the range 0

to 255. The string elements are not integer objects, but are stored in a more com-

pact format. The length of a string is subject to an implementation limit; see

Appendix C.

There are two conventions, described in the following sections, for writing a

string object in PDF:

• As a sequence of literal characters enclosed in parentheses ( )

• As hexadecimal data enclosed in angle brackets < >

This section describes only the basic syntax for writing a string as a sequence of

bytes. Strings can be used for many purposes and can be formatted in a variety of

ways. When a string is used for a speciﬁc purpose (to represent a date, for ex-

ample), it is useful to have a standard format for that purpose (see Section 3.8.2,

“Dates”). Such formats are merely conventions for interpreting the contents of a

string and are not in themselves separate object types. The use of a particular for-

mat is described with the deﬁnition of the string object that uses that format.

Literal Strings

A literal string is written as an arbitrary number of characters enclosed in paren-

theses. Any characters may appear in a string except unbalanced parentheses and

the backslash, which must be treated specially. Balanced pairs of parentheses

within a string require no special treatment.

The following are valid literal strings:

(This is a string)

(Strings may contain newlines

and such.)

(Strings may contain balanced parentheses ( ) and

special characters (*!&}^% and so on).)

(The following is an empty string.)

()

(It has zero (0) length.)

Within a literal string, the backslash (\) is used as an escape character for various

purposes, such as to include newline characters, nonprinting ASCII characters,

unbalanced parentheses, or the backslash character itself in the string. The char-

acter immediately following the backslash determines its precise interpretation

(see Table 3.2). If the character following the backslash is not one of those shown

in the table, the backslash is ignored.

TABLE 3.2 Escape sequences in literal strings

SEQUENCE   MEANING
\n         Line feed
\r         Carriage return
\t         Horizontal tab
\b         Backspace
\f         Form feed
\(         Left parenthesis
\)         Right parenthesis
\\         Backslash
\ddd       Character code ddd (octal)

If a string is too long to be conveniently placed on a single line, it may be split

across multiple lines by using the backslash character at the end of a line to indi-

cate that the string continues on the following line. The backslash and the end-

of-line marker following it are not considered part of the string. For example:

(These \

two strings \

are the same.)

(These two strings are the same.)

If an end-of-line marker appears within a literal string without a preceding backslash, the
result is equivalent to \n (regardless of whether the end-of-line marker itself was a
carriage return, a line feed, or both). For example:

(This string has an end-of-line at the end of it.

)

(So does this one.\n)

The \ddd escape sequence provides a way to represent characters outside the

printable ASCII character set. For example:

(This string contains \245two octal characters\307.)

The number ddd may consist of one, two, or three octal digits, with high-order

overﬂow ignored. It is required that three octal digits be used, with leading zeros

as needed, if the next character of the string is also a digit. For example, the literal

(\0053)

denotes a string containing two characters, \005 (Control-E) followed by the digit

3, whereas both

(\053)

and

(\53)

denote strings containing the single character \053, a plus sign (+).

This notation provides a way to specify characters outside the 7-bit ASCII charac-

ter set using ASCII characters only. However, any 8-bit value may appear in a

string. In particular, when a document is encrypted (see Section 3.5, “Encryp-

tion”), all of its strings are encrypted and often contain arbitrary 8-bit values.

Note that the backslash character is still required as an escape to specify unbal-

anced parentheses or the backslash character itself.
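
The escape rules for literal strings can be collected into a single decoding routine. The following Python sketch is one possible reading of the rules in this section, not a reference implementation; the name decode_literal_string is hypothetical, and the sketch assumes the caller has already isolated a complete token from its opening parenthesis to the matching closing parenthesis.

```python
# Single-character escapes from Table 3.2, keyed by the byte after the backslash.
_ESCAPES = {
    ord("n"): 0x0A, ord("r"): 0x0D, ord("t"): 0x09, ord("b"): 0x08,
    ord("f"): 0x0C, ord("("): 0x28, ord(")"): 0x29, ord("\\"): 0x5C,
}


def decode_literal_string(token: bytes) -> bytes:
    """Return the bytes denoted by a literal string token such as b'(abc)'."""
    if len(token) < 2 or token[:1] != b"(" or token[-1:] != b")":
        raise ValueError("literal string must be enclosed in parentheses")
    body = token[1:-1]
    out = bytearray()
    i, n = 0, len(body)
    while i < n:
        byte = body[i]
        if byte == 0x5C:  # backslash
            i += 1
            if i >= n:
                break
            nxt = body[i]
            if nxt in _ESCAPES:
                out.append(_ESCAPES[nxt])
                i += 1
            elif nxt in b"01234567":
                # One to three octal digits; high-order overflow is ignored.
                value, j = 0, i
                while j < n and j < i + 3 and body[j] in b"01234567":
                    value = value * 8 + (body[j] - 0x30)
                    j += 1
                out.append(value & 0xFF)
                i = j
            elif nxt in b"\r\n":
                # Line continuation: drop the backslash and the EOL marker.
                i += 1
                if nxt == 0x0D and i < n and body[i] == 0x0A:
                    i += 1
            else:
                # Unknown escape: the backslash is ignored, the character kept.
                out.append(nxt)
                i += 1
        elif byte in b"\r\n":
            # An unescaped EOL marker (CR, LF, or CR LF) is equivalent to \n.
            out.append(0x0A)
            i += 1
            if byte == 0x0D and i < n and body[i] == 0x0A:
                i += 1
        else:
            out.append(byte)
            i += 1
    return bytes(out)
```

With this sketch, (\0053) decodes to the two bytes 005 (octal) and the digit 3, while (\053) and (\53) both decode to a single plus sign.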

Hexadecimal Strings

Strings may also be written in hexadecimal form; this is useful for including arbitrary
binary data in a PDF file. A hexadecimal string is written as a sequence of hexadecimal
digits (0–9 and either A–F or a–f) enclosed within angle brackets (< and >):

<4E6F762073686D6F7A206B6120706F702E>

Each pair of hexadecimal digits deﬁnes one byte of the string. White-space char-

acters (such as space, tab, carriage return, line feed, and form feed) are ignored.

If the final digit of a hexadecimal string is missing (that is, if there is an odd number
of digits), the final digit is assumed to be 0. For example,

<901FA3>

is a 3-byte string consisting of the characters whose hexadecimal codes are 90, 1F, and A3,
but

<901FA>

is a 3-byte string containing the characters whose hexadecimal codes are 90, 1F, and A0.
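
The hexadecimal form is simple enough to decode in a few lines. The Python sketch below illustrates the two rules just stated (white space is ignored, and a missing final digit is taken to be 0); the name decode_hex_string is hypothetical.

```python
def decode_hex_string(token: bytes) -> bytes:
    """Return the bytes denoted by a hexadecimal string token such as b'<901FA3>'."""
    if len(token) < 2 or not (token.startswith(b"<") and token.endswith(b">")):
        raise ValueError("hexadecimal string must be enclosed in angle brackets")
    # Drop the white-space characters of Table 3.1.
    digits = bytes(b for b in token[1:-1] if b not in b"\x00\t\n\x0c\r ")
    if len(digits) % 2:
        digits += b"0"  # an odd number of digits implies a final 0
    # bytes.fromhex raises ValueError if a non-hexadecimal character remains.
    return bytes.fromhex(digits.decode("ascii"))
```

Under these rules the example string <4E6F762073686D6F7A206B6120706F702E> shown above decodes to the ASCII text "Nov shmoz ka pop."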

3.2.4 Name Objects

A name object is an atomic symbol uniquely deﬁned by a sequence of characters.

Uniquely deﬁned means that any two name objects deﬁned by the same sequence

of characters are identically the same object. Atomic means that a name has no

internal structure; although it is deﬁned by a sequence of characters, those char-

acters are not “elements” of the name.

A slash character (/) introduces a name. The slash is not part of the name itself, but a
prefix indicating that the following sequence of characters constitutes a name. There can
be no white-space characters between the slash and the first character in the name. The
name may include any regular characters, but not delimiter or white-space characters (see
Section 3.1, “Lexical Conventions”). Uppercase and lowercase letters are considered
distinct; /A and /a are different names. The following are examples of valid literal names:

/Name1

/ASomewhatLongerName

/A;Name_With-Various***Characters?

/1.2

/$$

/@pattern

/.notdef

Note: The token / (a slash followed by no regular characters) is a valid name.

In PDF 1.2 and higher, any character except null (character code 0) may be included in a
name by writing its 2-digit hexadecimal code, preceded by the number sign character (#);
see implementation notes 3 and 4 in Appendix H. This syntax is required in order to
represent any of the delimiter or white-space characters or the number sign character
itself; it is recommended but not required for characters whose codes are outside the range
33 (!) to 126 (~). The examples shown in Table 3.3 are valid literal names in PDF 1.2 and
higher.

TABLE 3.3 Examples of literal names using the # character

LITERAL NAME                 RESULT
/Adobe#20Green               Adobe Green
/PANTONE#205757#20CV         PANTONE 5757 CV
/paired#28#29parentheses     paired()parentheses
/The_Key_of_F#23_Minor       The_Key_of_F#_Minor
/A#42                        AB

The length of a name is subject to an implementation limit; see Appendix C. The limit
applies to the number of characters in the name’s internal representation. For example, the
name /A#20B has 4 characters (/, A, space, B), not 6.
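
The mapping from a written name to its internal representation can be sketched as follows. This Python fragment is illustrative only; decode_name is a hypothetical helper that returns the name without its leading slash, and it does not check that the remaining characters are all regular characters.

```python
import re

# A number sign followed by exactly two hexadecimal digits (PDF 1.2 and higher).
_HEX_ESCAPE = re.compile(rb"#([0-9A-Fa-f]{2})")


def decode_name(token: bytes) -> bytes:
    """Return the characters of a name token, with #xx sequences expanded."""
    if not token.startswith(b"/"):
        raise ValueError("a name must begin with a slash")
    # Each #xx sequence is replaced by the single byte it denotes; the
    # replaced text is not rescanned, so #23 yields a literal number sign.
    return _HEX_ESCAPE.sub(lambda m: bytes([int(m.group(1), 16)]), token[1:])
```

Because the helper drops the slash, decoding /A#20B gives three bytes (A, space, B); counting the slash as the text above does gives 4 characters.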

In PDF, name objects always begin with the slash character /, unlike keywords such as true,
false, and obj. This book follows a typographic convention of writing names in boldface
without the leading slash when they appear in running text and tables. For example, Type
and DecodeParms denote names that would actually be written in a PDF file (and in code
examples in this book) as /Type and /DecodeParms.

3.2.5 Array Objects

An array object is a one-dimensional collection of objects arranged sequentially.

Unlike arrays in many other computer languages, PDF arrays may be hetero-

geneous; that is, an array’s elements may be any combination of numbers, strings,

dictionaries, or any other objects, including other arrays. The number of

elements in an array is subject to an implementation limit; see Appendix C.

An array is written as a sequence of objects enclosed in square brackets ([ and ]):

[549 3.14 false (Ralph) /SomeName]

PDF directly supports only one-dimensional arrays. Arrays of higher dimension

can be constructed by using arrays as elements of arrays, nested to any depth.

3.2.6 Dictionary Objects

A dictionary object is an associative table containing pairs of objects, known as the
dictionary’s entries. The first element of each entry is the key and the second element is
the value. The key must be a name (unlike dictionary keys in PostScript, which may be
objects of any type). The value can be any kind of object, including another dictionary. A
dictionary entry whose value is null (see Section 3.2.8, “The Null Object”) is equivalent
to an absent entry. (Note that this differs from PostScript, where null behaves like any
other object as the value of a dictionary entry.) The number of entries in a dictionary is
subject to an implementation limit; see Appendix C.

Note: No two entries in the same dictionary should have the same key. If a key does

appear more than once, its value is undeﬁned.

A dictionary is written as a sequence of key-value pairs enclosed in double angle brackets
(<< and >>). For example:

<< /Type /Example
   /Subtype /DictionaryExample
   /Version 0.01
   /IntegerItem 12
   /StringItem (a string)
   /Subdictionary << /Item1 0.4
                     /Item2 true
                     /LastItem (not!)
                     /VeryLastItem (OK)
                  >>
>>

Note: Do not confuse the double angle brackets with single angle brackets (< and >), which
delimit a hexadecimal string (see “Hexadecimal Strings” in Section 3.2.3, “String
Objects”).

Dictionary objects are the main building blocks of a PDF document. They are commonly used
to collect and tie together the attributes of a complex object, such as a font or a page of
the document, with each entry in the dictionary specifying the name and value of an
attribute. By convention, the Type entry of such a dictionary identifies the type of object
the dictionary describes. In some cases, a Subtype entry (sometimes abbreviated S) is used
to further identify a specialized subcategory of the general type. The value of the Type or
Subtype entry is always a name. For example, in a font dictionary, the value of the Type
entry is always Font, whereas that of the Subtype entry may be Type1, TrueType, or one of
several other values.

The value of the Type entry can almost always be inferred from context. The operand of the
Tf operator, for example, must be a font object, so the Type entry in a font dictionary
serves primarily as documentation and as information for error checking. The Type entry is
not required unless so stated in its description; however, if the entry is present, it must
have the correct value. In addition, the value of the Type entry in any dictionary, even in
private data, must be either a name defined in this book or a registered name; see
Appendix E for details.

3.2.7 Stream Objects

A stream object, like a string object, is a sequence of bytes. However, a PDF appli-

cation can read a stream incrementally, while a string must be read in its entirety.

Furthermore, a stream can be of unlimited length, whereas a string is subject to

an implementation limit. For this reason, objects with potentially large amounts

of data, such as images and page descriptions, are represented as streams.

Note: As with strings, this section describes only the syntax for writing a stream as a

sequence of bytes. What those bytes represent is determined by the context in which

the stream is referenced.

A stream consists of a dictionary that describes a sequence of bytes, followed by zero or
more lines of bytes bracketed between the keywords stream and endstream:

dictionary

stream

… Zero or more lines of bytes …

endstream

All streams must be indirect objects (see Section 3.2.9, “Indirect Objects”) and the stream
dictionary must be a direct object. The keyword stream that follows the stream dictionary
should be followed by either a carriage return and a line feed or by just a line feed, and
not by a carriage return alone. The sequence of bytes that make up a stream lie between the
stream and endstream keywords, or, in PDF 1.2, may be contained in an external file. If the
data is in an external file, the stream dictionary specifies the file, and any bytes
between stream and endstream are ignored. (See implementation note 5 in Appendix H.)

Note: Without the restriction against following the keyword stream by a carriage return
alone, it would be impossible to differentiate a stream that uses carriage return as its
end-of-line marker and has a line feed as its first byte of data from one that uses a
carriage return–line feed sequence to denote end-of-line.

Table 3.4 lists the entries common to all stream dictionaries; certain types of

stream may have additional dictionary entries, as indicated where those streams

are described. The optional entries regarding ﬁlters for the stream indicate wheth-

er and how the data in the stream must be transformed (“decoded”) before it is

used.

TABLE 3.4 Entries common to all stream dictionaries

KEY            TYPE                   VALUE

Length         integer                (Required) The number of bytes from the beginning of the
                                      line following the keyword stream to the last byte just
                                      before the keyword endstream. (There may be an additional
                                      EOL marker, preceding endstream, that is not included in
                                      the count and is not logically part of the stream data.)

Filter         name or array          (Optional) The name of a filter to be applied in
                                      processing the stream data found between the keywords
                                      stream and endstream, or an array of such names. Multiple
                                      filters should be specified in the order in which they
                                      are to be applied.

DecodeParms    dictionary or array    (Optional) A parameter dictionary, or an array of such
                                      dictionaries, used by the filters specified by Filter. If
                                      there is only one filter and that filter has parameters,
                                      DecodeParms must be set to the filter’s parameter
                                      dictionary unless all the filter’s parameters have their
                                      default values, in which case the DecodeParms entry may
                                      be omitted. If there are multiple filters and any of the
                                      filters has parameters set to nondefault values,
                                      DecodeParms must be an array with one entry for each
                                      filter: either the parameter dictionary for that filter,
                                      or the null object if that filter has no parameters (or
                                      if all of its parameters have their default values). If
                                      none of the filters have parameters, or if all their
                                      parameters have default values, the DecodeParms entry may
                                      be omitted. (See implementation note 6 in Appendix H.)

F              file specification     (Optional; PDF 1.2) The file containing the stream data.
                                      If this entry is present, the bytes between stream and
                                      endstream are ignored, the filters are specified by
                                      FFilter rather than Filter, and the filter parameters are
                                      specified by FDecodeParms rather than DecodeParms.
                                      However, the Length entry should still specify the number
                                      of those bytes. (Usually there are no bytes and Length
                                      is 0.)

FFilter        name or array          (Optional; PDF 1.2) The name of a filter to be applied in
                                      processing the data found in the stream’s external file,
                                      or an array of such names. The same rules apply as for
                                      Filter.

FDecodeParms   dictionary or array    (Optional; PDF 1.2) A parameter dictionary, or an array
                                      of such dictionaries, used by the filters specified by
                                      FFilter. The same rules apply as for DecodeParms.

Stream Extent

Every stream dictionary has a Length entry that indicates how many bytes of the PDF file
are used for the stream’s data. (If the stream has a filter, Length is the number of bytes
of encoded data.) In addition, most filters are defined so that the data is self-limiting;
that is, they use an encoding scheme in which an explicit end-of-data (EOD) marker delimits
the extent of the data. Finally, streams are used to represent many objects from whose
attributes a length can be inferred. All of these constraints must be consistent.

For example, an image with 10 rows and 20 columns, using a single color component and 8
bits per component, requires exactly 200 bytes of image data. If the stream uses a filter,
there must be enough bytes of encoded data in the PDF file to produce those 200 bytes. An
error occurs if Length is too small, if an explicit EOD marker occurs too soon, or if the
decoded data does not contain 200 bytes.

It is also an error if the stream contains too much data, with the exception that there may
be an extra end-of-line marker in the PDF file before the keyword endstream.
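
The 200-byte figure in the image example follows from 10 rows × 20 columns × 1 component × 8 bits = 1600 bits. The Python sketch below reproduces that arithmetic; expected_image_bytes is a hypothetical helper, and the rounding of each row up to a whole number of bytes is an assumption taken from the way sampled images are usually packed rather than from this section.

```python
def expected_image_bytes(rows: int, columns: int,
                         components: int, bits_per_component: int) -> int:
    """Number of decoded bytes implied by an image's dimensions."""
    bits_per_row = columns * components * bits_per_component
    # Assumption: each row is padded to a whole number of bytes.
    bytes_per_row = (bits_per_row + 7) // 8
    return rows * bytes_per_row


def decoded_length_is_consistent(decoded: bytes, rows: int, columns: int,
                                 components: int, bits_per_component: int) -> bool:
    """True if the decoded stream data has exactly the length the image implies."""
    return len(decoded) == expected_image_bytes(
        rows, columns, components, bits_per_component)
```

A consumer could use a check of this kind after decoding to detect both of the error conditions described above: too little data and too much data.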

Filters

A ﬁlter is an optional part of the speciﬁcation of a stream, indicating how the data

in the stream must be decoded before it is used. For example, if a stream has an

ASCIIHexDecode ﬁlter, an application reading the data in that stream will trans-

form the ASCII hexadecimal-encoded data in the stream into binary data.

An application program that produces a PDF ﬁle can encode certain information

(for example, data for sampled images) to compress it or to convert it to a port-

able ASCII representation. Then an application that reads (“consumes”) the PDF

ﬁle can invoke the corresponding decoding ﬁlter to convert the information back

to its original form.

The filter or filters for a stream are specified by the Filter key in the stream’s
dictionary (or the FFilter key if the stream is external). Filters can be cascaded to form
a pipeline that passes the stream through two or more decoding transformations in sequence.
For example, data encoded using LZW and ASCII base-85 encoding (in that order) can be
decoded using the following entry in the stream dictionary:

/Filter [/ASCII85Decode /LZWDecode]

Some filters may take parameters to control how they operate. These optional parameters are
specified by the DecodeParms entry in the stream’s dictionary (or the FDecodeParms entry if
the stream is external).
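
The order in which cascaded filters are applied can be illustrated with a short Python sketch. It is a sketch under stated assumptions, not a complete decoder: the names ascii_hex_decode, DECODERS, and decode_stream are hypothetical, only ASCIIHexDecode and FlateDecode are handled (the Python standard library has no LZW decoder, so FlateDecode stands in for LZWDecode here), and any DecodeParms are ignored. The > end-of-data marker used for ASCIIHexDecode is described in Section 3.3.

```python
import zlib
from typing import List, Union


def ascii_hex_decode(data: bytes) -> bytes:
    """Decode ASCII hexadecimal data, stopping at the > end-of-data marker."""
    end = data.find(b">")
    if end != -1:
        data = data[:end]
    digits = bytes(b for b in data if b not in b"\x00\t\n\x0c\r ")
    if len(digits) % 2:
        digits += b"0"  # an odd number of digits implies a final 0
    return bytes.fromhex(digits.decode("ascii"))


# Filter names are written without the leading slash, as in running text.
DECODERS = {
    "ASCIIHexDecode": ascii_hex_decode,
    "FlateDecode": zlib.decompress,
}


def decode_stream(data: bytes, filters: Union[str, List[str]]) -> bytes:
    """Apply the named filters in the order given by the Filter entry."""
    names = [filters] if isinstance(filters, str) else list(filters)
    for name in names:
        if name not in DECODERS:
            raise NotImplementedError(f"filter {name} is not handled by this sketch")
        data = DECODERS[name](data)
    return data
```

Note that the decoding order is the reverse of the encoding order: data that was compressed first and then converted to ASCII must list the ASCII filter first in the Filter array.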

Standard Filters

PDF supports a standard set of ﬁlters that fall into two main categories:

• ASCII ﬁlters enable arbitrary 8-bit binary data to be represented in the print-

able subset of the ASCII character set. (See Section 3.1, “Lexical Conventions,”

for an explanation of why this might be useful. Note that ASCII ﬁlters serve no

useful purpose in a PDF ﬁle that is encrypted; see Section 3.5, “Encryption.”)

• Decompression ﬁlters enable data to be represented in a compressed form. Com-

pression is particularly valuable for large sampled images, since it reduces stor-

age requirements and transmission time. Note that the compressed data is

always in 8-bit binary format, even if the original data happens to be ASCII

text.

These ﬁlters are summarized in Table 3.5, which also indicates whether they

accept any optional parameters. The ﬁlters and their parameters (if any) are

described further in Section 3.3, “Details of Filtered Streams.” (See also imple-

mentation notes 7 and 8 in Appendix H.)

Example Encoded Stream

Example 3.1 shows a stream, containing the marking instructions for a page, that

was compressed using the LZW compression method and then encoded in ASCII

base-85 representation. Example 3.2 shows the same stream without any encod-

ing. (The stream’s contents are explained in Section 3.7.1, “Content Streams,” and

the operators used there are further described in Chapter 5.)

TABLE 3.5 Standard filters

FILTER NAME       PARAMETERS?   DESCRIPTION

ASCIIHexDecode    no            Decodes data encoded in an ASCII hexadecimal representation,
                                reproducing the original binary data.

ASCII85Decode     no            Decodes data encoded in an ASCII base-85 representation,
                                reproducing the original binary data.

LZWDecode         yes           Decompresses data encoded using the LZW (Lempel-Ziv-Welch)
                                adaptive compression method, reproducing the original text or
                                binary data.

FlateDecode       yes           (PDF 1.2) Decompresses data encoded using the public-domain
                                zlib/deflate compression method, reproducing the original text
                                or binary data. (See implementation note 9 in Appendix H.)

RunLengthDecode   no            Decompresses data encoded using a byte-oriented run-length
                                encoding algorithm, reproducing the original text or binary
                                data (typically monochrome image data, or any data that
                                contains frequent long runs of a single byte value).

CCITTFaxDecode    yes           Decompresses data encoded using the CCITT facsimile standard,
                                reproducing the original data (typically monochrome image data
                                at 1 bit per pixel).

DCTDecode         yes           Decompresses data encoded using a DCT (discrete cosine
                                transform) technique based on the JPEG standard, reproducing
                                image sample data that approximates the original data.

Example 3.1

<< /Length 534

/Filter [/ASCII85Decode /LZWDecode]

>>

stream

J..)6T`?p&<!J9%_[umg"B7/Z7KNXbN'S+,*Q/&"OLT'F

LIDK#!n`$"<Atdi`\Vn%b%)&'cA*VnK\CJY(sF>c!Jnl@

RM]WM;jjH6Gnc75idkL5]+cPZKEBPWdR>FF(kj1_R%W_d

&/jS!;iuad7h?[L-F$+]]0A3Ck*$I0KZ?;<)CJtqi65Xb

Vc3\n5ua:Q/=0$W<#N3U;H,MQKqfg1?:lUpR;6oN[C2E4

ZNr8Udn.'p+?#X+1>0Kuk$bCDF/(3fL5]Oq)^kJZ!C2H1

'TO]Rl?Q:&'<5&iP!$Rq;BXRecDN[IJB`,)o8XJOSJ9sD

S]hQ;Rj@!ND)bD_q&C\g:inYC%)&u#:u,M6Bm%IY!Kb1+

":aAa'S`ViJglLb8<W9k6Yl\\0McJQkDeLWdPN?9A'jX*

al>iG1p&i;eVoK&juJHs9%;Xomop"5KatWRT"JQ#qYuL,

JD?M$0QP)lKn06l1apKDC@\qJ4B!!(5m+j.7F790m(Vj8

8l8Q:_CZ(Gm1%X\N1&u!FKHMB~>

endstream

Example 3.2

<< /Length 568 >>
stream
/F1 12 Tf
0 Tc
0 Tw
72.5 712 TD
[(Unencoded streams can be read easily) 65 (, )] TJ
0 -14 TD
[(b) 20 (ut generally tak) 10 (e more space than \311)] TJ
T* (encoded streams.) Tj
0 -28 TD
[(Se) 25 (v) 15 (eral encoding methods are a) 20 (v) 25 (ailable in PDF) 80 (.)] TJ
0 -14 TD
(Some are used for compression and others simply) Tj
T* [(to represent binary data in an ) 55 (ASCII format.)] TJ
T* (Some of the compression encoding methods are \
suitable ) Tj
T* (for both data and images, while others are \
suitable only ) Tj
T* (for continuous-tone images.) Tj
endstream

3.2.8 The Null Object

The null object is used to fill empty or uninitialized positions in an array or dictionary.
There is only one object of type null, denoted by the keyword null. As noted in
Section 3.2.6, “Dictionary Objects,” specifying the null object as the value of a
dictionary entry is equivalent to omitting the entry entirely.

3.2.9 Indirect Objects

Any object in a PDF ﬁle may be labeled as an indirect object. This gives the object

a unique object identiﬁer by which other objects can refer to it (for example, as an

element of an array or as the value of a dictionary entry). The object identiﬁer

consists of two parts:

• A positive integer object number. Indirect objects are often numbered sequen-

tially within a PDF ﬁle, but this is not required; object numbers may be as-

signed in any arbitrary order.

• A nonnegative integer generation number. In a newly created ﬁle, all indirect

objects have generation numbers of 0. Nonzero generation numbers may be in-

troduced when the ﬁle is later updated; see Sections 3.4.3, “Cross-Reference

Table,” and 3.4.5, “Incremental Updates.”

Together, the combination of an object number and a generation number

uniquely identiﬁes an indirect object. The object retains the same object number

and generation number throughout its existence, even if its value is modiﬁed.

The definition of an indirect object in a PDF file consists of its object number and generation number, followed by the value of the object itself bracketed between the keywords obj and endobj. For example, the definition

12 0 obj

(Brillig)

endobj

defines an indirect string object with an object number of 12, a generation number of 0, and the value Brillig. The object can then be referred to from elsewhere in the file by an indirect reference consisting of the object number, the generation number, and the keyword R:

12 0 R

An indirect reference to an undeﬁned object is not an error; it is simply treated as

a reference to the null object. For example, if a ﬁle contains the indirect reference

17 0 R

but does not contain the corresponding deﬁnition

17 0 obj

…

endobj

then the indirect reference is considered to refer to the null object.
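The lookup rule described above can be sketched in a few lines of Python. This is only an illustration under assumed names: Ref, resolve, and the objects dictionary keyed by (object number, generation number) are hypothetical and not part of PDF, and Python's None stands in for the null object.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Ref:
    """Hypothetical stand-in for an indirect reference such as `12 0 R`."""
    num: int  # object number
    gen: int  # generation number


def resolve(value, objects):
    """Return the object that a value denotes.

    `objects` maps (object number, generation number) to the defined value.
    A reference with no matching definition yields None (the null object)
    rather than raising an error.
    """
    if isinstance(value, Ref):
        return objects.get((value.num, value.gen))
    return value  # direct objects denote themselves


# Only object 12 is defined, as in `12 0 obj (Brillig) endobj`.
objects = {(12, 0): "Brillig"}
brillig = resolve(Ref(12, 0), objects)
missing = resolve(Ref(17, 0), objects)   # undefined, so it resolves to None
```

A real reader would build the objects mapping from the cross-reference table rather than by hand.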

Note: In the data structures that make up a PDF document, certain values are re-

quired to be speciﬁed as indirect object references. Except where this is explicitly

called out, any object (other than a stream) may be speciﬁed either directly or as an

indirect object reference; the semantics are entirely equivalent. Note in particular

that content streams, which deﬁne the visible contents of the document, may not con-

tain indirect references (see Section 3.7.1, “Content Streams”).

Example 3.3 shows the use of an indirect object to specify the length of a stream. The value of the stream’s Length entry is an integer object that follows the stream itself in the file. This allows applications that generate PDF in a single pass to defer specifying the stream’s length until after its contents have been generated.

Example 3.3

7 0 obj

<< /Length 8 0 R >> % An indirect reference to object 8

stream

/F1 12 Tf

72 712 Td

(A stream with an indirect length) Tj

endstream

endobj

8 0 obj

77 % The length of the preceding stream

endobj
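The single-pass technique of Example 3.3 might be implemented along the following lines. This is a sketch, not a complete PDF writer: the function name and its parameters are hypothetical, generation numbers are fixed at 0, and it assumes that the end-of-line marker written before endstream is not counted in the length.

```python
import io


def write_stream_deferred_length(out, stream_num, length_num, chunks):
    """Write a stream object whose Length is a later indirect object."""
    out.write(b"%d 0 obj\n<< /Length %d 0 R >>\nstream\n" % (stream_num, length_num))
    length = 0
    for chunk in chunks:          # contents are written and counted as produced
        out.write(chunk)
        length += len(chunk)
    out.write(b"\nendstream\nendobj\n")
    # Only now is the byte count known, so the integer object follows the stream.
    out.write(b"%d 0 obj\n%d\nendobj\n" % (length_num, length))
    return length


buf = io.BytesIO()
chunks = [b"/F1 12 Tf\n", b"72 712 Td\n", b"(A stream with an indirect length) Tj"]
written = write_stream_deferred_length(buf, 7, 8, chunks)
```

The generator never seeks backward in the output, which is the point of deferring the length.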

3.3 Details of Filtered Streams

Stream ﬁlters are introduced under “Filters” on page 36. This section describes

the semantics of ﬁlters in more detail, including speciﬁcations of encoding algo-

rithms for some ﬁlters.

3.3.1 ASCIIHexDecode Filter

The ASCIIHexDecode ﬁlter decodes data that has been encoded in ASCII hexa-

decimal form. ASCII hexadecimal encoding and ASCII base-85 encoding

(described in the next section) convert binary data, such as image data, to 7-bit

ASCII characters. In general, ASCII base-85 encoding is preferred to ASCII hexa-

decimal encoding because it is more compact: it expands the data by a factor of

4:5, compared with 1:2 for ASCII hexadecimal encoding.

For each pair of ASCII hexadecimal digits (0–9 and A–F or a–f), the ASCIIHexDecode filter produces one byte of binary data. All white-space characters (see Section 3.1, “Lexical Conventions”) are ignored. A right angle bracket character (>) indicates EOD. Any other characters will cause an error. If the filter encounters the EOD marker after reading an odd number of hexadecimal digits, it will behave as if a 0 followed the last digit.
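The decoding rules in this section are simple enough to sketch directly. The following Python function is illustrative only; its name is hypothetical, and the white-space set is assumed to be the six characters that Section 3.1 treats as white space (null, tab, line feed, form feed, carriage return, and space).

```python
WHITESPACE = "\x00\t\n\x0c\r "
HEX_DIGITS = "0123456789abcdefABCDEF"


def ascii_hex_decode(text: str) -> bytes:
    """Decode ASCII hexadecimal data terminated by the '>' EOD marker."""
    digits = []
    for ch in text:
        if ch == ">":                 # EOD: stop reading
            break
        if ch in WHITESPACE:          # white space is ignored
            continue
        if ch not in HEX_DIGITS:      # anything else is an error
            raise ValueError("invalid character %r" % ch)
        digits.append(ch)
    else:
        raise ValueError("missing EOD marker '>'")
    if len(digits) % 2:               # odd count: behave as if a 0 followed
        digits.append("0")
    return bytes.fromhex("".join(digits))


hello = ascii_hex_decode("48 65 6C6c\n6F>")
odd = ascii_hex_decode("7>")
```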

3.3.2 ASCII85Decode Filter

The ASCII85Decode filter decodes data that has been encoded in ASCII base-85 encoding and produces binary data. The following paragraphs describe the process for encoding binary data in ASCII base-85; the ASCII85Decode filter reverses this process.

The ASCII base-85 encoding uses the characters ! through u and the character z, with the 2-character sequence ~> as its EOD marker. The ASCII85Decode filter ignores all white-space characters (see Section 3.1, “Lexical Conventions”). Any other characters, and any character sequences that represent impossible combinations in the ASCII base-85 encoding, will cause an error.

Specifically, ASCII base-85 encoding produces 5 ASCII characters for every 4 bytes of binary data. Each group of 4 binary input bytes, $(b_1\ b_2\ b_3\ b_4)$, is converted to a group of 5 output bytes, $(c_1\ c_2\ c_3\ c_4\ c_5)$, using the relation

$$(b_1 \times 256^3) + (b_2 \times 256^2) + (b_3 \times 256^1) + b_4 = (c_1 \times 85^4) + (c_2 \times 85^3) + (c_3 \times 85^2) + (c_4 \times 85^1) + c_5$$

In other words, 4 bytes of binary data are interpreted as a base-256 number and then converted into a base-85 number. The five “digits” of the base-85 number are then converted to ASCII characters by adding 33 (the ASCII code for the character !) to each. The resulting encoded data contains only printable ASCII characters with codes in the range 33 (!) to 117 (u). As a special case, if all five digits are 0, they are represented by the character with code 122 (z) instead of by five exclamation points (!!!!!).

If the length of the binary data to be encoded is not a multiple of 4 bytes, the last, partial group of 4 is used to produce a last, partial group of 5 output characters. Given n (1, 2, or 3) bytes of binary data, the encoder first appends 4 − n zero bytes to make a complete group of 4. It then encodes this group in the usual way, but without applying the special z case. Finally, it writes only the first n + 1 characters of the resulting group of 5. These characters are immediately followed by the ~> EOD marker.

The following conditions (which never occur in a correctly encoded byte sequence) will cause errors during decoding:

• The value represented by a group of 5 characters is greater than $2^{32} - 1$.

• A z character occurs in the middle of a group.

• A final partial group contains only one character.
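The encoding and decoding rules of this section can be put together as follows. This is a minimal sketch rather than a production filter: the function names are hypothetical, the white-space set is assumed from Section 3.1, and padding a final partial group with u characters on decoding is one common way of recovering the truncated bytes, not something this section prescribes.

```python
WHITESPACE = "\x00\t\n\x0c\r "


def a85_encode(data: bytes) -> str:
    out = []
    for i in range(0, len(data), 4):
        group = data[i:i + 4]
        n = len(group)
        # Interpret the (zero-padded) group as a base-256 number.
        value = int.from_bytes(group.ljust(4, b"\x00"), "big")
        digits = []
        for _ in range(5):            # convert to five base-85 digits
            value, d = divmod(value, 85)
            digits.append(chr(d + 33))
        chars = "".join(reversed(digits))
        if n == 4 and chars == "!!!!!":
            out.append("z")           # special case: all five digits are 0
        else:
            out.append(chars[:n + 1]) # a partial group keeps n + 1 characters
    return "".join(out) + "~>"


def _group_to_bytes(digits):
    value = 0
    for d in digits:
        value = value * 85 + d
    if value > 0xFFFFFFFF:
        raise ValueError("group value exceeds 2**32 - 1")
    return value.to_bytes(4, "big")


def a85_decode(text: str) -> bytes:
    out, group, i = bytearray(), [], 0
    while True:
        if i >= len(text):
            raise ValueError("missing EOD marker '~>'")
        if text.startswith("~>", i):
            break
        ch = text[i]
        i += 1
        if ch in WHITESPACE:
            continue
        if ch == "z":
            if group:
                raise ValueError("'z' in the middle of a group")
            out += bytes(4)
        elif "!" <= ch <= "u":
            group.append(ord(ch) - 33)
            if len(group) == 5:
                out += _group_to_bytes(group)
                group = []
        else:
            raise ValueError("invalid character %r" % ch)
    if len(group) == 1:
        raise ValueError("final partial group has only one character")
    if group:
        k = len(group)                # k characters carry k - 1 bytes
        out += _group_to_bytes(group + [84] * (5 - k))[:k - 1]
    return bytes(out)


encoded = a85_encode(b"Man ")
```

The three error conditions listed above correspond to the three ValueError cases raised for group values, a misplaced z, and a one-character final group.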

3.3.3 LZWDecode and FlateDecode Filters

The LZWDecode and (in PDF 1.2) FlateDecode filters have much in common and so are discussed together in this section. They decode data that has been encoded using the LZW or Flate data compression method, respectively.

• LZW (Lempel-Ziv-Welch) is a variable-length, adaptive compression method that has been adopted as one of the standard compression methods in the Tag Image File Format (TIFF) standard. Details on LZW encoding follow in the next section.

• The Flate method is based on the public-domain zlib/deflate compression method, which is a variable-length Lempel-Ziv adaptive compression method cascaded with adaptive Huffman coding. It is fully defined in Internet RFCs 1950, ZLIB Compressed Data Format Specification, and 1951, DEFLATE Compressed Data Format Specification (see the Bibliography).

Both of these methods compress either binary data or ASCII text but (like all

compression methods) always produce binary data, even if the original data was

text.

The LZW and Flate compression methods can discover and exploit many patterns in the input data, whether the data is text or images. As described later, both filters support optional transformation by a predictor function, which improves the compression of sampled image data. Thanks to its cascaded adaptive Huffman coding, Flate-encoded output is usually much more compact than LZW-encoded output for the same input. Flate and LZW decoding speeds are comparable, but Flate encoding is considerably slower than LZW encoding.

Usually, both Flate and LZW encodings compress their input substantially. However, in the worst case (in which no pair of adjacent characters appears twice), Flate encoding expands its input by no more than 11 bytes or a factor of 1.003 (whichever is larger), plus the effects of algorithm tags added by PNG predictors. For LZW encoding, the best case (all zeros) provides a compression approaching 1365:1 for long files, but the worst-case expansion is at least a factor of 1.125, which can increase to nearly 1.5 in some implementations (plus the effects of PNG tags as with Flate encoding).
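Because Flate data is in the zlib format of RFC 1950, a general-purpose zlib library can stand in for the filter when no predictor is used. The following sketch uses Python's standard zlib module to compare a highly compressible input with an incompressible one. The function names and the test inputs are illustrative assumptions, and the exact sizes depend on the zlib version and settings.

```python
import random
import zlib


def flate_encode(data: bytes) -> bytes:
    """Compress with zlib (RFC 1950 wrapper around RFC 1951 deflate)."""
    return zlib.compress(data, 9)


def flate_decode(data: bytes) -> bytes:
    return zlib.decompress(data)


zeros = bytes(100_000)                       # best case: one repeated byte
rng = random.Random(0)                       # fixed seed for repeatability
noise = bytes(rng.getrandbits(8) for _ in range(100_000))

ratio_zeros = len(zeros) / len(flate_encode(zeros))      # large compression
growth_noise = len(flate_encode(noise)) / len(noise)     # slight expansion at most
```

For the random input, the encoder is expected to fall back to storing the data nearly verbatim, so the growth factor stays close to 1.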

Details of LZW Encoding

Data encoded using the LZW compression method consists of a sequence of

codes that are 9 to 12 bits long. Each code represents a single character of input

data (0–255), a clear-table marker (256), an EOD marker (257), or a table entry

representing a multiple-character sequence that has been encountered previously

in the input (258 or greater).

Initially, the code length is 9 bits and the LZW table contains only entries for the

258 ﬁxed codes. As encoding proceeds, entries are appended to the table, asso-

ciating new codes with longer and longer sequences of input characters. The en-

coder and the decoder maintain identical copies of this table.

Whenever both the encoder and the decoder independently (but synchronously)

realize that the current code length is no longer sufﬁcient to represent the num-

ber of entries in the table, they increase the number of bits per code by 1. The ﬁrst

output code that is 10 bits long is the one following the creation of table entry

511, and similarly for 11 (1023) and 12 (2047) bits. Codes are never longer than

12 bits, so entry 4095 is the last entry of the LZW table.

The encoder executes the following sequence of steps to generate each output

code:

1. Accumulate a sequence of one or more input characters matching a sequence

already present in the table. For maximum compression, the encoder looks for

the longest such sequence.

2. Emit the code corresponding to that sequence.

3. Create a new table entry for the ﬁrst unused code. Its value is the sequence

found in step 1 followed by the next input character.

For example, suppose the input consists of the following sequence of ASCII char-

acter codes:

45 45 45 45 45 65 45 45 45 66

Starting with an empty table, the encoder proceeds as shown in Table 3.6.

TABLE 3.6 Typical LZW encoding sequence

INPUT SEQUENCE    OUTPUT CODE          CODE ADDED TO TABLE    SEQUENCE REPRESENTED BY NEW CODE
–                 256 (clear-table)    –                      –
45                45                   258                    45 45
45 45             258                  259                    45 45 45
45 45             258                  260                    45 45 65
65                65                   261                    65 45
45 45 45          259                  262                    45 45 45 66
66                66                   –                      –
–                 257 (EOD)            –                      –

Codes are packed into a continuous bit stream, high-order bit first. This stream is then divided into 8-bit bytes, high-order bit first. Thus, codes can straddle byte boundaries arbitrarily. After the EOD marker (code value 257), any leftover bits in the final byte are set to 0.

In the example above, all the output codes are 9 bits long; they would pack into bytes as follows (represented in hexadecimal):

80 0B 60 50 22 0C 0C 85 01

To adapt to changing input sequences, the encoder may at any point issue a clear-

table code, which causes both the encoder and the decoder to restart with initial

tables and a 9-bit code length. By convention, the encoder begins by issuing a

clear-table code. It must issue a clear-table code when the table becomes full; it

may do so sooner.
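The encoding steps, the bit packing, and the code-length rule described above can be sketched together in Python. Everything here is an illustrative assumption rather than a reference implementation: the function names are hypothetical, the early_change argument mirrors the EarlyChange filter parameter, and the encoder clears the table slightly before it is completely full, which the text permits. For the ten-character input of the example, the sketch emits the codes 256, 45, 258, 258, 65, 259, 66, 257; note that the final input character 66 is emitted as its own code before EOD.

```python
CLEAR, EOD = 256, 257


def lzw_codes(data: bytes, early_change: int = 1):
    """Yield (code, bit_width) pairs, starting with a clear-table code."""
    def fresh():
        return {bytes([i]): i for i in range(256)}, 258, 9

    table, next_code, width = fresh()
    yield CLEAR, width
    seq = b""
    for i in range(len(data)):
        ch = data[i:i + 1]
        if seq + ch in table:            # step 1: extend the longest match
            seq += ch
            continue
        yield table[seq], width          # step 2: emit the code for the match
        table[seq + ch] = next_code      # step 3: new entry = match + next char
        next_code += 1
        seq = ch
        if next_code >= 4095:            # clear a little before the table fills
            yield CLEAR, width
            table, next_code, width = fresh()
        elif next_code - 1 + early_change == 1 << width:
            width += 1                   # e.g. after entry 511 when early_change is 1
    if seq:
        yield table[seq], width
        # The decoder adds one more entry after this code; stay in step with it.
        if next_code + early_change == 1 << width:
            width += 1
    yield EOD, width


def pack_codes(pairs) -> bytes:
    """Pack codes high-order bit first; leftover bits in the last byte are 0."""
    bitbuf = nbits = 0
    out = bytearray()
    for code, width in pairs:
        bitbuf = (bitbuf << width) | code
        nbits += width
        while nbits >= 8:
            nbits -= 8
            out.append((bitbuf >> nbits) & 0xFF)
        bitbuf &= (1 << nbits) - 1
    if nbits:
        out.append((bitbuf << (8 - nbits)) & 0xFF)
    return bytes(out)


def lzw_decode(data: bytes, early_change: int = 1) -> bytes:
    """Invert pack_codes(lzw_codes(...)); raises ValueError on malformed input."""
    def fresh():
        return [bytes([i]) for i in range(256)] + [b"", b""], 9

    table, width = fresh()
    out, prev = bytearray(), None
    bitbuf = nbits = pos = 0
    while True:
        if width < 12 and len(table) + early_change >= 1 << width:
            width += 1
        while nbits < width:             # refill, high-order bit first
            if pos == len(data):
                raise ValueError("data ended before the EOD code")
            bitbuf = (bitbuf << 8) | data[pos]
            pos += 1
            nbits += 8
        nbits -= width
        code = bitbuf >> nbits
        bitbuf &= (1 << nbits) - 1
        if code == EOD:
            return bytes(out)
        if code == CLEAR:
            (table, width), prev = fresh(), None
            continue
        if code < len(table):
            entry = table[code]
        elif code == len(table) and prev is not None:
            entry = prev + prev[:1]      # code names the entry being created
        else:
            raise ValueError("code %d is not in the table" % code)
        if prev is not None and len(table) < 4096:
            table.append(prev + entry[:1])
        out += entry
        prev = entry


sample = bytes([45, 45, 45, 45, 45, 65, 45, 45, 45, 66])
codes = [code for code, _ in lzw_codes(sample)]
packed = pack_codes(lzw_codes(sample))
```

The decoder builds each table entry one code later than the encoder does, which is why its code-length test looks at the entry it is about to create rather than the one it just created.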

Note: The LZW compression method is the subject of United States patent number

4,558,302 and corresponding foreign patents owned by the Unisys Corporation.

Adobe Systems has licensed this patent for use in its Acrobat products; however, inde-

pendent software vendors (ISVs) may be required to license this patent directly from

Unisys to develop software that uses the LZW method to compress data in PDF ﬁles.

For information on Unisys licensing policies, send e-mail to <[email protected]>;

or visit the Unisys Web site at <http://www.unisys.com>.

LZWDecode and FlateDecode Parameters

The LZWDecode and FlateDecode ﬁlters accept optional parameters to control

the decoding process. Most of these parameters are related to techniques that re-

duce the size of compressed sampled images (rectangular arrays of color values,

described in Section 4.8, “Images”). For example, image data frequently changes

very little from sample to sample; subtracting the values of adjacent samples (a

process called differencing), and encoding the differences rather than the raw

sample values, can reduce the size of the output data. Furthermore, when the

image data contains several color components (red-green-blue or cyan-magenta-

yellow-black) per sample, taking the difference between the values of correspond-

ing components in adjacent samples, rather than between different color compo-

nents in the same sample, often reduces the output data size.

Table 3.7 shows the parameters that can optionally be specified for LZWDecode and FlateDecode filters. Except where otherwise noted, all values supplied to the decoding filter for any optional parameters must match those used when the data was encoded.

TABLE 3.7 Optional parameters for LZWDecode and FlateDecode filters

KEY TYPE VALUE

Predictor integer A code that selects the predictor algorithm, if any. If the value of this entry is 1, the filter assumes that the normal algorithm was used to encode the data, without prediction. If the value is greater than 1, the filter assumes that the data was differenced before being encoded, and Predictor selects the predictor algorithm. For more information regarding Predictor values greater than 1, see “LZW and Flate Predictor Functions,” below. Default value: 1.

Colors integer (Used only if Predictor is greater than 1) The number of interleaved color components per sample. Valid values are 1 to 4 in PDF 1.2 or earlier, and 1 or greater in PDF 1.3. Default value: 1.

BitsPerComponent integer (Used only if Predictor is greater than 1) The number of bits used to represent each color component in a sample. Valid values are 1, 2, 4, and 8. Default value: 8.

Columns integer (Used only if Predictor is greater than 1) The number of samples in each row. Default value: 1.

EarlyChange integer (LZWDecode only) An indication of when to increase the code length. If the value of this entry is 0, code length increases are postponed as long as possible. If it is 1, they occur one code early. This parameter is included because LZW sample code distributed by some vendors increases the code length one code earlier than necessary. Default value: 1.

LZW and Flate Predictor Functions

LZW and Flate encoding compress more compactly if their input data is highly

predictable. One way of increasing the predictability of many continuous-tone

sampled images is to replace each sample with the difference between that sample

and a predictor function applied to earlier neighboring samples. If the predictor

function works well, the postprediction data will cluster toward 0.

Two groups of predictor functions are supported. The ﬁrst, the TIFF group, con-

sists of the single function that is Predictor 2 in the TIFF standard. (In the TIFF

standard, Predictor 2 applies only to LZW compression, but here it applies to

Flate compression as well.) TIFF Predictor 2 predicts that each color component

of a sample will be the same as the corresponding color component of the sample

immediately to its left.

The second supported group of predictor functions, the PNG group, consists of

the “ﬁlters” of the World Wide Web Consortium’s Portable Network Graphics

recommendation, documented in Internet RFC 2083, PNG (Portable Network

Graphics) Speciﬁcation (see the Bibliography). The term predictors is used here in-

stead of ﬁlters to avoid confusion. There are ﬁve basic PNG predictor algorithms

(and a sixth that chooses the optimum predictor function separately for each

row):

None No prediction

Sub Predicts the same as the sample to the left

Up Predicts the same as the sample above

Average Predicts the average of the sample to the left and the sample above

Paeth A nonlinear function of the sample above, the sample to the left, and the sample to the upper left

The predictor algorithm to be used, if any, is indicated by the Predictor filter parameter (see Table 3.7), which can have any of the values listed in Table 3.8.

TABLE 3.8 Predictor values

VALUE MEANING

1 No prediction (the default value)

2 TIFF Predictor 2

10 PNG prediction (on encoding, PNG None on all rows)

11 PNG prediction (on encoding, PNG Sub on all rows)

12 PNG prediction (on encoding, PNG Up on all rows)

13 PNG prediction (on encoding, PNG Average on all rows)

14 PNG prediction (on encoding, PNG Paeth on all rows)

15 PNG prediction (on encoding, PNG optimum)

For LZWDecode and FlateDecode, a Predictor value greater than or equal to 10 merely indicates that a PNG predictor is in use; the specific predictor function used is explicitly encoded in the incoming data. The value of Predictor supplied by the decoding filter need not match the value used when the data was encoded if they are both greater than or equal to 10.

The two groups of predictor functions have some commonalities. Both assume

the following:

• Data is presented in order, from the top row to the bottom row and, within a

row, from left to right.

• A row occupies a whole number of bytes, rounded up if necessary.

• Samples and their components are packed into bytes from high-order to low-

order bits.

• All color components of samples outside the image (which are necessary for

predictions near the boundaries) are 0.

The predictor function groups also differ in signiﬁcant ways:

• The postprediction data for each PNG-predicted row begins with an explicit

algorithm tag, so different rows can be predicted with different algorithms to

improve compression. TIFF Predictor 2 has no such identiﬁer; the same algo-

rithm applies to all rows.

• The TIFF function group predicts each color component from the prior in-

stance of that component, taking into account the number of bits per com-

ponent and components per sample. In contrast, the PNG function group

predicts each byte of data as a function of the corresponding byte of one or

more previous image samples, regardless of whether there are multiple color

components in a byte or whether a single color component spans multiple

bytes. This can yield signiﬁcantly better speed at the cost of somewhat worse

compression.
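To make the two predictor groups concrete, the following Python sketch reverses each kind of prediction after LZW or Flate decoding. It rests on several assumptions that this section does not spell out: the per-row tag values 0 to 4 (None, Sub, Up, Average, Paeth) and the Paeth selection rule are taken from the PNG specification (RFC 2083), the function names are hypothetical, and the TIFF function handles only 8-bit components.

```python
def _paeth(left, up, upper_left):
    # Paeth rule as given in the PNG specification: choose the neighbor
    # closest to left + up - upper_left, preferring left, then up.
    p = left + up - upper_left
    pa, pb, pc = abs(p - left), abs(p - up), abs(p - upper_left)
    if pa <= pb and pa <= pc:
        return left
    return up if pb <= pc else upper_left


def png_unpredict(data, columns, colors=1, bits_per_component=8):
    """Undo PNG prediction; each row starts with its own algorithm tag."""
    bpp = max(1, (colors * bits_per_component + 7) // 8)      # bytes per sample
    row_len = (colors * bits_per_component * columns + 7) // 8
    prev = bytearray(row_len)          # samples outside the image are 0
    out = bytearray()
    for start in range(0, len(data), row_len + 1):
        tag = data[start]
        row = bytearray(data[start + 1:start + 1 + row_len])
        if len(row) != row_len:
            raise ValueError("truncated row")
        for i in range(row_len):       # byte-wise, regardless of component size
            left = row[i - bpp] if i >= bpp else 0
            up = prev[i]
            upper_left = prev[i - bpp] if i >= bpp else 0
            if tag == 0:
                pred = 0
            elif tag == 1:
                pred = left
            elif tag == 2:
                pred = up
            elif tag == 3:
                pred = (left + up) // 2
            elif tag == 4:
                pred = _paeth(left, up, upper_left)
            else:
                raise ValueError("unknown PNG predictor tag %d" % tag)
            row[i] = (row[i] + pred) & 0xFF
        out += row
        prev = row
    return bytes(out)


def tiff2_unpredict(data, columns, colors=1):
    """Undo TIFF Predictor 2 for 8-bit components (no per-row tag)."""
    row_len = columns * colors
    out = bytearray(data)
    for start in range(0, len(out), row_len):
        for i in range(start + colors, min(start + row_len, len(out))):
            out[i] = (out[i] + out[i - colors]) & 0xFF   # same component, sample to the left
    return bytes(out)


png_rows = png_unpredict(bytes([1, 10, 1, 1, 1, 2, 1, 1, 1, 1]), columns=4)
tiff_row = tiff2_unpredict(bytes([1, 2, 1, 1, 1, 1]), columns=3, colors=2)
```

The PNG routine works on bytes while the TIFF routine works on components, which reflects the last difference noted in the list above.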

3.3.4 RunLengthDecode Filter

The RunLengthDecode ﬁlter decodes data that has been encoded in a simple

byte-oriented format based on run length. The encoded data is a sequence of

runs, where each run consists of a length byte followed by 1 to 128 bytes of data. If

the length byte is in the range 0 to 127, the following length + 1 (1 to 128) bytes

are copied literally during decompression. If length is in the range 129 to 255, the

following single byte is to be copied 257 − length (2 to 128) times during decom-

pression. A length value of 128 denotes EOD.
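The run format just described can be decoded with a short loop. The Python function below is a sketch under assumed names; treating missing data or a missing EOD byte as an error is a choice made here, not a requirement stated in this section.

```python
def run_length_decode(data: bytes) -> bytes:
    out = bytearray()
    i = 0
    while i < len(data):
        length = data[i]
        i += 1
        if length == 128:                     # EOD
            return bytes(out)
        if length < 128:                      # copy length + 1 bytes literally
            n = length + 1
            chunk = data[i:i + n]
            if len(chunk) != n:
                raise ValueError("literal run is truncated")
            out += chunk
            i += n
        else:                                 # repeat one byte 257 - length times
            if i >= len(data):
                raise ValueError("repeat run is missing its data byte")
            out += data[i:i + 1] * (257 - length)
            i += 1
    raise ValueError("missing EOD (length byte 128)")


decoded = run_length_decode(bytes([2, 65, 66, 67, 254, 90, 128]))
```

In the usage line, the length byte 2 introduces three literal bytes and the length byte 254 repeats the following byte 257 − 254 = 3 times.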

The compression achieved by run-length encoding depends on the input data. In the best case (all zeros), a compression of approximately 64:1 is achieved for long files. The worst case (the hexadecimal sequence 00 alternating with FF) results in an expansion of 127:128.

3.3.5 CCITTFaxDecode Filter

The CCITTFaxDecode ﬁlter decodes image data that has been encoded using

either Group 3 or Group 4 CCITT facsimile (fax) encoding. CCITT encoding is

designed to achieve efﬁcient compression of monochrome (1 bit per pixel) image

data at relatively low resolutions, and so is useful only for bitmap image data, not

for color images, grayscale images, or text.

The CCITT encoding standard is deﬁned by the International Telecommunica-

tions Union (ITU), formerly known as the Comité Consultatif International

Téléphonique et Télégraphique (International Coordinating Committee for Tele-

phony and Telegraphy). The encoding algorithm is not described in detail here,

but can be found in ITU Recommendations T.4 and T.6 (see the Bibliography).

For historical reasons, we refer to these documents as the CCITT standard.

CCITT encoding is bit-oriented, not byte-oriented. This means that, in principle,

encoded or decoded data might not end at a byte boundary. This problem is dealt

with in the following ways:

• Unencoded data is treated as complete scan lines, with unused bits inserted at

the end of each scan line to ﬁll out the last byte. This is compatible with the

PDF convention for sampled image data.

• Encoded data is ordinarily treated as a continuous, unbroken bit stream. The

EncodedByteAlign parameter (described in Table 3.9) can be used to cause

each encoded scan line to be ﬁlled to a byte boundary; although this is not pre-

scribed by the CCITT standard and fax machines never do this, some software

packages ﬁnd it convenient to encode data this way.

• When a ﬁlter reaches EOD, it always skips to the next byte boundary following

the encoded data.

If the CCITTFaxDecode filter encounters improperly encoded source data, an error will occur. The filter will not perform any error correction or resynchronization, except as noted for the DamagedRowsBeforeError parameter in Table 3.9.

Table 3.9 lists the optional parameters that can be used to control the decoding.

Except where noted otherwise, all values supplied to the decoding ﬁlter by any of

these parameters must match those used when the data was encoded.

TABLE 3.9 Optional parameters for the CCITTFaxDecode filter

KEY TYPE VALUE

K integer A code identifying the encoding scheme used:

<0 Pure two-dimensional encoding (Group 4)

0 Pure one-dimensional encoding (Group 3, 1-D)

>0 Mixed one- and two-dimensional encoding (Group 3, 2-D), in which a line encoded one-dimensionally can be followed by at most K − 1 lines encoded two-dimensionally

The filter distinguishes among negative, zero, and positive values of K to determine how to interpret the encoded data; however, it does not distinguish between different positive K values. Default value: 0.

EndOfLine boolean A flag indicating whether end-of-line bit patterns are required to be present in the encoding. The CCITTFaxDecode filter always accepts end-of-line bit patterns, but requires them only if EndOfLine is true. Default value: false.

EncodedByteAlign boolean A flag indicating whether the filter expects extra 0-bits before each encoded line so that the line begins on a byte boundary. If true, the filter skips over encoded bits to begin decoding each line at a byte boundary. If false, the filter does not expect extra bits in the encoded representation. Default value: false.

Columns integer The width of the image in pixels. If the value is not a multiple of 8, the filter adjusts the width of the unencoded image to the next multiple of 8, so that each line starts on a byte boundary. Default value: 1728.

Rows integer The height of the image in scan lines. If the value is 0 or absent, the image’s height is not predetermined, and the encoded data must be terminated by an end-of-block bit pattern or by the end of the filter’s data. Default value: 0.

EndOfBlock boolean A flag indicating whether the filter expects the encoded data to be terminated by an end-of-block pattern, overriding the Rows parameter. If false, the filter stops when it has decoded the number of lines indicated by Rows or when its data has been exhausted, whichever occurs first. The end-of-block pattern is the CCITT end-of-facsimile-block (EOFB) or return-to-control (RTC) appropriate for the K parameter. Default value: true.

BlackIs1 boolean A flag indicating whether 1-bits are to be interpreted as black pixels and 0-bits as white pixels, the reverse of the normal PDF convention for monochrome image data. Default value: false.

DamagedRowsBeforeError integer The number of damaged rows of data to be tolerated before an error occurs. This entry applies only if EndOfLine is true and K is nonnegative. Tolerating a damaged row means locating its end in the encoded data by searching for an EndOfLine pattern and then substituting decoded data from the previous row if the previous row was not damaged, or a white scan line if the previous row was also damaged. Default value: 0.

The compression achieved using CCITT encoding depends on the data, as well as

on the value of various optional parameters. For Group 3 one-dimensional en-

coding, in the best case (all zeros), each scan line compresses to 4 bytes, and the

compression factor depends on the length of a scan line. If the scan line is 300

bytes long, a compression ratio of approximately 75:1 is achieved. The worst

case, an image of alternating ones and zeros, produces an expansion of 2:9.

3.3.6 DCTDecode Filter

The DCTDecode ﬁlter decodes grayscale or color image data that has been encod-

ed in the JPEG baseline format. (JPEG stands for the Joint Photographic Experts

Group, a group within the International Organization for Standardization that

developed the format; DCT stands for discrete cosine transform, the primary

technique used in the encoding.)

JPEG encoding is a “lossy” compression method, meaning that the data produced

by the decoder is not exactly the same as the data originally presented to the en-

coder. This method is designed speciﬁcally for compression of sampled

continuous-tone images, not for general data compression.

Data to be encoded consists of a stream of image samples, each consisting of one,

two, three, or four color components. The color component values for a particu-

lar sample must appear consecutively. Each component value occupies an 8-bit

byte.

During encoding, several parameters control the algorithm and the information loss. The values of these parameters, which include the dimensions of the image and the number of components per sample, are entirely under the control of the encoder and are stored in the encoded data. DCTDecode generally obtains the parameter values it requires directly from the encoded data. However, in one instance, the parameter might not be present in the encoded data but must be specified in the filter parameter dictionary; see Table 3.10.

TABLE 3.10 Optional parameter for the DCTDecode filter

KEY TYPE VALUE

ColorTransform integer A code specifying the transformation to be performed on the sample values:

0 No transformation.

1 If the image has three color components, transform RGB values to YUV before encoding and from YUV to RGB after decoding. If the image has four components, transform CMYK values to YUVK before encoding and from YUVK to CMYK after decoding. This option is ignored if the image has one or two color components.

Note: The RGB and YUV used here have nothing to do with the color spaces defined as part of the Adobe imaging model. The purpose of converting from RGB to YUV is to separate luminance and chrominance information (see below).

The default value of ColorTransform is 1 if the image has three components and 0 otherwise. In other words, conversion between RGB and YUV is performed for all three-component images unless explicitly disabled by setting ColorTransform to 0. Additionally, the encoding algorithm inserts an Adobe-defined marker code in the encoded data indicating the ColorTransform value used. If present, this marker code overrides the ColorTransform value given to DCTDecode. Thus it is necessary to specify ColorTransform only when decoding data that does not contain the Adobe-defined marker code.

The details of the encoding algorithm are not presented here but can be found in

the ISO speciﬁcation and in JPEG: Still Image Data Compression Standard, by

Pennebaker and Mitchell (see the Bibliography). Brieﬂy, the JPEG algorithm

breaks an image up into blocks 8 samples wide by 8 high. Each color component

in an image is treated separately. A two-dimensional DCT is performed on each

block. This operation produces 64 coefﬁcients, which are then quantized. Each

coefﬁcient may be quantized with a different step size. It is this quantization that

results in the loss of information in the JPEG algorithm. The quantized coef-

ﬁcients are then compressed.

The encoding algorithm can reduce the information loss by making the step size

in the quantization smaller at the expense of reducing the amount of compres-

sion achieved by the algorithm. The compression achieved by the JPEG algorithm

depends on the image being compressed and the amount of loss that is accept-

able. In general, a compression of 15:1 can be achieved without perceptible loss

of information, and 30:1 compression causes little impairment of the image.

Better compression is often possible for color spaces that treat luminance and

chrominance separately than for those that do not. The RGB-to-YUV conversion

provided by the ﬁlters is one attempt to separate luminance and chrominance; it

conforms to CCIR recommendation 601-1. Other color spaces, such as the CIE

1976 L*a*b* space, may also achieve this objective. The chrominance compo-

nents can then be compressed more than the luminance by using coarser sam-

pling or quantization, with no degradation in quality.

The JPEG ﬁlter implementation in Adobe Acrobat products does not support fea-

tures of the JPEG standard that are irrelevant to images. In addition, certain

choices have been made regarding reserved marker codes and other optional fea-

tures of the standard. For details, see Adobe Technical Note #5116, Supporting the

DCT Filters in PostScript Level 2.

In addition to the baseline JPEG format, in PDF 1.3 the DCTDecode filter supports the progressive JPEG extension. This extension does not add any entries to the DCTDecode parameter dictionary; the distinction between baseline and progressive JPEG is represented in the encoded data.

Note: There is no beneﬁt to using progressive JPEG for stream data that is embedded

in a PDF ﬁle. Decoding progressive JPEG is slower and consumes more memory than

baseline JPEG. The purpose of this feature is to enable a stream to refer to an ex-

ternal ﬁle whose data happens to be already encoded in progressive JPEG. (See also

implementation note 10 in Appendix H.)

3.4 File Structure

The preceding sections describe the syntax of individual objects. This section

describes how objects are organized in a PDF ﬁle for efﬁcient random access and

incremental update. A canonical PDF ﬁle initially consists of four elements (see

Figure 3.2):

• A one-line header identifying the version number of the PDF speciﬁcation to

which the ﬁle conforms

• A body containing the objects that make up the document contained in the ﬁle

• A cross-reference table containing information about the indirect objects in the

ﬁle

• A trailer giving the location of the cross-reference table and of certain special

objects within the body of the ﬁle

FIGURE 3.2 Initial structure of a PDF file (top to bottom: Header, Body, Cross-reference table, Trailer)

This initial structure may be modiﬁed by later updates, which append additional

elements to the end of the ﬁle; see Section 3.4.5, “Incremental Updates,” for

details.

As a matter of convention, the tokens in a PDF ﬁle are arranged into lines; see

Section 3.1, “Lexical Conventions.” Each line is terminated by an end-of-line

(EOL) marker, which may be a carriage return (character code 13), a line feed

(character code 10), or both. PDF ﬁles with binary data may have arbitrarily long

lines. However, to increase compatibility with other applications that process

PDF ﬁles, lines that are not part of stream object data are limited to no more than

255 characters (see implementation notes 11 and 12 in Appendix H).

The rules described here are sufﬁcient to produce a well-formed PDF ﬁle. How-

ever, there are some additional rules for organizing a PDF ﬁle to enable efﬁcient

incremental access to a document’s components in a network environment. This

form of organization, called Linearized PDF, is described in Appendix F.

3.4.1 File Header

The ﬁrst line of a PDF ﬁle is a header identifying the version number of the PDF

speciﬁcation to which the ﬁle conforms. For a ﬁle conforming to PDF version 1.3,

the header should be

%PDF-1.3

However, since any ﬁle conforming to an earlier version of PDF also conforms to

version 1.3, an application that processes PDF 1.3 can also accept ﬁles with any of

the following headers:

%PDF-1.0

%PDF-1.1

%PDF-1.2

(See also implementation notes 13 and 14 in Appendix H.)

Furthermore, under some conditions, a viewer application may be able to process

PDF ﬁles conforming to a later version than it was designed to accept. New PDF

features are often introduced in such a way that they can safely be ignored by a

viewer that does not understand them (see Section H.1, “PDF Version Num-

bers”).

Note: If a PDF ﬁle contains binary data, as most do (see Section 3.1, “Lexical Con-

ventions”), it is recommended that the header line be immediately followed by a

comment line containing at least four binary characters—that is, characters whose

codes are 128 or greater. This will ensure proper behavior of ﬁle transfer applications

that inspect data near the beginning of a ﬁle to determine whether to treat the ﬁle’s

contents as text or as binary.
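The header rules above can be sketched in a few lines of Python. This is only an illustration: the helper names `parse_header` and `has_binary_marker` are hypothetical, and the single-digit major.minor pattern is an assumption based on the version numbers listed in this section.

```python
import re

# Matches a header such as %PDF-1.3. The single-digit "major.minor" form is
# an assumption drawn from the versions listed in this section.
_HEADER = re.compile(rb"%PDF-(\d)\.(\d)")


def parse_header(first_line: bytes) -> tuple:
    """Return (major, minor) parsed from the first line of a PDF file."""
    match = _HEADER.match(first_line)
    if match is None:
        raise ValueError("first line is not a PDF header")
    return int(match.group(1)), int(match.group(2))


def has_binary_marker(second_line: bytes) -> bool:
    """True if the line is a comment holding at least four bytes >= 128."""
    high_bytes = sum(byte >= 128 for byte in second_line)
    return second_line.startswith(b"%") and high_bytes >= 4
```

A reader written for PDF 1.3 might accept any file for which `parse_header` returns a version of (1, 3) or lower, and decide separately how to treat later versions.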

3.4.2 File Body

The body of a PDF ﬁle consists of a sequence of indirect objects representing the

contents of a document. The objects, which are of the basic types described in

Section 3.2, “Objects,” represent components of the document such as fonts,

pages, and sampled images.

3.4.3 Cross-Reference Table

The cross-reference table contains information that permits random access to in-

direct objects within the ﬁle, so that the entire ﬁle need not be read to locate any

particular object. The table contains a one-line entry for each indirect object,

specifying the location of that object within the body of the ﬁle.

The cross-reference table is the only part of a PDF ﬁle with a ﬁxed format; this

permits entries in the table to be accessed randomly. The table comprises one or

more cross-reference sections. Initially, the entire table consists of a single section;

one additional section is added each time the ﬁle is updated (see Section 3.4.5,

“Incremental Updates”).

Each cross-reference section begins with a line containing the keyword xref. Following this line are one or more cross-reference subsections, which may appear in any order. The subsection structure is useful for incremental updates, since it allows a new cross-reference section to be added to the PDF file, containing entries only for objects that have been added or deleted. For a file that has never been updated, the cross-reference section contains only one subsection, whose object numbering begins at 0.

Each cross-reference subsection contains entries for a contiguous range of object

numbers. The subsection begins with a line containing two numbers, separated

by a space: the object number of the ﬁrst object in this subsection and the num-

ber of entries in the subsection. For example, the line

28 5

introduces a subsection containing ﬁve objects, numbered consecutively from 28

to 32.

Following this line are the cross-reference entries themselves, one per line. Each entry is exactly 20 bytes long, including the end-of-line marker. There are two kinds of cross-reference entry: one for objects that are in use and another for objects that have been deleted and so are free. Both types of entry have similar basic formats, distinguished by the keyword n (for an in-use entry) or f (for a free entry). The format of an in-use entry is as follows:

nnnnnnnnnn ggggg n eol

where

nnnnnnnnnn is a 10-digit byte offset

ggggg is a 5-digit generation number

n is a literal keyword identifying this as an in-use entry

eol is a 2-character end-of-line sequence

The byte offset is a 10-digit number, padded with leading zeros if necessary, giving the number of bytes from the beginning of the file to the beginning of the object. It is separated from the generation number by a single space. The generation number is a 5-digit number, also padded with leading zeros if necessary. Following the generation number is a single space, the keyword n, and then a 2-character end-of-line sequence. If the file’s end-of-line marker is a single character (either a carriage return or a line feed), it is preceded by a single space; if the marker is 2 characters (both a carriage return and a line feed), it is not preceded by a space. Thus the overall length of the entry is always exactly 20 bytes.
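The fixed 20-byte layout can be illustrated with a short Python sketch. The function names `format_entry` and `parse_entry` are hypothetical and not part of the specification; the code only mirrors the field widths and end-of-line rule described above.

```python
def format_entry(value: int, generation: int, in_use: bool,
                 eol: bytes = b"\r\n") -> bytes:
    """Build one 20-byte cross-reference entry."""
    if eol not in (b"\r", b"\n", b"\r\n"):
        raise ValueError("unsupported end-of-line marker")
    if len(eol) == 1:
        # A single-character marker is preceded by a space so that the
        # end-of-line sequence is always 2 characters.
        eol = b" " + eol
    keyword = b"n" if in_use else b"f"
    entry = b"%010d %05d %s" % (value, generation, keyword) + eol
    if len(entry) != 20:
        raise ValueError("value or generation number has too many digits")
    return entry


def parse_entry(entry: bytes) -> tuple:
    """Split a 20-byte entry into (first field, generation, keyword)."""
    if len(entry) != 20:
        raise ValueError("a cross-reference entry is exactly 20 bytes")
    return int(entry[0:10]), int(entry[11:16]), entry[17:18].decode("ascii")
```

Because every entry has the same length, a reader can locate the entry for a given object number by simple arithmetic from the start of its subsection rather than by scanning.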

The cross-reference entry for a free object has essentially the same format, except that the keyword is f instead of n and the interpretation of the first item is different:

nnnnnnnnnn ggggg f eol

where

nnnnnnnnnn is the 10-digit object number of the next free object

ggggg is a 5-digit generation number

f is a literal keyword identifying this as a free entry

eol is a 2-character end-of-line sequence

The free entries in the cross-reference table form a linked list, with each free entry

containing the object number of the next. The ﬁrst entry in the table (object

number 0) is always free and has a generation number of 65,535; it is the head of

the linked list of free objects. The last free entry (the tail of the linked list) links

back to object number 0.

Except for object number 0, all objects in the cross-reference table initially have

generation numbers of 0. When an indirect object is deleted, its cross-reference

entry is marked free and it is added to the linked list of free entries. The entry’s

generation number is incremented by 1 to indicate the generation number to be

used the next time an object with that object number is created. Thus each time

the entry is reused, it is given a new generation number. The maximum genera-

tion number is 65,535; when a cross-reference entry reaches this value, it will

never be reused.

The cross-reference table (comprising the original cross-reference section and all

update sections) must contain one entry for each object number from 0 to the

maximum object number used in the ﬁle, even if one or more of the object num-

bers in this range do not actually occur in the ﬁle.

Example 3.4 shows a cross-reference section consisting of a single subsection

with six entries: four that are in use (objects number 1, 2, 4, and 5) and two that

are free (objects number 0 and 3). Object number 3 has been deleted, and the

next object created with that object number will be given a generation number

of 7.

Example 3.4

xref
0 6
0000000003 65535 f
0000000017 00000 n
0000000081 00000 n
0000000000 00007 f
0000000331 00000 n
0000000409 00000 n
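The linked list of free entries can be walked as in the following Python sketch. The dictionary layout and the name `free_object_numbers` are illustrative assumptions; the data values are those of Example 3.4.

```python
# Entries of Example 3.4 as {object number: (first field, generation, keyword)}.
EXAMPLE_3_4 = {
    0: (3, 65535, "f"),
    1: (17, 0, "n"),
    2: (81, 0, "n"),
    3: (0, 7, "f"),
    4: (331, 0, "n"),
    5: (409, 0, "n"),
}


def free_object_numbers(table: dict) -> list:
    """Follow the free list from object 0 until it links back to object 0."""
    chain = [0]
    next_free = table[0][0]
    while next_free != 0:
        # Guard against loops and against links to in-use entries.
        if next_free in chain or table[next_free][2] != "f":
            raise ValueError("corrupt free list")
        chain.append(next_free)
        next_free = table[next_free][0]
    return chain


print(free_object_numbers(EXAMPLE_3_4))  # object numbers on the free list
print(EXAMPLE_3_4[3][1])                 # generation to use when 3 is reused
```

For Example 3.4 the walk visits object 0 and then object 3, whose entry links back to 0 and so ends the list.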

Example 3.5 shows a cross-reference section with four subsections, containing a

total of ﬁve entries. The ﬁrst subsection contains one entry, for object number 0,

which is free. The second subsection contains one entry, for object number 3,

which is in use. The third subsection contains two entries, for objects number 23

and 24, both of which are in use. Object number 23 has been reused, as can be

seen from the fact that it has a generation number of 2. The fourth subsection

contains one entry, for object number 30, which is in use.

Example 3.5

xref
0 1
0000000000 65535 f
3 1
0000025325 00000 n
23 2
0000025518 00002 n
0000025635 00000 n
30 1
0000025777 00000 n
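A cross-reference section with several subsections can be read as sketched below in Python. The function name `parse_xref_section` is hypothetical, and for brevity the sketch splits lines on whitespace instead of relying on the fixed 20-byte entry width. The sample text mirrors Example 3.5, with a header line introducing each subsection as the format requires.

```python
EXAMPLE_3_5 = """\
xref
0 1
0000000000 65535 f
3 1
0000025325 00000 n
23 2
0000025518 00002 n
0000025635 00000 n
30 1
0000025777 00000 n
"""


def parse_xref_section(text: str) -> dict:
    """Return {object number: (first field, generation, keyword)}."""
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    if not lines or lines[0] != "xref":
        raise ValueError("section must begin with the xref keyword")
    table = {}
    index = 1
    while index < len(lines):
        # Subsection header: first object number and number of entries.
        first, count = (int(field) for field in lines[index].split())
        index += 1
        for object_number in range(first, first + count):
            value, generation, keyword = lines[index].split()
            table[object_number] = (int(value), int(generation), keyword)
            index += 1
    return table
```

Applied to the sample, the result has entries for objects 0, 3, 23, 24, and 30 only, which is what the four subsection headers announce.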

See Section G.6, “Updating Example,” for a more extensive example of the struc-

ture of a PDF ﬁle that has been updated several times.

3.4.4 File Trailer

The trailer of a PDF file enables an application reading the file to quickly find the cross-reference table and certain special objects. Applications should read a PDF file from its end. The last line of the file contains only the end-of-file marker, %%EOF. (See implementation note 15 in Appendix H.) The two preceding lines contain the keyword startxref and the byte offset from the beginning of the file to the beginning of the xref keyword in the last cross-reference section. The startxref line is preceded by the trailer dictionary, consisting of the keyword trailer followed by a series of key-value pairs enclosed in double angle brackets. Thus the trailer has the following overall structure:

trailer
<< key value
   key value
   …
   key value
>>
startxref
Byte_offset_of_last_cross-reference_section
%%EOF

Table 3.11 shows the contents of the trailer dictionary.

TABLE 3.11 Entries in the trailer dictionary

KEY TYPE VALUE

Size integer (Required) The total number of entries in the ﬁle’s cross-reference table, as deﬁned

by the combination of the original section and all update sections. Equivalently, this

value is 1 greater than the highest object number used in the ﬁle.

Prev integer (Present only if the ﬁle has more than one cross-reference section) The byte offset from

the beginning of the ﬁle to the beginning of the previous cross-reference section.

Root dictionary (Required; must be an indirect reference) The catalog object for the PDF document

contained in the ﬁle (see Section 3.6.1, “Document Catalog”).

Encrypt dictionary (Required if document is encrypted; PDF 1.1) The document’s encryption dictionary

(see Section 3.5, “Encryption”).

Info dictionary (Optional; must be an indirect reference) The document’s information dictionary

(see Section 8.2, “Document Information Dictionary”).

ID array (Optional; PDF 1.1) An array of two strings, each of which is a ﬁle identiﬁer (see

Section 8.3, “File Identiﬁers”). The ﬁrst identiﬁer is established permanently when

the ﬁle is created; the second is changed each time the ﬁle is updated.

Example 3.6 shows an example trailer for a file that has never been updated (as indicated by the absence of a Prev entry in the trailer dictionary).

Example 3.6

trailer
<< /Size 22
   /Root 2 0 R
   /Info 1 0 R
   /ID [ <81b14aafa313db63dbd6f981e49f94f4>
         <81b14aafa313db63dbd6f981e49f94f4>
       ]
>>
startxref
18799
%%EOF
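Reading a file from its end, as recommended above, might look like the following Python sketch. The name `find_startxref` and the 1024-byte search window are assumptions made for illustration; the specification does not prescribe how far back a reader should look.

```python
import re

# startxref, the byte offset, and %%EOF as the last tokens of the file.
_TAIL = re.compile(rb"startxref\s+(\d+)\s+%%EOF\s*$")


def find_startxref(data: bytes, window: int = 1024) -> int:
    """Return the byte offset of the last cross-reference section."""
    match = _TAIL.search(data[-window:])
    if match is None:
        raise ValueError("startxref not found near the end of the file")
    return int(match.group(1))


# The tail of a file ending like Example 3.6 (dictionary abbreviated).
sample = b"trailer\n<< /Size 22\n/Root 2 0 R\n>>\nstartxref\n18799\n%%EOF\n"
print(find_startxref(sample))
```

With the offset in hand, a reader can seek directly to the xref keyword and then parse the trailer dictionary that follows the cross-reference section.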

3.4.5 Incremental Updates

The contents of a PDF ﬁle can be updated incrementally without rewriting the

entire ﬁle. Changes are appended to the end of the ﬁle, leaving its original con-

tents intact. Any new or changed objects are appended, a cross-reference section

is added, and a new trailer is inserted. The resulting ﬁle has the structure shown

in Figure 3.3. A complete example of an updated ﬁle is shown in Section G.6,

“Updating Example.”

The cross-reference section added when a file is updated contains entries only for objects that have been changed, replaced, or deleted, plus the entry for object 0. Deleted objects are left unchanged in the file, but are marked as deleted via their cross-reference entries. The added trailer contains all the entries (perhaps modified) from the previous trailer, as well as a Prev entry giving the location of the previous cross-reference section (see Table 3.11 on page 61). As shown in Figure 3.3, a file that has been updated several times contains several trailers; note that each trailer is terminated by its own end-of-file (%%EOF) marker.

Because updates are appended to PDF ﬁles, it is possible to end up with several

copies of an object with the same object identiﬁer (object number and generation

number). This can occur, for example, if a text annotation (see Section 7.4, “An-

notations”) is changed several times, with the ﬁle being saved between changes.

Because the text annotation object is not deleted, it retains the same object num-

ber and generation number as before. An updated copy of the object is included

in the new update section added to the ﬁle; the update’s cross-reference section

includes a byte offset to this new copy of the object, overriding the old byte offset

contained in the original cross-reference section. When a viewer application

reads the ﬁle, it must build its cross-reference information in such a way that the

most recent copy of each object is the one accessed in the ﬁle.
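One way to satisfy this requirement is to visit the cross-reference sections from newest to oldest (following each trailer's Prev entry) and keep only the first entry seen for each object number. The Python sketch below assumes the sections have already been parsed into dictionaries; the function name and the sample offsets are hypothetical.

```python
def merge_sections(sections_newest_first: list) -> dict:
    """Combine cross-reference sections so the newest entry wins."""
    merged = {}
    for section in sections_newest_first:
        for object_number, entry in section.items():
            # setdefault keeps the entry from the most recent section and
            # ignores older entries for the same object number.
            merged.setdefault(object_number, entry)
    return merged


# Hypothetical data: object 2 was rewritten by one incremental update.
original = {0: (0, 65535, "f"), 1: (17, 0, "n"), 2: (81, 0, "n")}
update_1 = {0: (0, 65535, "f"), 2: (500, 0, "n")}
current = merge_sections([update_1, original])
print(current[2])  # entry from the update, not the original section
```

The old copy of object 2 is still present in the file at its original offset; it is simply no longer reachable through the merged table.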

FIGURE 3.3 Structure of an updated PDF file (top to bottom: header, original body, original cross-reference section, original trailer; then, for each update 1 through n: body update, cross-reference section, updated trailer)

3.5 Encryption

A PDF document can be encrypted (PDF 1.1) to protect its contents from un-

authorized access. Encryption applies to all strings and streams in the document’s

PDF ﬁle, but not to other object types such as integers and boolean values, which

are used primarily to convey information about the document’s structure rather

than its content. Leaving these values unencrypted allows random access to the

objects within a document, while encrypting the strings and streams protects the

document’s substantive contents.

Note: When a PDF stream object (see Section 3.2.7, “Stream Objects”) refers to an

external ﬁle, the stream’s contents are not encrypted, since they are not part of the

PDF ﬁle itself. However, if the contents of the stream are embedded within the PDF

ﬁle (see Section 3.10.3, “Embedded File Streams”), they are encrypted like any other

stream in the ﬁle.

Encryption is controlled by an encryption dictionary, which is the value of the

Encrypt entry in the document’s trailer dictionary (see Table 3.11 on page 61). If

this entry is absent from the trailer dictionary, the document is not encrypted.

The entries shown in Table 3.12 are common to all encryption dictionaries.

TABLE 3.12 Entries common to all encryption dictionaries

KEY TYPE VALUE

Filter name (Required) The name of the security handler for this document; see below. Default value:

Standard, for the built-in security handler. (Names for nonstandard security handlers

can be registered using the procedure described in Appendix E.)

V number (Optional) A code specifying the algorithm to be used in encrypting and decrypting the

document:

1 Algorithm 3.1 on page 66

0 An alternate algorithm that is undocumented and no longer supported, and

whose use is strongly discouraged

The default value if this entry is omitted is 0, but a value of 1 is strongly recommended.

Values greater than 1 are not deﬁned for PDF 1.3, and documents specifying such values

cannot be opened by PDF 1.3 viewer applications.

Encryption3.5

The encryption dictionary’s Filter entry identiﬁes the ﬁle’s security handler, a soft-

ware module that implements various aspects of the encryption process and con-

trols access to the contents of the encrypted document. PDF speciﬁes a standard

security handler that all viewer applications are expected to support, but applica-

tions may optionally substitute alternate security handlers of their own. The re-

maining contents of the encryption dictionary are determined by the security

handler, and may vary from one handler to another. Those for the standard secu-

rity handler are described below in Section 3.5.2, “Standard Security Handler.”

Unlike strings within the body of the document, those in the encryption diction-

ary must be direct objects and are not encrypted by the usual methods. The secu-

rity handler itself is responsible for encrypting and decrypting strings in the

encryption dictionary, using whatever encryption algorithm it chooses.

Note: If the standard encryption methods provided by PDF are not sufﬁcient to their

needs, document creators have two choices: they can provide an alternate, more

secure security handler or they can encrypt whole PDF documents themselves, by-

passing PDF security entirely.

3.5.1 General Encryption Algorithm

PDF’s standard encryption methods use the MD5 message-digest algorithm (de-

scribed in Internet RFC 1321, The MD5 Message-Digest Algorithm; see the Bibli-

ography) and a proprietary encryption algorithm known as RC4. RC4 is a

symmetric stream cipher—the same algorithm is used for both encryption and

decryption, and the algorithm does not change the length of the data.

Note: RC4 is a copyrighted, proprietary algorithm of RSA Security, Inc. Adobe Sys-

tems has licensed this algorithm for use in its Acrobat products. Independent soft-

ware vendors may be required to license RC4 in order to develop software that

encrypts or decrypts PDF documents. For further information, visit the RSA Web site

at <http://www.rsasecurity.com> or send e-mail to <products@rsasecurity.com>.

The encryption of data in a PDF ﬁle is based on the use of an encryption key com-

puted by the security handler. Different security handlers can compute the key in

a variety of ways, more or less cryptographically secure. In particular, PDF’s stan-

dard encryption handler limits the key to 5 bytes (40 bits) in length, in accor-

dance with U.S. cryptographic export requirements in effect at the time of initial

publication of the PDF 1.3 speciﬁcation. Regardless of how the key is computed,

its use in the encryption of data is always the same (see Algorithm 3.1). Because

the RC4 algorithm is symmetric, this same sequence of steps can be used (given a

key) both to encrypt and to decrypt data.

Algorithm 3.1 Encryption of data using an encryption key

1. Obtain the object number and generation number from the object identiﬁer of

the string or stream to be encrypted (see Section 3.2.9, “Indirect Objects”). If the

string is a direct object, use the identiﬁer of the indirect object containing it.

2. Treating the object number and generation number as binary integers, extend the

original 5-byte key to 10 bytes by appending the low-order 3 bytes of the object

number and the low-order 2 bytes of the generation number in that order, low-

order byte ﬁrst.

3. Pass the resulting 10-byte string as input to the MD5 hash function.

4. Use the ﬁrst 10 bytes of the output from the MD5 function as the key for the RC4

encryption function, along with the string or stream data to be encrypted. The

output is the encrypted data to be stored in the PDF ﬁle.

Stream data is encrypted after applying all stream encoding filters, and is decrypted before applying any stream decoding filters; the number of bytes to be encrypted or decrypted is given by the Length entry in the stream dictionary.

Decryption of strings (other than those in the encryption dictionary) is done

after escape-sequence processing and hexadecimal decoding as appropriate to the

string representation described in Section 3.2.3, “String Objects.”
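Algorithm 3.1 can be sketched in Python as follows. This is an illustrative reading of the steps, not a reference implementation: the function names are hypothetical, MD5 comes from the standard `hashlib` module, and `rc4` is a plain textbook implementation of the cipher written out here so that the example is self-contained.

```python
import hashlib


def rc4(key: bytes, data: bytes) -> bytes:
    """Textbook RC4; the same call both encrypts and decrypts."""
    state = list(range(256))
    j = 0
    for i in range(256):  # key scheduling
        j = (j + state[i] + key[i % len(key)]) % 256
        state[i], state[j] = state[j], state[i]
    out = bytearray()
    i = j = 0
    for byte in data:  # keystream generation, XORed with the data
        i = (i + 1) % 256
        j = (j + state[i]) % 256
        state[i], state[j] = state[j], state[i]
        out.append(byte ^ state[(state[i] + state[j]) % 256])
    return bytes(out)


def object_key(file_key: bytes, object_number: int, generation: int) -> bytes:
    """Steps 2 to 4: derive the 10-byte RC4 key for one indirect object."""
    if len(file_key) != 5:
        raise ValueError("the PDF 1.3 encryption key is 5 bytes long")
    extended = (
        file_key
        + (object_number & 0xFFFFFF).to_bytes(3, "little")  # low-order 3 bytes
        + (generation & 0xFFFF).to_bytes(2, "little")       # low-order 2 bytes
    )
    return hashlib.md5(extended).digest()[:10]


def crypt_object_data(file_key: bytes, object_number: int,
                      generation: int, data: bytes) -> bytes:
    """Encrypt or decrypt the string or stream data of one object."""
    return rc4(object_key(file_key, object_number, generation), data)
```

Because the object number and generation number enter the key, identical strings stored in different objects generally encrypt to different bytes, and applying `crypt_object_data` twice with the same arguments returns the original data.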

3.5.2 Standard Security Handler

PDF’s standard security handler allows two passwords to be speciﬁed for a docu-

ment: an owner password and a user password. Correctly supplying either pass-

word allows a user to open the document, decrypt it, and display it on the screen.

The owner password allows the following additional operations:

• Modifying the document’s contents

• Copying text and graphics from the document

• Adding or modifying text annotations (see Section 7.4, “Annotations”) and

interactive form ﬁelds (Section 7.6, “Interactive Forms”)

• Printing the document

Access to any of these operations may be restricted if the user password is sup-

plied instead of the owner password. Access information in the document’s en-

cryption dictionary speciﬁes which, if any, of these additional operations are

permitted by the user password. The owner password must be supplied in order

to change these restrictions or the passwords themselves.

Note: PDF cannot enforce the document access privileges speciﬁed in the encryption

dictionary. It is up to the implementors of PDF viewer applications to respect the in-

tent of the document creator by restricting access to an encrypted PDF ﬁle according

to the passwords and permissions contained in the ﬁle.

Note: If the owner and user passwords are the same, the document is always opened

with user access privileges. It is therefore impossible in these circumstances to obtain

owner privileges for the document.

Encryption Dictionary

Table 3.13 shows the encryption dictionary entries for the standard security handler (in addition to those in Table 3.12). The values of the O and U entries are used to determine whether a password string supplied by the user is the correct owner password, user password, or neither. If the user password is supplied, the P entry determines which operations are to be permitted. A document is encrypted if an owner password, user password, or any access restriction was specified when the document was created. However, the user is prompted for a password on opening the document only if the document has a user password; this can be determined by testing the empty string as the user password (see Algorithm 3.5 on page 70).

The value of the encryption dictionary’s P entry is an unsigned 32-bit integer containing a set of flags specifying which access privileges should be granted when the document is opened with the user password. Table 3.14 shows the meanings of these flags. Bit positions within the flag word are numbered from 1 (low-order) to 32 (high-order); a 1-bit in any position enables the corresponding access privilege.

Note: PDF integer objects in fact are represented internally in signed twos-complement form. Since all the reserved high-order flag bits in the encryption dictionary’s P value are required to be 1, the value must be specified as a negative integer. For example, the value -44 allows printing and copying but disallows modifying the content and annotations.
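The relationship between the signed value of P and the individual flag bits can be checked with a short Python sketch. The flag names and function names are illustrative choices; the bit positions are those listed in Table 3.14.

```python
# Bit positions (1 = low-order) from Table 3.14; the names are illustrative.
USER_ACCESS_BITS = {
    3: "print",
    4: "modify_contents",
    5: "copy",
    6: "annotate_and_fill_forms",
}
RESERVED_HIGH_BITS = 0xFFFFFFC0  # bits 7 to 32, which must all be 1


def decode_permissions(p: int) -> dict:
    """Map each privilege name to True or False for a given P value."""
    word = p & 0xFFFFFFFF  # view the signed integer as a 32-bit flag word
    return {
        name: bool((word >> (bit - 1)) & 1)
        for bit, name in USER_ACCESS_BITS.items()
    }


def encode_permissions(allowed: set) -> int:
    """Build the signed P value that grants exactly the named privileges."""
    word = RESERVED_HIGH_BITS
    for bit, name in USER_ACCESS_BITS.items():
        if name in allowed:
            word |= 1 << (bit - 1)
    return word - (1 << 32)  # bit 32 is always set, so the value is negative


print(decode_permissions(-44))
print(encode_permissions({"print", "copy"}))
```

In binary, -44 ends in 11010100: bits 3 and 5 are set (print and copy), bits 4 and 6 are clear, and every reserved bit from 7 upward is 1.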

TABLE 3.13 Additional encryption dictionary entries for the standard security handler

KEY TYPE VALUE

R number (Required) The revision number of the standard security handler that created this diction-

ary. At the time of publication, the current revision number is 2.

O string (Required) A 32-byte string used in determining whether a valid owner password was

entered. Contains an encrypted version of the padded user password (see step 1 of

Algorithm 3.2 below).

U string (Required) A 32-byte string used in determining whether a valid user password was

entered. Contains an encrypted version of the ﬁxed padding string shown in step 1 of

Algorithm 3.2 below.

P integer (Required) A set of ﬂags specifying which operations are permitted when the document is

opened with the user password (see Table 3.14).

TABLE 3.14 User password access privileges

BIT POSITION MEANING

1–2 Reserved; must be 0

3 Print document

4 Modify contents of document (other than text annotations and

interactive form ﬁelds)

5 Copy text and graphics from document

6 Add or modify text annotations and interactive form ﬁelds

7–32 Reserved; must be 1

Key Generation Algorithms

As noted earlier, one function of a security handler is to generate a 5-byte encryp-

tion key for use in encrypting and decrypting the contents of a document. Given

a password string, the standard security handler computes an encryption key as

shown in Algorithm 3.2.

Algorithm 3.2 Computing an encryption key

1. Pad or truncate the password string to exactly 32 bytes. If the password string is

more than 32 bytes long, use only its ﬁrst 32 bytes; if it is less than 32 bytes long,

pad it by appending the required number of additional bytes from the beginning

of the following padding string:

< 28 BF 4E 5E 4E 75 8A 41 64 00 4E 56 FF FA 01 08
  2E 2E 00 B6 D0 68 3E 80 2F 0C A9 FE 64 53 69 7A >

That is, if the password string is n bytes long, append the ﬁrst 32 − n bytes of the

padding string to the end of the password string. If the password is omitted, treat

it as an empty (zero-length) string and substitute the entire padding string in its

place.

2. Pass the result of step 1 as input to the MD5 hash function, followed by the value of the encryption dictionary’s O entry. (Algorithm 3.3 shows how the O value is computed.)

3. Treat the value of the P entry as an unsigned 4-byte integer and pass these bytes to the MD5 hash function, low-order byte first.

4. Pass the ﬁrst element of the ﬁle’s ﬁle identiﬁer to the MD5 hash function (see

Section 8.3, “File Identiﬁers”).

5. The ﬁrst 5 bytes of the output from the MD5 algorithm constitute the encryption

key.

This algorithm, when applied to the user password, produces the encryption key

used to encrypt or decrypt string and stream data according to Algorithm 3.1 on

page 66. Parts of this algorithm are also used in the algorithms described below.
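The steps of Algorithm 3.2 can be sketched in Python as follows, using the standard `hashlib` module for MD5. The function and parameter names are hypothetical; `o_entry` and `file_id` are assumed to be the raw bytes of the O entry and of the first file identifier string.

```python
import hashlib

# The 32-byte padding string from step 1.
PADDING = bytes.fromhex(
    "28BF4E5E4E758A4164004E56FFFA0108"
    "2E2E00B6D0683E802F0CA9FE6453697A"
)


def pad_password(password: bytes) -> bytes:
    """Step 1: pad or truncate the password to exactly 32 bytes."""
    return (password + PADDING)[:32]


def compute_encryption_key(password: bytes, o_entry: bytes,
                           p: int, file_id: bytes) -> bytes:
    """Steps 2 to 5: feed the inputs to MD5 and keep the first 5 bytes."""
    md5 = hashlib.md5()
    md5.update(pad_password(password))                    # steps 1 and 2
    md5.update(o_entry)                                   # step 2, O entry
    md5.update((p & 0xFFFFFFFF).to_bytes(4, "little"))    # step 3, P entry
    md5.update(file_id)                                   # step 4
    return md5.digest()[:5]                               # step 5
```

An omitted password is passed as an empty byte string, in which case `pad_password` returns the padding string itself.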

In addition to the encryption key, the standard security handler must provide the contents of the encryption dictionary (Tables 3.12 on page 64 and 3.13 on page 68). The values of the Filter, R, P, and V entries are straightforward, but the computation of the O (owner password) and U (user password) entries requires further explanation. Algorithms 3.3 and 3.4 show how to compute the values of these entries.

Algorithm 3.3 Computing the O (owner) value in the encryption dictionary

1. Pad or truncate the owner password string as described in step 1 of Algorithm 3.2.

If there is no owner password, use the user password instead. (See implementa-

tion note 16 in Appendix H.)

2. Pass the result of step 1 as input to the MD5 hash function.

3. Create an RC4 key using the ﬁrst 5 bytes of the MD5 output.

4. Pad or truncate the user password string as described in step 1 of Algorithm 3.2.

5. Encrypt the padded user password string with the RC4 algorithm, using the key

obtained in step 3.

6. Store the result of step 5 as the value of the O entry in the encryption dictionary.

Algorithm 3.4 Computing the U (user) value in the encryption dictionary

1. Create an encryption key based on the user password string, as described in

Algorithm 3.2.

2. Encrypt the 32-byte padding string shown in step 1 of Algorithm 3.2, using the

RC4 algorithm with the encryption key from the preceding step.

3. Store the result of step 2 as the value of the U entry in the encryption dictionary.
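Algorithms 3.3 and 3.4 can be sketched together in Python as shown below. The names are hypothetical, `rc4` is a textbook implementation included so that the example is self-contained, and treating an empty byte string as "no owner password" is an assumption of this sketch.

```python
import hashlib

PADDING = bytes.fromhex(
    "28BF4E5E4E758A4164004E56FFFA0108"
    "2E2E00B6D0683E802F0CA9FE6453697A"
)


def rc4(key: bytes, data: bytes) -> bytes:
    """Textbook RC4; the same call both encrypts and decrypts."""
    state = list(range(256))
    j = 0
    for i in range(256):
        j = (j + state[i] + key[i % len(key)]) % 256
        state[i], state[j] = state[j], state[i]
    out = bytearray()
    i = j = 0
    for byte in data:
        i = (i + 1) % 256
        j = (j + state[i]) % 256
        state[i], state[j] = state[j], state[i]
        out.append(byte ^ state[(state[i] + state[j]) % 256])
    return bytes(out)


def pad_password(password: bytes) -> bytes:
    """Step 1 of Algorithm 3.2."""
    return (password + PADDING)[:32]


def compute_encryption_key(password, o_entry, p, file_id) -> bytes:
    """Algorithm 3.2."""
    data = (pad_password(password) + o_entry
            + (p & 0xFFFFFFFF).to_bytes(4, "little") + file_id)
    return hashlib.md5(data).digest()[:5]


def compute_o(owner_password: bytes, user_password: bytes) -> bytes:
    """Algorithm 3.3: the O entry."""
    # Step 1: fall back to the user password if there is no owner password.
    source = owner_password or user_password
    rc4_key = hashlib.md5(pad_password(source)).digest()[:5]  # steps 2 and 3
    return rc4(rc4_key, pad_password(user_password))          # steps 4 and 5


def compute_u(user_password: bytes, o_entry: bytes,
              p: int, file_id: bytes) -> bytes:
    """Algorithm 3.4: the U entry."""
    key = compute_encryption_key(user_password, o_entry, p, file_id)
    return rc4(key, PADDING)
```

Note the order of operations when creating a document: the O value must be computed first, because it is one of the inputs to the key from which the U value is derived.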

Given a password string supplied by the user, the standard security handler uses the contents of the encryption dictionary to determine whether the document should be opened and what access privileges should be granted. If the password supplied is the correct user password, the document is opened with only the access privileges specified by the P entry in the encryption dictionary. If the password supplied is the correct owner password (but not the same as the user password), full access privileges are granted.

The standard security handler uses Algorithms 3.5 and 3.6 to determine whether

a supplied password string is the correct user or owner password.

Algorithm 3.5 Checking the user password

1. Compute an encryption key from the supplied password string, as described in

Algorithm 3.2.

2. Decrypt the value of the encryption dictionary’s U entry, using the RC4 algorithm with the encryption key computed in step 1.

3. If the result of step 2 is identical to the ﬁxed padding string shown in step 1 of

Algorithm 3.2, the password supplied is the correct user password. The key ob-

tained in step 1 can be used to decrypt the document using Algorithm 3.1 on

page 66.

Document Structure3.6

Algorithm 3.6 Checking the owner password

1. Compute an encryption key from the supplied password string, as described in

steps 1 to 3 of Algorithm 3.3.

2. Decrypt the value of the encryption dictionary’s O entry, using the RC4 algorithm with the encryption key computed in step 1.

3. Use Algorithm 3.2 to compute an encryption key from the decrypted value ob-

tained in step 2.

4. Decrypt the value of the encryption dictionary’s U entry, using the RC4 algorithm with the encryption key computed in step 3.

5. If the result of step 4 is identical to the ﬁxed padding string shown in step 1 of

Algorithm 3.2, the password supplied is the correct owner password. The key ob-

tained in step 3 can be used to decrypt the document using Algorithm 3.1 on

page 66.
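The two password checks (Algorithms 3.5 and 3.6) can be sketched in Python as follows. The function names and the convention of returning the encryption key on success and None on failure are assumptions of this sketch; `rc4` is a textbook implementation included so that the example is self-contained.

```python
import hashlib

PADDING = bytes.fromhex(
    "28BF4E5E4E758A4164004E56FFFA0108"
    "2E2E00B6D0683E802F0CA9FE6453697A"
)


def rc4(key: bytes, data: bytes) -> bytes:
    """Textbook RC4; the same call both encrypts and decrypts."""
    state = list(range(256))
    j = 0
    for i in range(256):
        j = (j + state[i] + key[i % len(key)]) % 256
        state[i], state[j] = state[j], state[i]
    out = bytearray()
    i = j = 0
    for byte in data:
        i = (i + 1) % 256
        j = (j + state[i]) % 256
        state[i], state[j] = state[j], state[i]
        out.append(byte ^ state[(state[i] + state[j]) % 256])
    return bytes(out)


def pad_password(password: bytes) -> bytes:
    """Step 1 of Algorithm 3.2."""
    return (password + PADDING)[:32]


def compute_encryption_key(password, o_entry, p, file_id) -> bytes:
    """Algorithm 3.2."""
    data = (pad_password(password) + o_entry
            + (p & 0xFFFFFFFF).to_bytes(4, "little") + file_id)
    return hashlib.md5(data).digest()[:5]


def check_user_password(password, o_entry, u_entry, p, file_id):
    """Algorithm 3.5: return the encryption key, or None if wrong."""
    key = compute_encryption_key(password, o_entry, p, file_id)
    return key if rc4(key, u_entry) == PADDING else None


def check_owner_password(password, o_entry, u_entry, p, file_id):
    """Algorithm 3.6: return the encryption key, or None if wrong."""
    # Steps 1 and 2: recover the padded user password from the O entry.
    owner_key = hashlib.md5(pad_password(password)).digest()[:5]
    padded_user_password = rc4(owner_key, o_entry)
    # Steps 3 to 5: the recovered value must pass the user password check.
    return check_user_password(padded_user_password, o_entry, u_entry,
                               p, file_id)
```

A viewer could call `check_user_password` with an empty byte string first; if that succeeds, the document has no user password and no prompt is needed.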

3.6 Document Structure

A PDF document can be regarded as a hierarchy of objects contained in the body

section of a PDF ﬁle. At the root of the hierarchy is the document’s catalog dic-

tionary (see Section 3.6.1, “Document Catalog”). Most of the objects in the hier-

archy are dictionaries. For example, each page of the document is represented by

a page object—a dictionary that includes references to the page’s contents and

other attributes, such as its thumbnail image (Section 7.2.3, “Thumbnail Imag-

es”) and any annotations (Section 7.4, “Annotations”) associated with it. The in-

dividual page objects are tied together in a structure called the page tree

(described in Section 3.6.2, “Page Tree”), which in turn is located via an indirect

reference in the document catalog. Parent, child, and sibling relationships within

the hierarchy are deﬁned by dictionary entries whose values are indirect refer-

ences to other dictionaries. Figure 3.4 illustrates the structure of the object hier-

archy.

Note: The data structures described in this section, particularly the catalog and page

dictionaries, combine entries describing document structure with ones dealing with

the detailed semantics of documents and pages. All entries are listed here, but many

of their descriptions are deferred to subsequent chapters.

FIGURE 3.4 Structure of a PDF document (a tree rooted at the document catalog; labeled nodes: page tree, pages, content stream, thumbnail image, annotations, outline hierarchy, outline entries, article threads, threads, beads, named destinations, interactive form)

3.6.1 Document Catalog

The root of a document’s object hierarchy is the catalog dictionary, located via the

Root entry in the trailer of the PDF ﬁle (see Section 3.4.4, “File Trailer”). The cat-

alog contains references to other objects deﬁning the document’s contents, out-

line, article threads (PDF 1.1), named destinations, and other attributes. In

addition, it contains information about how the document should be displayed

on the screen, such as whether its outline and thumbnail page images should be

displayed automatically and whether some location other than the ﬁrst page

should be shown when the document is opened.

Table 3.15 shows the entries in the catalog dictionary. (See also implementation

note 17 in Appendix H.)

TABLE 3.15 Entries in the catalog dictionary

KEY TYPE VALUE

Type name (Required) The type of PDF object that this dictionary describes; must be Catalog for the catalog dictionary.

Pages dictionary (Required, must be an indirect reference) The page tree node that is the

root of the document’s page tree (see Section 3.6.2, “Page Tree”).

PageLabels number tree (Optional; PDF 1.3) A number tree (see Section 3.8.5, “Number Trees”)

deﬁning the page labeling for the document. The keys in this tree are

page indices; the corresponding values are page label dictionaries (see

Section 7.3.1, “Page Labels”). Each page index denotes the ﬁrst page to

which the speciﬁed page label dictionary applies. The tree must include

a value for page index 0.

Names dictionary (Optional; PDF 1.2) The document’s name dictionary (see Section 3.6.3,

“Name Dictionary”).

Dests dictionary (Optional; PDF 1.1; must be an indirect reference) A dictionary of names

and corresponding destinations (see “Named Destinations” on

page 387).

ViewerPreferences dictionary (Optional; PDF 1.2) A viewer preferences dictionary (see Section 7.1,

“Viewer Preferences”) specifying the way the document is to be dis-

played on the screen. If this entry is absent, viewer applications should

use their own current user preference settings.

PageLayout name (Optional) A name object specifying the page layout to be used when the

document is opened:

SinglePage Display one page at a time.

OneColumn Display the pages in one column.

TwoColumnLeft Display the pages in two columns, with odd-

numbered pages on the left.

TwoColumnRight Display the pages in two columns, with odd-

numbered pages on the right.

(See implementation note 18 in Appendix H.) Default value:

SinglePage.

PageMode name (Optional) A name object specifying how the document should be dis-

played when opened:

UseNone Neither document outline nor thumbnail im-

ages visible

UseOutlines Document outline visible

UseThumbs Thumbnail images visible

FullScreen Full-screen mode, with no menu bar, window

controls, or any other window visible

Default value:

UseNone.

Outlines dictionary (Optional; must be an indirect reference) The outline dictionary that is

the root of the document’s outline hierarchy (see Section 7.2.2, “Docu-

ment Outline”).

Threads array (Optional; PDF 1.1; must be an indirect reference) An array of thread dic-

tionaries representing the document’s article threads (see Section 7.3.2,

“Articles”).

OpenAction array or dictionary (Optional; PDF 1.1) A value specifying a destination to be displayed or

an action to be performed when the document is opened. The value is

either an array deﬁning a destination (see Section 7.2.1, “Destinations”)

or an action dictionary representing an action (Section 7.5, “Actions”).

If this entry is absent, the document should be opened to the top of the

ﬁrst page at the default magniﬁcation factor.

URI dictionary (Optional) A dictionary containing document-level information for uni-

form resource identiﬁer (URI) actions (see “URI Actions” on page 428).

AcroForm dictionary (Optional; PDF 1.2) The document’s interactive form (AcroForm) dic-

tionary (see Section 7.6.1, “Interactive Form Dictionary”).

StructTreeRoot dictionary (Optional; PDF 1.3) The document’s structure tree root dictionary (see

“Structure Hierarchy” on page 486).

SpiderInfo dictionary (Optional; PDF 1.3) A dictionary containing state information used by

the Acrobat Web Capture (AcroSpider) plug-in extension (see

Section 8.5.1, “Web Capture Information Dictionary”).

Example 3.7 shows a sample catalog object.

Example 3.7

1 0 obj
<< /Type /Catalog
   /Pages 2 0 R
   /Outlines 3 0 R
   /PageMode /UseOutlines
>>
endobj

3.6.2 Page Tree

The pages of a document are accessed through a structure known as the page tree,

which deﬁnes their ordering within the document. The tree structure allows PDF

viewer applications to quickly open a document containing thousands of pages

using only limited memory. The tree contains nodes of two types—intermediate

nodes, called page tree nodes, and leaf nodes, called page objects—whose form is

described in the sections below. Viewer applications should be prepared to han-

dle any form of tree structure built of such nodes. The simplest structure would

consist of a single page tree node that references all of the document’s page ob-

jects directly; however, to optimize the performance of viewer applications, the

Acrobat Distiller and PDF Writer programs construct trees of a particular form,

known as balanced trees. Further information on this form of tree can be found in

Data Structures and Algorithms, by Aho, Hopcroft, and Ullman (see the Bibliog-

raphy).

Page Tree Nodes

Table 3.16 shows the required entries in a page tree node.

TABLE 3.16 Required entries in a page tree node

KEY TYPE VALUE

Type name (Required) The type of PDF object that this dictionary describes; must be Pages for

a page tree node.

Parent dictionary (Required except in root node; must be an indirect reference) The page tree node that

is the immediate parent of this one.

Kids array (Required) An array of indirect references to the immediate children of this node.

The children may be page objects or other page tree nodes.

Count integer (Required) The number of leaf nodes (page objects) that are descendants of this

node within the page tree.

Note: The structure of the page tree is not necessarily related to the logical structure

of the document itself; that is, page tree nodes do not represent chapters, sections, and

so forth. (Other data structures are deﬁned for that purpose; see Section 8.4.3, “Log-

ical Structure.”) Applications that consume or produce PDF ﬁles are not required to

preserve the existing structure of the page tree.

Example 3.8 illustrates the page tree for a document with three pages. See “Page

Objects,” below, for the contents of the individual page objects, and Section G.4,

“Page Tree Example,” for a more extended example showing the page tree for a

longer document.

Example 3.8

2 0 obj
<< /Type /Pages
   /Kids [ 4 0 R
           10 0 R
           24 0 R
         ]
   /Count 3
>>
endobj

4 0 obj
<< /Type /Page
   … Additional entries describing the attributes of this page …
>>
endobj

10 0 obj
<< /Type /Page
   … Additional entries describing the attributes of this page …
>>
endobj

24 0 obj
<< /Type /Page
   … Additional entries describing the attributes of this page …
>>
endobj

In addition to the entries shown in Table 3.16, a page tree node may contain fur-

ther entries deﬁning inherited attributes for the page objects that are its descen-

dants (see “Inheritance of Page Attributes” on page 80).
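As a rough illustration of how a consumer might enumerate pages, the following Python sketch models the objects of Example 3.8 as plain dictionaries keyed by object number. The objects table and the iter_pages and check_count helpers are hypothetical names chosen for this sketch, not part of PDF or of any library, and the Parent entries are filled in on the assumption that each page points back to object 2. The sketch walks each Kids array in order and checks that every Count entry equals the number of page objects below that node.

```python
# Hypothetical in-memory model: object number -> dictionary.
# An indirect reference such as "4 0 R" is modeled as the plain object number 4.
objects = {
    2: {"Type": "Pages", "Kids": [4, 10, 24], "Count": 3},
    4: {"Type": "Page", "Parent": 2},
    10: {"Type": "Page", "Parent": 2},
    24: {"Type": "Page", "Parent": 2},
}


def iter_pages(objects, node_num):
    """Yield the object numbers of page objects in document order."""
    node = objects[node_num]
    if node["Type"] == "Page":
        yield node_num
        return
    for kid in node["Kids"]:
        yield from iter_pages(objects, kid)


def check_count(objects, node_num):
    """Return the number of leaves under a node, verifying every Count entry."""
    node = objects[node_num]
    if node["Type"] == "Page":
        return 1
    total = sum(check_count(objects, kid) for kid in node["Kids"])
    if total != node["Count"]:
        raise ValueError(
            f"object {node_num}: Count is {node['Count']}, found {total} pages"
        )
    return total


page_order = list(iter_pages(objects, 2))
```

Because the traversal relies only on Type, Kids, and Count, it should work for any tree shape, whether a single flat node or a balanced tree.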

Page Objects

The leaves of the page tree are page objects, each of which is a dictionary specify-

ing the attributes of a single page of the document. Table 3.17 shows the contents

of this dictionary (see also implementation note 19 in Appendix H). The table

also identiﬁes which attributes a page may inherit from its ancestor nodes in the

page tree, as described under “Inheritance of Page Attributes” on page 80.

Attributes that are not explicitly identiﬁed in the table as inheritable cannot be

inherited.

TABLE 3.17 Entries in a page object

KEY TYPE VALUE

Type name (Required) The type of PDF object that this dictionary describes; must be

Page for a page object.

Parent dictionary (Required; must be an indirect reference) The page tree node that is the im-

mediate parent of this page object.

Resources dictionary (Required; inheritable) A dictionary containing any resources required by

the page (see Section 3.7.2, “Resource Dictionaries”). If the page requires

no resources, the value of this entry should be an empty dictionary; omit-

ting the entry entirely, or specifying a null value, indicates that the re-

sources are to be inherited from an ancestor node in the page tree.

MediaBox rectangle (Required; inheritable) A rectangle (see Section 3.8.3, “Rectangles”), ex-

pressed in default user space units, deﬁning the maximum imageable area

of the physical medium on which the page is to be printed (see

Section 8.6.1, “Page Boundaries”).

CropBox rectangle (Optional; inheritable) A rectangle, expressed in default user space units,

deﬁning the region to which the contents of the page are to be clipped

(cropped) when displayed or printed (see Section 8.6.1, “Page Bound-

aries”). Default value: the value of

MediaBox.

BleedBox rectangle (Optional; PDF 1.3) A rectangle, expressed in default user space units, de-

ﬁning the region to which the contents of the page should be clipped

when output in a production environment (see Section 8.6.1, “Page

Boundaries”). Default value: the value of

CropBox.

TrimBox rectangle (Optional; PDF 1.3) A rectangle, expressed in default user space units, de-

ﬁning the intended dimensions of the ﬁnished page after trimming (see

Section 8.6.1, “Page Boundaries”). Default value: the value of

CropBox.

ArtBox rectangle (Optional; PDF 1.3) A rectangle, expressed in default user space units, de-

ﬁning the extent of the page’s meaningful content (including potential

white space) as intended by the page’s creator (see Section 8.6.1, “Page

Boundaries”). Default value: the value of

CropBox.

Contents stream or array (Optional) A content stream (see Section 3.7.1, “Content Streams”) de-

scribing the contents of this page. If this entry is absent, the page is empty.

The value may be either a single stream or an array of streams. If it is an

array, the effect is as if all of the streams in the array were concatenated, in

order, to form a single stream. This allows a program generating a PDF

ﬁle to create image objects and other resources as they occur, even though

they interrupt the content stream. The division between streams may

occur only at the boundaries between lexical tokens (see Section 3.1,

“Lexical Conventions”), but is unrelated to the page’s logical content or

organization. Applications that consume or produce PDF ﬁles are not re-

quired to preserve the existing structure of the

Contents array.

Rotate integer (Optional; inheritable) The number of degrees by which the page should

be rotated clockwise when displayed or printed. The value must be a mul-

tiple of 90. Default value: 0.

Thumb stream (Optional) A stream object deﬁning the page’s thumbnail image (see

Section 7.2.3, “Thumbnail Images”).

B array (Optional; PDF 1.1; recommended if the page contains article beads) An

array of indirect references to article beads appearing on the page (see

Section 7.3.2, “Articles”; see also implementation note 20 in Appendix H).

The beads are listed in the array in natural reading order.

Dur number (Optional; PDF 1.1) The page’s display duration (also called its advance

timing): the maximum length of time, in seconds, that the page will be

displayed during presentations before the viewer application automatical-

ly advances to the next page (see Section 7.3.3, “Presentations”). By de-

fault, the viewer does not advance automatically.

Trans dictionary (Optional; PDF 1.1) A transition dictionary describing the transition

effect to be used when displaying the page during presentations (see

Section 7.3.3, “Presentations”).

Annots array (Optional) An array of annotation dictionaries representing annotations

associated with the page (see Section 7.4, “Annotations”).

AA dictionary (Optional; PDF 1.2) An additional-actions dictionary deﬁning actions to

be performed when the page is opened or closed (see Section 7.5.2, “Trig-

ger Events”; see also implementation note 21 in Appendix H).

PieceInfo dictionary (Optional; PDF 1.3) A page-piece dictionary associated with the page (see

Section 8.4.1, “Page-Piece Dictionaries”).

LastModiﬁed date (Optional unless PieceInfo is present; PDF 1.3) The date and time (see

Section 3.8.2, “Dates”) when the page’s contents were most recently mod-

iﬁed.

StructParents integer (Required if the page contains structural content items; PDF 1.3) The inte-

ger key of the page’s entry in the structural parent tree (see “Finding

Structure Elements from Content Items” on page 496).

ID string (Optional; PDF 1.3; indirect reference preferred) The digital identiﬁer of the

page’s parent Web Capture content set (see Section 8.5.5, “Object At-

tributes Related to Web Capture”).

PZ number (Optional; PDF 1.3) The page’s preferred zoom (magniﬁcation) factor: the

factor by which it should be scaled to achieve the “natural” display magni-

ﬁcation (see Section 8.5.5, “Object Attributes Related to Web Capture”).

SeparationInfo dictionary (Optional; PDF 1.3) A separation dictionary containing information

needed to generate color separations for the page (see Section 8.6.2, “Sep-

aration Dictionaries”).

Example 3.9 shows the deﬁnition of a page object with a thumbnail image and

two annotations. The media box speciﬁes that the page is to be printed on letter-

size paper. In addition, the resource dictionary is speciﬁed as a direct object and

shows that the page makes use of three fonts, named

F3, F5, and F7.

Example 3.9

3 0 obj
<< /Type /Page
   /Parent 4 0 R
   /MediaBox [0 0 612 792]
   /Resources << /Font << /F3 7 0 R
                          /F5 9 0 R
                          /F7 11 0 R
                       >>
                 /ProcSet [/PDF]
              >>
   /Contents 12 0 R
   /Thumb 14 0 R
   /Annots [ 23 0 R
             24 0 R
           ]
>>
endobj

Inheritance of Page Attributes

Some of the page attributes shown in Table 3.17 are designated as inheritable. If

such an attribute is omitted from a page object, its value is inherited from an an-

cestor node in the page tree. If the attribute is a required one, a value must be

supplied in an ancestor node; if it is optional and no inherited value is speciﬁed,

the default value is used.

An attribute can thus be deﬁned once for a whole set of pages, by specifying it in

an intermediate page tree node and arranging the pages that share the attribute as

descendants of that node. For example, a document might specify the same

media box for all of its pages by including a

MediaBox entry in the root node of

the page tree. If necessary, an individual page object could then override this in-

herited value with a

MediaBox entry of its own.
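The lookup rule described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not a definitive implementation: the objects table, the page_attribute helper, and the two-page tree are hypothetical, and indirect references are modeled as plain object numbers. The set of inheritable keys and the default values are taken from Table 3.17.

```python
# Attributes marked "inheritable" in Table 3.17.
INHERITABLE = {"Resources", "MediaBox", "CropBox", "Rotate"}
# Defaults for optional inheritable attributes (Rotate defaults to 0).
DEFAULTS = {"Rotate": 0}

# Hypothetical tree: the root supplies a MediaBox; page 3 overrides it.
objects = {
    1: {"Type": "Pages", "Kids": [2, 3], "Count": 2, "MediaBox": [0, 0, 612, 792]},
    2: {"Type": "Page", "Parent": 1},
    3: {"Type": "Page", "Parent": 1, "MediaBox": [0, 0, 595, 842]},
}


def page_attribute(objects, page_num, key):
    """Return a page attribute, consulting ancestors for inheritable keys."""
    if key not in INHERITABLE:
        return objects[page_num].get(key)
    num = page_num
    while num is not None:
        node = objects[num]
        if node.get(key) is not None:      # a missing or null entry means "inherit"
            return node[key]
        num = node.get("Parent")           # the root node has no Parent
    if key == "CropBox":                   # CropBox defaults to the MediaBox value
        return page_attribute(objects, page_num, "MediaBox")
    if key in DEFAULTS:
        return DEFAULTS[key]
    raise KeyError(f"required attribute {key} not supplied for page {page_num}")
```

Here page 2 picks up the root's media box, while page 3 keeps its own.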

Note: In a document conforming to the Linearized PDF organization (see

Appendix F), all page attributes must be speciﬁed explicitly as entries in the page dic-

tionaries to which they apply; they may not be inherited from an ancestor node.

Figure 3.5 illustrates the inheritance of attributes. In the page tree shown,

pages 1, 2, and 4 are rotated clockwise by 90 degrees, page 3 by 270 degrees,

page 6 by 180 degrees, and pages 5 and 7 not at all (0 degrees).

FIGURE 3.5 Inheritance of attributes

3.6.3 Name Dictionary

Some categories of objects in a PDF ﬁle can be referred to by name rather than by

object reference. The correspondence between names and objects is established

by the document’s name dictionary (PDF 1.2), located via the

Names entry in the

document’s catalog (see Section 3.6.1, “Document Catalog”). Each entry in this

dictionary designates the root of a name tree (Section 3.8.4, “Name Trees”) de-

ﬁning names for a particular category of objects. Table 3.18 shows the contents of

the name dictionary.

TABLE 3.18 Entries in the name dictionary

KEY TYPE VALUE

Dests name tree (Optional; PDF 1.2) A name tree mapping name strings to destinations (see

“Named Destinations” on page 387).

AP name tree (Optional; PDF 1.3) A name tree mapping name strings to annotation appear-

ance streams (see Section 7.4.4, “Appearance Streams”).

JavaScript name tree (Optional; PDF 1.3) A name tree mapping name strings to document-level Java-

Script actions (see “JavaScript Actions” on page 458).

Pages name tree (Optional; PDF 1.3) A name tree mapping name strings to visible pages for use

in interactive forms (see Section 7.6.5, “Named Pages”).

Templates name tree (Optional; PDF 1.3) A name tree mapping name strings to invisible pages for use

in interactive forms (see Section 7.6.5, “Named Pages”).

IDS name tree (Optional; PDF 1.3) A name tree mapping content set IDs to Web Capture con-

tent sets (see Section 8.5.3, “Content Sets”).

URLS name tree (Optional; PDF 1.3) A name tree mapping uniform resource locators (URLs) to

Web Capture content sets (see Section 8.5.3, “Content Sets”).

3.7 Content Streams and Resources

Content streams are the primary means for describing the appearance of pages

and other graphical elements. A content stream depends on information con-

tained in an associated resource dictionary; in combination, these two objects

form a self-contained entity. This section describes these objects.

3.7.1 Content Streams

A content stream is a PDF stream object whose data consists of a sequence of

instructions describing the graphical elements to be painted on a page. The in-

structions are represented in the form of PDF objects, using the same object syn-

tax as in the rest of the PDF document. However, whereas the document as a

whole is a static, random-access data structure, the objects in the content stream

are intended to be interpreted and acted upon sequentially.

Each page of a document is represented by one or more content streams. Content

streams are also used to package up sequences of instructions as self-contained

graphical elements, such as forms (see Section 4.9, “Form XObjects”), patterns

(Section 4.6, “Patterns”), certain fonts (Section 5.5.4, “Type 3 Fonts”), and anno-

tation appearances (Section 7.4.4, “Appearance Streams”).

A content stream, after decoding with any speciﬁed ﬁlters, is interpreted accord-

ing to the PDF syntax rules described in Section 3.1, “Lexical Conventions.” It

consists of PDF objects denoting operands and operators. The operands needed

by an operator precede it in the stream. See Example 3.2 on page 39 for an exam-

ple of a content stream.

An operand is a direct object belonging to any of the basic PDF data types except

a stream. Dictionaries are permitted as operands only by certain speciﬁc opera-

tors. Indirect objects and object references are not permitted at all.

An operator is a PDF keyword that specifies some action to be performed, such as
painting a graphical shape on the page. An operator keyword is distinguished
from a name object by the absence of an initial slash character (/). Operators are
meaningful only inside a content stream.

Note: This “postﬁx” notation, in which an operator is preceded by its operands, is

superﬁcially the same as in the PostScript language. However, PDF has no concept of

an operand stack as PostScript has. In PDF, all of the operands needed by an operator

must immediately precede that operator. Operators do not return results, and there

may not be operands left over when an operator ﬁnishes execution.
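To make the operand rule concrete, here is a small Python sketch that groups the tokens of a content stream into (operands, operator) pairs. It is deliberately simplified and rests on several assumptions: tokens are separated by white space, and only numbers, names, and the keywords true, false, and null are treated as operands, so strings, arrays, dictionaries, and comments are not handled. The parse_instructions name and the sample data are hypothetical.

```python
# Keywords that are operands, not operators.
KEYWORD_OPERANDS = {"true": True, "false": False, "null": None}


def parse_instructions(data):
    """Group whitespace-separated tokens into (operands, operator) pairs."""
    instructions = []
    operands = []
    for token in data.split():
        if token.startswith("/"):
            operands.append(token)                 # name object
        elif token in KEYWORD_OPERANDS:
            operands.append(KEYWORD_OPERANDS[token])
        else:
            try:
                operands.append(float(token) if "." in token else int(token))
            except ValueError:
                # Neither a name nor a number: treat it as an operator keyword.
                instructions.append((operands, token))
                operands = []                      # nothing carries over
    if operands:
        raise ValueError("operands left over with no operator")
    return instructions


sample = "/F1 12 Tf 72 712 Td 0.5 g 10 20 100 50 re f"
parsed = parse_instructions(sample)
```

Each operator receives exactly the operands collected since the previous operator, which is why no operand stack is needed.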

Most operators have to do with painting graphical elements on the page or with

specifying parameters that affect subsequent painting operations. The individual

operators are described in the chapters devoted to their functions:

• Chapter 4 describes operators that paint general graphics, such as ﬁlled areas,

strokes, and sampled images, and that specify device-independent graphical

parameters, such as color.

• Chapter 5 describes operators that paint text using character glyphs deﬁned in

fonts.

• Chapter 6 describes operators that specify device-dependent rendering param-

eters.

• Chapter 8 describes the marked-content operators that associate higher-level

logical information with objects in the content stream. These operators do not

affect the rendered appearance of the content; rather, they specify information

useful to applications that use PDF for document interchange.

Ordinarily, when a viewer application encounters an operator in a content stream

that it does not recognize, an error will occur. (See implementation note 22 in

Appendix H.) A pair of compatibility operators,

BX and EX (PDF 1.1), modify

this behavior (see Table 3.19). These operators must occur in pairs and may be

nested. They bracket a compatibility section, a portion of a content stream within

which unrecognized operators are to be ignored without error. This mechanism

enables a PDF document to use operators deﬁned in newer versions of PDF with-

out sacriﬁcing compatibility with older viewers; it should be used only in cases

where ignoring such newer operators is the appropriate thing to do. The BX and

EX operators are not themselves part of any graphics object (see Section 4.1,

“Graphics Objects”) or of the graphics state (Section 4.3, “Graphics State”).

TABLE 3.19 Compatibility operators

OPERANDS OPERATOR DESCRIPTION

— BX (PDF 1.1) Begin a compatibility section. Unrecognized operators (along with their

operands) will be ignored without error until the balancing

EX operator is encoun-

tered.

— EX (PDF 1.1) End a compatibility section begun by a balancing BX operator.
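The handling of compatibility sections might be sketched as follows. This is an illustration under assumptions, not a description of any viewer: instructions are taken to be (operands, operator) pairs that have already been parsed, and KNOWN_OPERATORS is a hypothetical stand-in for the set of operators a given viewer recognizes. A nesting counter tracks how many BX operators are currently unbalanced.

```python
# Hypothetical set of operators this toy interpreter recognizes.
KNOWN_OPERATORS = {"re", "f", "S", "g"}


def run(instructions):
    """Return the recognized operators executed, honoring BX/EX sections."""
    depth = 0          # nesting depth of compatibility sections
    executed = []
    for operands, op in instructions:
        if op == "BX":
            depth += 1
        elif op == "EX":
            if depth == 0:
                raise ValueError("EX without a balancing BX")
            depth -= 1
        elif op in KNOWN_OPERATORS:
            executed.append(op)
        elif depth == 0:
            raise ValueError(f"unrecognized operator {op!r}")
        # Otherwise the operator is unrecognized but lies inside a
        # compatibility section, so it is dropped along with its operands.
    if depth != 0:
        raise ValueError("BX without a balancing EX")
    return executed
```

Recognized operators are still executed inside a compatibility section; only unrecognized ones are skipped.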

3.7.2 Resource Dictionaries

As stated above, the operands supplied to operators in a content stream may only

be direct objects; indirect objects and object references are not permitted. In

some cases, an operator needs to refer to a PDF object that is deﬁned outside the

content stream, such as a font dictionary or a stream containing image data. This

can be accomplished by deﬁning such objects as named resources and referring to

them by name from within the content stream.

Note: Named resources are meaningful only in the context of a content stream. The

scope of a resource name is local to a particular content stream, and is unrelated to

externally known identiﬁers for objects such as fonts. References from one object to

another outside of content streams should be made by means of indirect object refer-

ences rather than named resources.

A content stream’s named resources are deﬁned by a resource dictionary, which

enumerates the named resources needed by the operators in the content stream

and the names by which they can be referred to. For example, if a text operator

appearing within the content stream needed a certain font, the content stream’s

resource dictionary might associate the name

F42 with the corresponding font

dictionary. The text operator could then use this name to refer to the font.

A resource dictionary is associated with a content stream in one of the following

ways:

• For a content stream that is the value of a page’s Contents entry (or is an

element of an array that is the value of that entry), the resource dictionary is

designated by the page dictionary’s

Resources entry. (Since a page’s Resources

attribute is inheritable, as described under “Inheritance of Page Attributes” on

page 80, it may actually reside in some ancestor node of the page object.)

• For other content streams, the resource dictionary is speciﬁed by the Resources

entry in the stream dictionary of the content stream itself. This applies to con-

tent streams that deﬁne form XObjects, patterns, Type 3 fonts, and annotation

appearances.

• A form XObject or a Type 3 font’s glyph description may omit the Resources

entry, in which case resources will be looked up in the Resources entry of the

page on which the form or font is used. This practice is not recommended.

In the context of a given content stream, the term current resource dictionary

refers to the resource dictionary associated with the stream in one of the ways

described above.

Each key in a resource dictionary is the name of a resource type, as shown in

Table 3.20. For most resource types, the corresponding value is a subdictionary

whose keys, in turn, are the names of resources of the given type and whose

values are the PDF objects representing those resources. (For resource type

ProcSet, the value is an array of procedure set names instead of a subdictionary.)

TABLE 3.20 Entries in a resource dictionary

KEY TYPE VALUE

ExtGState dictionary (Optional) A dictionary mapping resource names to graphics state parameter

dictionaries (see Section 4.3.4, “Graphics State Parameter Dictionaries”).

ColorSpace dictionary (Optional) A dictionary mapping each resource name to either the name of a

device-dependent color space or an array describing a color space (see

Section 4.5, “Color Spaces”).

Pattern dictionary (Optional) A dictionary mapping resource names to pattern objects (see

Section 4.6, “Patterns”).

Shading dictionary (Optional; PDF 1.3) A dictionary mapping resource names to shading dic-

tionaries (see “Shading Dictionaries” on page 214).

XObject dictionary (Optional) A dictionary mapping resource names to external objects (see

Section 4.7, “External Objects”).

Font dictionary (Optional) A dictionary mapping resource names to font dictionaries (see

Chapter 5).

ProcSet array (Optional) An array of predeﬁned procedure set names (see Section 8.1,

“Procedure Sets”).

Properties dictionary (Optional; PDF 1.2) A dictionary mapping resource names to property list

dictionaries for marked content (see “Property Lists” on page 481).

Example 3.10 shows a resource dictionary containing procedure sets, fonts, and

external objects. The procedure sets are speciﬁed by an array, as described in

Section 8.1, “Procedure Sets.” The fonts are speciﬁed with a subdictionary associ-

ating the names

F5, F6, F7, and F8 with objects 6, 8, 10, and 12, respectively. Like-

wise, the

XObject subdictionary associates the names Im1 and Im2 with objects 13

and 15, respectively.

Example 3.10

<< /ProcSet [/PDF /ImageB]
   /Font << /F5 6 0 R
            /F6 8 0 R
            /F7 10 0 R
            /F8 12 0 R
         >>
   /XObject << /Im1 13 0 R
               /Im2 15 0 R
            >>
>>
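The two-level structure of Example 3.10 (resource type first, then resource name) can be illustrated with a short Python sketch. The nested dictionary below is a hypothetical in-memory model of the example, with each indirect reference such as 6 0 R modeled as an (object number, generation number) tuple; lookup_resource is an illustrative helper, not part of any library.

```python
# Example 3.10 modeled as nested dictionaries.
resources = {
    "ProcSet": ["PDF", "ImageB"],
    "Font": {"F5": (6, 0), "F6": (8, 0), "F7": (10, 0), "F8": (12, 0)},
    "XObject": {"Im1": (13, 0), "Im2": (15, 0)},
}


def lookup_resource(resources, resource_type, name):
    """Return the object that a content stream refers to by `name`."""
    try:
        return resources[resource_type][name]
    except KeyError:
        raise KeyError(f"no {resource_type} resource named {name}") from None
```

A text operator naming F5 would thus resolve to object 6, and an operator naming Im2 to object 15.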

3.8 Common Data Structures

As mentioned at the beginning of this chapter, there are some general-purpose

data structures that are built from the basic object types described in Section 3.2,

“Objects,” and are used in many places throughout PDF. This section describes

data structures for text strings, dates, rectangles, name trees, and number trees.

The subsequent two sections describe more complex data structures for func-

tions and ﬁle speciﬁcations.

All of these data structures are meaningful only as part of the document hier-

archy; they cannot appear within content streams. In particular, the special con-

ventions for interpreting the values of string objects apply only to strings outside

content streams. An entirely different convention is used within content streams

for using strings to select sequences of glyphs to be painted on the page (see

Chapter 5). Table 3.21 summarizes the basic and higher-level data types that are

used throughout this book to describe the values of dictionary entries and other

PDF data values.

TABLE 3.21 PDF data types

TYPE                DESCRIPTION                                SECTION   PAGE

array               Array object                               3.2.5     32
boolean             Boolean value                              3.2.1     26
date                Date (string)                              3.8.2     89
dictionary          Dictionary object                          3.2.6     32
file specification  File specification (string or dictionary)  3.10      107
function            Function (dictionary or stream)            3.9       95
integer             Integer number                             3.2.2     26
name                Name object                                3.2.4     30
name tree           Name tree (dictionary)                     3.8.4     90
null                Null object                                3.2.8     39
number              Number (integer or real)                   3.2.2     26
number tree         Number tree (dictionary)                   3.8.5     94
rectangle           Rectangle (array)                          3.8.3     90
stream              Stream object                              3.2.7     33
string              String object                              3.2.3     27
text string         Text string                                3.8.1     88

3.8.1 Text Strings

Certain strings contain information that is intended to be human-readable, such

as text annotations, bookmark names, article names, document information, and

so forth. Such strings are referred to as text strings. Text strings are encoded in

either

PDFDocEncoding or Unicode character encoding. PDFDocEncoding is a

superset of the ISO Latin 1 encoding and is documented in Appendix D. Unicode

is described in the document The Unicode Standard (see the Bibliography).

For text strings encoded in Unicode, the ﬁrst two bytes must be 254 followed by

255, representing the Unicode byte order marker,

U+FEFF. (This sequence con-

ﬂicts with the

PDFDocEncoding character sequence thorn ydieresis, which is un-

likely to be a meaningful beginning of a word or phrase.) The remainder of the

string consists of Unicode character codes, according to the UTF-16 encoding

speciﬁed in the Unicode standard, version 2.0. Commonly used Unicode values

are represented as 2 bytes per character, with the high-order byte appearing ﬁrst

in the string.
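A decoder for text strings might begin as in the following Python sketch, which checks for the byte order marker and otherwise falls back to a single-byte decoding. Note the assumption: Python has no built-in PDFDocEncoding codec, so Latin-1 is used here as an approximate stand-in, and it may give wrong results for codes that PDFDocEncoding assigns differently (Appendix D is the authority). The decode_text_string name is hypothetical.

```python
def decode_text_string(raw):
    """Decode the bytes of a PDF text string into a Python str."""
    if raw[:2] == b"\xfe\xff":             # bytes 254, 255: byte order marker
        # High-order byte first, so the remainder is big-endian UTF-16.
        return raw[2:].decode("utf-16-be")
    # ASSUMPTION: Latin-1 approximates PDFDocEncoding; a complete decoder
    # would use the table in Appendix D instead.
    return raw.decode("latin-1")
```

The marker itself is not part of the decoded text, so it is stripped before decoding.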

Anywhere in a Unicode text string, an escape sequence may appear to indicate the

language in which subsequent text is written; this is useful when the language

cannot be determined from the character codes used in the text itself. The escape

sequence consists of the following elements, in order:

1. The Unicode value U+001B (that is, the byte sequence 0 followed by 27)

2. A 2-character ISO 639 language code (for example, EN for English or JA for Japanese)

3. (Optional) A 2-character ISO 3166 country code (for example, US for the United States or JP for Japan)

4. The Unicode value U+001B

The complete list of codes deﬁned by ISO 639 and ISO 3166 can be obtained

from the International Organization for Standardization (see the Bibliography).
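The escape sequence can be recognized with a regular expression once the text string has been decoded to Unicode. The sketch below is one possible approach under that assumption; split_language_runs is a hypothetical helper that returns the text as a list of (language, country, text) runs, with None for a code that has not been specified.

```python
import re

# ESC, two-letter language code, optional two-letter country code, ESC.
_ESCAPE = re.compile("\x1b([A-Za-z]{2})([A-Za-z]{2})?\x1b")


def split_language_runs(text):
    """Split decoded Unicode text into (language, country, text) runs."""
    runs = []
    language = country = None
    pos = 0
    for match in _ESCAPE.finditer(text):
        if match.start() > pos:
            runs.append((language, country, text[pos:match.start()]))
        language, country = match.group(1), match.group(2)
        pos = match.end()
    if pos < len(text):
        runs.append((language, country, text[pos:]))
    return runs
```

Text that precedes the first escape sequence is reported with no language, reflecting that nothing has been declared for it.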

3.8.2 Dates

PDF deﬁnes a standard date format, which closely follows that of the internation-

al standard ASN.1 (Abstract Syntax Notation One), deﬁned in ISO/IEC 8824 (see

the Bibliography). A date is a string of the form

(D:YYYYMMDDHHmmSSOHH'mm')

where

YYYY is the year

MM is the month

DD is the day (01–31)

HH is the hour (00–23)

mm is the minute (00–59)

SS is the second (00–59)

O is the relationship of local time to Universal Time (UT), denoted by one of

the characters +, -, or Z (see below)

HH followed by ' is the absolute value of the offset from UT in hours (00–23)

mm followed by ' is the absolute value of the offset from UT in minutes (00–59)

The quotation mark character (') after HH and mm is part of the syntax. All fields
after the year are optional. (The prefix D:, although also optional, is strongly
recommended.) The default values for MM and DD are both 01; all other numerical
fields default to zero values. A plus sign (+) as the value of the O field signifies
that local time is later than UT, a minus sign (-) that local time is earlier than UT,
and the letter Z that local time is equal to UT. If no UT information is specified, the
relationship of the specified time to UT is considered to be unknown. Whether or
not the time zone is known, the rest of the date should be specified in local time.

For example, December 23, 1998, at 7:52 PM, U.S. Paciﬁc Standard Time, is rep-

resented by the string

D:199812231952-08'00'
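A date string of this form could be parsed along the following lines. The Python sketch is a minimal illustration, assuming the sign is written with the ASCII characters + and -; parse_pdf_date is a hypothetical helper name. Fields after the year are optional and take the defaults given above, and when no UT relationship is present the result carries no time zone.

```python
import re
from datetime import datetime, timedelta, timezone

_DATE = re.compile(
    r"(?:D:)?(?P<Y>\d{4})(?P<M>\d{2})?(?P<D>\d{2})?"
    r"(?P<h>\d{2})?(?P<m>\d{2})?(?P<s>\d{2})?"
    r"(?:(?P<O>[-+Z])(?:(?P<oh>\d{2})')?(?:(?P<om>\d{2})')?)?$"
)


def parse_pdf_date(text):
    """Parse a PDF date string; tzinfo is None when the UT offset is unknown."""
    match = _DATE.match(text)
    if match is None:
        raise ValueError(f"not a PDF date: {text!r}")
    g = match.groupdict()
    # MM and DD default to 01; the other numeric fields default to zero.
    value = datetime(
        int(g["Y"]), int(g["M"] or 1), int(g["D"] or 1),
        int(g["h"] or 0), int(g["m"] or 0), int(g["s"] or 0),
    )
    if g["O"] is None:
        return value                       # relationship to UT unknown
    offset = timedelta(hours=int(g["oh"] or 0), minutes=int(g["om"] or 0))
    if g["O"] == "-":
        offset = -offset                   # local time is earlier than UT
    return value.replace(tzinfo=timezone(offset))
```

Applied to the example string above, the function yields 7:52 PM on December 23, 1998, with an offset of eight hours behind UT.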

3.8.3 Rectangles

Rectangles are used to describe locations on a page and bounding boxes for a

variety of objects, such as fonts. A rectangle is written as an array of four numbers

giving the coordinates of a pair of diagonally opposite corners. Typically, the

array takes the form

[llx lly urx ury]

specifying the lower-left x, lower-left y, upper-right x, and upper-right y coordi-

nates of the rectangle, in that order.

Note: Although rectangles are conventionally speciﬁed by their lower-left and upper-

right corners, it is acceptable to specify any two diagonally opposite corners. Applica-

tions that process PDF should be prepared to normalize such rectangles in situations

where speciﬁc corners are required.
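Normalization amounts to sorting the x and y coordinates independently, as the following Python sketch shows. The normalize_rect name is hypothetical; the function accepts any two diagonally opposite corners and returns the lower-left corner followed by the upper-right corner.

```python
def normalize_rect(rect):
    """Return [lower-left x, lower-left y, upper-right x, upper-right y]."""
    x1, y1, x2, y2 = rect
    return [min(x1, x2), min(y1, y2), max(x1, x2), max(y1, y2)]
```

For instance, a rectangle given by its lower-right and upper-left corners, [612 0 0 792], normalizes to [0 0 612 792].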

3.8.4 Name Trees

A name tree serves a similar purpose to a dictionary—associating keys and

values—but by different means. A name tree differs from a dictionary in the fol-

lowing important ways:

• Unlike the keys in a dictionary, which are name objects, those in a name tree

are strings.

• The keys are ordered.

• The values associated with the keys may be objects of any type, but they must

always be speciﬁed via indirect object references.

• The data structure can represent an arbitrarily large collection of key-value

pairs, which can be looked up efﬁciently without requiring the entire data

structure to be read from the PDF ﬁle. (In contrast, a dictionary is subject to an

implementation limit on the number of entries it can contain.)

A name tree is constructed of nodes, each of which is a dictionary object.

Table 3.22 shows the entries in a node dictionary. The nodes are of three kinds,

depending on the speciﬁc entries they contain. The tree always has exactly one

root node, which contains a single entry: either

Kids or Names but not both. If the

root node has a

Names entry, it is the only node in the tree. If it has a Kids entry,

then each of the remaining nodes is either an intermediate node, containing a

Limits entry and a Kids entry, or a leaf node, containing a Limits entry and a

Names entry.

TABLE 3.22 Entries in a name tree node dictionary

KEY      TYPE     VALUE

Kids     array    (Root and intermediate nodes only; required in intermediate nodes; present in the root node if and only if Names is not present) An array of indirect references to the immediate children of this node. The children may be intermediate or leaf nodes.

Names    array    (Root and leaf nodes only; required in leaf nodes; present in the root node if and only if Kids is not present) An array of the form

                  [$\mathit{key}_1$ $\mathit{value}_1$ $\mathit{key}_2$ $\mathit{value}_2$ … $\mathit{key}_n$ $\mathit{value}_n$]

                  where each $\mathit{key}_i$ is a string and the corresponding $\mathit{value}_i$ is an indirect reference to the object associated with that key. The keys are sorted in lexical order, as described below.

Limits   array    (Intermediate and leaf nodes only; required) An array of two strings, specifying the (lexically) least and greatest keys included in the Names array of a leaf node or in the Names arrays of any leaf nodes that are descendants of an intermediate node.

The Kids entries in the root and intermediate nodes deﬁne the tree’s structure by

identifying the immediate children of each node. The

Names entries in the leaf

(or root) nodes contain the tree’s keys and their associated values, arranged in

key-value pairs and sorted lexically in ascending order by key. Shorter keys

appear before longer ones beginning with the same byte sequence. The encoding

of the keys is immaterial as long as it is self-consistent; keys are compared for

equality on a simple byte-by-byte basis.

The keys contained within the various nodes’

Names entries do not overlap; that

is, each

Names entry contains a single contiguous range of all the keys in the tree.

In a leaf node, the

Limits entry speciﬁes the least and greatest keys contained

within the node’s

Names entry; in an intermediate node, it speciﬁes the least and

greatest keys contained within the

Names entries of any of that node’s descen-

dants. The value associated with a given key can thus be found by walking the tree

in order, searching for the leaf node whose

Names entry contains that key.
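The lookup described above can be sketched in Python. This is only an illustration under stated assumptions: the `objects` mapping, the use of an object number to stand in for an indirect reference, and the function name `name_tree_lookup` are all hypothetical, and values are stored inline rather than as indirect references to keep the example short.

```python
from typing import Any, Dict, Optional

# Hypothetical in-memory model: `objects` maps an object number to a node
# dictionary, and an indirect reference is represented by that number.
def name_tree_lookup(objects: Dict[int, dict], root: int, key: bytes) -> Optional[Any]:
    """Return the value stored under `key`, or None if the key is absent."""
    node = objects[root]
    while True:
        if "Names" in node:
            names = node["Names"]  # flat array: key, value, key, value, ...
            for i in range(0, len(names), 2):
                if names[i] == key:  # byte-by-byte equality
                    return names[i + 1]
            return None
        # Descend into the one child whose Limits could contain the key.
        for kid in node["Kids"]:
            child = objects[kid]
            low, high = child["Limits"]
            if low <= key <= high:  # bytes compare lexically, shorter prefix first
                node = child
                break
        else:
            return None  # key lies outside every child's range


# A two-leaf tree holding a few of the element names (values inline for brevity).
objects = {
    1: {"Kids": [2, 3]},
    2: {"Limits": [b"Actinium", b"Astatine"],
        "Names": [b"Actinium", 89, b"Aluminum", 13, b"Astatine", 85]},
    3: {"Limits": [b"Xenon", b"Zirconium"],
        "Names": [b"Xenon", 54, b"Zinc", 30, b"Zirconium", 40]},
}
```

Because each node's Limits entry bounds everything beneath it, only the nodes on a single path from the root to one leaf need to be read.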

Table 3.23 is an abbreviated outline, showing object numbers and nodes, of a

name tree that maps the names of all the chemical elements, from actinium to

zirconium, to their atomic numbers. Example 3.11 shows the representation of

this tree in a PDF ﬁle.

TABLE 3.23 Example of a name tree

1: Root node
   2: Intermediate node: Actinium to Gold
      5: Leaf node: Actinium = 25, …, Astatine = 31
         25: Integer: 89
         …
         31: Integer: 85
      …
      11: Leaf node: Gadolinium = 56, …, Gold = 59
         56: Integer: 64
         …
         59: Integer: 79
   3: Intermediate node: Hafnium to Protactinium
      12: Leaf node: Hafnium = 60, …, Hydrogen = 65
         60: Integer: 72
         …
         65: Integer: 1
      …
      19: Leaf node: Palladium = 92, …, Protactinium = 100
         92: Integer: 46
         …
         100: Integer: 91
   4: Intermediate node: Radium to Zirconium
      20: Leaf node: Radium = 101, …, Ruthenium = 107
         101: Integer: 88
         …
         107: Integer: 44
      …
      24: Leaf node: Xenon = 129, …, Zirconium = 133
         129: Integer: 54
         …
         133: Integer: 40

Example 3.11

1 0 obj
   << /Kids [ 2 0 R                          % Root node
              3 0 R
              4 0 R
            ]
   >>
endobj

2 0 obj
   << /Limits [(Actinium) (Gold)]            % Intermediate node
      /Kids [ 5 0 R
              6 0 R
              7 0 R
              8 0 R
              9 0 R
              10 0 R
              11 0 R
            ]
   >>
endobj

3 0 obj
   << /Limits [(Hafnium) (Protactinium)]     % Intermediate node
      /Kids [ 12 0 R
              13 0 R
              14 0 R
              15 0 R
              16 0 R
              17 0 R
              18 0 R
              19 0 R
            ]
   >>
endobj

4 0 obj
   << /Limits [(Radium) (Zirconium)]         % Intermediate node
      /Kids [ 20 0 R
              21 0 R
              22 0 R
              23 0 R
              24 0 R
            ]
   >>
endobj

5 0 obj
   << /Limits [(Actinium) (Astatine)]        % Leaf node
      /Names [ (Actinium) 25 0 R
               (Aluminum) 26 0 R
               (Americium) 27 0 R
               (Antimony) 28 0 R
               (Argon) 29 0 R
               (Arsenic) 30 0 R
               (Astatine) 31 0 R
             ]
   >>
endobj

…

24 0 obj
   << /Limits [(Xenon) (Zirconium)]          % Leaf node
      /Names [ (Xenon) 129 0 R
               (Ytterbium) 130 0 R
               (Yttrium) 131 0 R
               (Zinc) 132 0 R
               (Zirconium) 133 0 R
             ]
   >>
endobj

25 0 obj
   89                                        % Atomic number (Actinium)
endobj

…

133 0 obj
   40                                        % Atomic number (Zirconium)
endobj

3.8.5 Number Trees

A number tree is similar to a name tree (see Section 3.8.4, “Name Trees”), except

that its keys are integers instead of strings, sorted in ascending numerical order.

The entries in the leaf (or root) nodes containing the key-value pairs are named

Nums instead of Names as in a name tree. Table 3.24 shows the entries in a num-

ber tree’s node dictionaries.

TABLE 3.24 Entries in a number tree node dictionary

KEY      TYPE     VALUE

Kids     array    (Root and intermediate nodes only; required in intermediate nodes; present in the root node if and only if Nums is not present) An array of indirect references to the immediate children of this node. The children may be intermediate or leaf nodes.

Nums     array    (Root and leaf nodes only; required in leaf nodes; present in the root node if and only if Kids is not present) An array of the form

                  [$\mathit{key}_1$ $\mathit{value}_1$ $\mathit{key}_2$ $\mathit{value}_2$ … $\mathit{key}_n$ $\mathit{value}_n$]

                  where each $\mathit{key}_i$ is an integer and the corresponding $\mathit{value}_i$ is an indirect reference to the object associated with that key. The keys are sorted in numerical order, analogously to the arrangement of keys in a name tree as described in Section 3.8.4, “Name Trees.”

Limits   array    (Intermediate and leaf nodes only; required) An array of two integers, specifying the (numerically) least and greatest keys included in the Nums array of a leaf node or in the Nums arrays of any leaf nodes that are descendants of an intermediate node.

3.9 Functions

PDF is not a programming language, and a PDF ﬁle is not a program; however,

PDF does provide several types of function object (PDF 1.2) that represent param-

eterized classes of functions, including mathematical formulas and sampled

representations with arbitrary resolution. Functions are used in various ways in

PDF: device-dependent rasterization information for high-quality printing (half-

tone spot functions and transfer functions), color transform functions for certain

color spaces, and speciﬁcation of colors as a function of position for smooth

shadings.

Functions in PDF represent static, self-contained numerical transformations. A function to add two numbers has two input values and one output value:

$$f(x_0, x_1) = x_0 + x_1$$

Similarly, a function that computes the arithmetic and geometric mean of two numbers could be viewed as a function of two input values and two output values:

$$f(x_0, x_1) = \left( \frac{x_0 + x_1}{2},\ \sqrt{x_0 \times x_1} \right)$$

In general, a function can take any number (m) of input values and produce any number (n) of output values:

$$f(x_0, \ldots, x_{m-1}) = y_0, \ldots, y_{n-1}$$

In PDF functions, all the input values and all the output values are numbers, and functions have no side effects.

Each function definition includes a domain, the set of legal values for the input. Some types of function also define a range, the set of legal values for the output. Input values passed to the function are clipped to the domain, and output values produced by the function are clipped to the range. For example, suppose the function

$$f(x) = x + 2$$

is defined with a domain of [−1 1]. If the function is called with the input value 6, that value is replaced with the nearest value in the defined domain, 1, before the function is evaluated; the resulting output value is therefore 3. Similarly, if the function

$$f(x_0, x_1) = 3 \times x_0 + x_1$$

is defined with a range of [0 100], and if the input values −6 and 4 are passed to the function (and are within its domain), then the output value produced by the function, −14, is replaced with 0, the nearest value in the defined range.

A function object may be a dictionary or a stream, depending on the type of function; the term function dictionary will be used generically in this section to refer to either a dictionary object or the dictionary portion of a stream object. A function dictionary specifies the function's representation, the set of attributes that parameterize that representation, and the additional data needed by that representation. Four types of function are available, as indicated by the dictionary's FunctionType entry:

• (PDF 1.2) A sampled function (type 0) uses a table of sample values to define the function. Various techniques are used to interpolate values between the sample values.

• (PDF 1.3) An exponential interpolation function (type 2) defines a set of coefficients for an exponential function.

• (PDF 1.3) A stitching function (type 3) is a combination of other functions, partitioned across a domain.

• (PDF 1.3) A PostScript calculator function (type 4) uses operators from the PostScript language to describe an arithmetic expression.

All function dictionaries share the entries listed in Table 3.25.

TABLE 3.25 Entries common to all function dictionaries

KEY            TYPE      VALUE

FunctionType   integer   (Required) The function type:

                            0   Sampled function
                            2   Exponential interpolation function
                            3   Stitching function
                            4   PostScript calculator function

Domain         array     (Required) An array of 2 × m numbers, where m is the number of input values. For each i from 0 to m − 1, $\mathit{Domain}_{2i}$ must be less than or equal to $\mathit{Domain}_{2i+1}$, and the ith input value, $x_i$, must lie in the interval $\mathit{Domain}_{2i} \le x_i \le \mathit{Domain}_{2i+1}$. Input values outside the declared domain are clipped to the nearest boundary value.

Range          array     (Required for type 0 and type 4 functions, optional otherwise; see below) An array of 2 × n numbers, where n is the number of output values. For each j from 0 to n − 1, $\mathit{Range}_{2j}$ must be less than or equal to $\mathit{Range}_{2j+1}$, and the jth output value, $y_j$, must lie in the interval $\mathit{Range}_{2j} \le y_j \le \mathit{Range}_{2j+1}$. Output values outside the declared range are clipped to the nearest boundary value. If this entry is absent, no clipping is done.
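The clipping rule for Domain and Range can be sketched as follows. The helper name `clip_to_intervals` is hypothetical; the two calls replay the worked examples given earlier in this section (the input 6 with a domain of [−1 1], and the raw result −14 with a range of [0 100]).

```python
from typing import List, Sequence

def clip_to_intervals(values: Sequence[float], bounds: Sequence[float]) -> List[float]:
    """Clip values[i] to the closed interval [bounds[2i], bounds[2i+1]]."""
    if len(bounds) != 2 * len(values):
        raise ValueError("bounds must hold one (low, high) pair per value")
    return [min(max(v, bounds[2 * i]), bounds[2 * i + 1])
            for i, v in enumerate(values)]

# f(x) = x + 2 with Domain [-1 1]: the input 6 is clipped to 1, giving 3.
(x,) = clip_to_intervals([6], [-1, 1])
result_a = x + 2

# f(x0, x1) = 3 * x0 + x1 with Range [0 100]: the raw result -14 becomes 0.
(result_b,) = clip_to_intervals([3 * -6 + 4], [0, 100])
```

The same helper serves for both entries because Domain and Range share one layout: a flat array of (low, high) pairs, one pair per value.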

In addition, each type of function dictionary must include entries appropriate to

the particular function type. The number of output values can usually be inferred

from other attributes of the function; if not (as is always the case for type 0 and

type 4 functions), the

Range entry is required. The dimensionality of the func-

tion implied by the

Domain and Range entries must be consistent with that im-

plied by other attributes of the function.

3.9.1 Type 0 (Sampled) Functions

Type 0 functions use a sequence of sample values (contained in a stream) to pro-

vide an approximation for functions whose domains and ranges are bounded.

The samples are organized as an m-dimensional table in which each entry has n

components.

Sampled functions are highly general and offer reasonably accurate representa-

tions of arbitrary analytic functions at low expense. For example, a 1-input sinus-

oidal function can be represented over the range

[0 180] with an average error of

only 1 percent, using just ten samples and linear interpolation. Two-input func-

tions require signiﬁcantly more samples, but usually not a prohibitive number, so

long as the function does not have high frequency variations.

The dimensionality of a sampled function is restricted only by implementation

limits. However, the number of samples required to represent high-dimensionality

functions multiplies rapidly unless the sampling resolution is very low. Also, the

process of multilinear interpolation becomes computationally intensive if the

number of inputs m is greater than 2. The multidimensional spline interpolation

is even more computationally intensive.

In addition to the entries in Table 3.25, a type 0 function dictionary includes

those shown in Table 3.26.

TABLE 3.26 Additional entries speciﬁc to a type 0 function dictionary

KEY             TYPE       VALUE

Size            array      (Required) An array of m positive integers specifying the number of samples in each input dimension of the sample table.

BitsPerSample   integer    (Required) The number of bits used to represent each sample. (If the function has multiple output values, each one occupies BitsPerSample bits.) Valid values are 1, 2, 4, 8, 12, 16, 24, and 32.

Order           integer    (Optional) The order of interpolation between samples. Valid values are 1 and 3, specifying linear and cubic spline interpolation, respectively. (See implementation note 23 in Appendix H.) Default value: 1.

Encode          array      (Optional) An array of 2 × m numbers specifying the linear mapping of input values into the domain of the function's sample table. Default value: [0 ($\mathit{Size}_0$ − 1) 0 ($\mathit{Size}_1$ − 1) …].

Decode          array      (Optional) An array of 2 × n numbers specifying the linear mapping of sample values into the range appropriate for the function's output values. Default value: Same as the value of Range.

other stream    (various)  (Optional) Other attributes of the stream that provides the sample values, as appropriate (see Table 3.4 on page 35).
attributes

The Domain, Encode, and Size entries determine how the function’s input vari-

able values are mapped into the sample table. For example, if

Size is [21 31], the

default

Encode array is [0 20 0 30], which maps the entire domain into the full

set of sample table entries. Other values of

Encode may be used.

To explain the relationship between Domain, Encode, Size, Decode, and Range, we use the following notation:

$$y = \mathrm{Interpolate}(x, x_{min}, x_{max}, y_{min}, y_{max}) = \left( (x - x_{min}) \times \frac{y_{max} - y_{min}}{x_{max} - x_{min}} \right) + y_{min}$$

For a given value of x, Interpolate calculates the y value on the line defined by the two points $(x_{min}, y_{min})$ and $(x_{max}, y_{max})$.

When a sampled function is called, each input value $x_i$, for 0 ≤ i < m, is clipped to the domain:

$$x_i' = \min(\max(x_i, \mathit{Domain}_{2i}), \mathit{Domain}_{2i+1})$$

That value is encoded:

$$e_i = \mathrm{Interpolate}(x_i', \mathit{Domain}_{2i}, \mathit{Domain}_{2i+1}, \mathit{Encode}_{2i}, \mathit{Encode}_{2i+1})$$

That value is clipped to the size of the sample table in that dimension:

$$e_i' = \min(\max(e_i, 0), \mathit{Size}_i - 1)$$

The encoded input values are real numbers, not restricted to integers. Interpolation is then used to determine output values from the nearest surrounding values in the sample table. Each output value $r_j$, for 0 ≤ j < n, is then decoded:

$$r_j' = \mathrm{Interpolate}(r_j, 0, 2^{\mathit{BitsPerSample}} - 1, \mathit{Decode}_{2j}, \mathit{Decode}_{2j+1})$$

Finally, each decoded value is clipped to the range:

$$y_j = \min(\max(r_j', \mathit{Range}_{2j}), \mathit{Range}_{2j+1})$$

Sample data is represented as a stream of unsigned 8-bit bytes (integers in the range 0 to 255). The bytes constitute a continuous bit stream, with the high-order bit of each byte first. Each sample value is represented as a sequence of BitsPerSample bits. Successive values are adjacent in the bit stream; there is no padding at byte boundaries.

For a function with multidimensional input (more than one input variable), the sample values in the first dimension vary fastest, and the values in the last dimension vary slowest. For example, for a function f(a, b, c), where a, b, and c vary from 0 to 9 in steps of 1, the sample values would appear in this order: f(0, 0, 0), f(1, 0, 0), …, f(9, 0, 0), f(0, 1, 0), f(1, 1, 0), …, f(9, 1, 0), f(0, 2, 0), f(1, 2, 0), …, f(9, 9, 0), f(0, 0, 1), f(1, 0, 1), and so on.

For a function with multidimensional output (more than one output value), the values are stored in the same order as Range.

The stream data must be long enough to contain the entire sample array, as indicated by Size, Range, and BitsPerSample; see “Stream Extent” on page 36.

Example 3.12 illustrates a sampled function with 4-bit samples in an array containing 21 columns and 31 rows (651 values). The function takes two arguments, x and y, in the domain [−1.0 1.0], and returns one value, z, in that same range. The x argument is linearly transformed by the encoding to the domain [0 20] and the y argument to the domain [0 30]. Using bilinear interpolation between sample points, the function computes a value for z, which (because BitsPerSample is 4) will be in the range [0 15], and the decoding transforms z to a number in the range [−1.0 1.0] for the result. The sample array is stored in a string of 326 bytes, calculated as follows (rounded up):

326 bytes = 31 rows × 21 samples/row × 4 bits/sample ÷ 8 bits/byte

The first byte contains the sample for the point (−1.0, −1.0) in the high-order 4 bits and the sample for the point (−0.9, −1.0) in the low-order 4 bits.
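The bit packing and sample ordering described above can be sketched in Python. The helper names `unpack_samples`, `sample_index`, and `stream_length` are hypothetical, and the sketch assumes the stream data has already been decoded by any filters.

```python
from typing import List, Sequence

def unpack_samples(data: bytes, bits_per_sample: int, count: int) -> List[int]:
    """Read `count` unsigned samples from a bit stream, high-order bit first."""
    total_bits = len(data) * 8
    if total_bits < count * bits_per_sample:
        raise ValueError("stream data too short for the sample table")
    stream = int.from_bytes(data, "big")  # the whole stream as one integer
    mask = (1 << bits_per_sample) - 1
    samples = []
    for k in range(count):
        # Samples are adjacent in the bit stream, with no byte-boundary padding.
        shift = total_bits - (k + 1) * bits_per_sample
        samples.append((stream >> shift) & mask)
    return samples

def sample_index(coords: Sequence[int], size: Sequence[int]) -> int:
    """Flat index of a table entry; the first dimension varies fastest."""
    index, stride = 0, 1
    for c, s in zip(coords, size):
        index += c * stride
        stride *= s
    return index

def stream_length(size: Sequence[int], n_outputs: int, bits_per_sample: int) -> int:
    """Minimum number of bytes needed to hold the whole sample table."""
    entries = 1
    for s in size:
        entries *= s
    return (entries * n_outputs * bits_per_sample + 7) // 8  # round up
```

For a function with n output values, component j of a table entry would sit at position `sample_index(coords, size) * n + j` in the unpacked list, following the rule that output values are stored in the same order as Range.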

Example 3.12

14 0 obj
   << /FunctionType 0
      /Domain [-1.0 1.0 -1.0 1.0]
      /Size [21 31]
      /Encode [0 20 0 30]
      /BitsPerSample 4
      /Range [-1.0 1.0]
      /Decode [-1.0 1.0]
      /Length …
      /Filter …
   >>
stream
… 651 sample values …
endstream
endobj
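The evaluation steps for a sampled function (clip to Domain, encode, clip to the table, interpolate, decode, clip to Range) can be sketched for the simplest case. This is an illustration only: it assumes one input, one output, linear interpolation (Order 1), a Domain whose two values differ, and samples already unpacked into a list of integers. The names `interpolate`, `clip`, and `eval_sampled_1d` are hypothetical.

```python
import math
from typing import Optional, Sequence

def interpolate(x, x_min, x_max, y_min, y_max):
    """Value at x on the line through (x_min, y_min) and (x_max, y_max)."""
    return (x - x_min) * (y_max - y_min) / (x_max - x_min) + y_min

def clip(v, low, high):
    return min(max(v, low), high)

def eval_sampled_1d(x: float, samples: Sequence[int], bits_per_sample: int,
                    domain: Sequence[float], range_: Sequence[float],
                    encode: Optional[Sequence[float]] = None,
                    decode: Optional[Sequence[float]] = None) -> float:
    size = len(samples)
    encode = encode or [0, size - 1]   # default Encode: [0 (Size - 1)]
    decode = decode or list(range_)    # default Decode: same as Range

    x = clip(x, domain[0], domain[1])                             # clip to Domain
    e = interpolate(x, domain[0], domain[1], encode[0], encode[1])  # encode
    e = clip(e, 0, size - 1)                                      # clip to table

    lo = math.floor(e)                 # nearest surrounding table entries
    hi = min(lo + 1, size - 1)
    r = samples[lo] + (e - lo) * (samples[hi] - samples[lo])      # linear interpolation

    y = interpolate(r, 0, 2 ** bits_per_sample - 1, decode[0], decode[1])  # decode
    return clip(y, range_[0], range_[1])                          # clip to Range
```

A two-input function such as the one in Example 3.12 would apply the same steps per input dimension and interpolate bilinearly between the four surrounding table entries.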

The Decode entry can be used creatively to increase the accuracy of encoded

samples corresponding to certain values in the range. For example, if the desired

range of the function is

[−1.0 1.0] and BitsPerSample is 4, the usual value of

Decode would be [−1.0 1.0] and the sample values would be integers in the inter-

val

[0 15] (as shown in Figure 3.6). But if these values were used, the midpoint of

the range, 0.0, would not be represented exactly by any sample value, since it

would fall halfway between 7 and 8. On the other hand, if the

Decode array were

[−1.0 +1.1429] (1.1429 being approximately equal to 16 ÷ 14) and the sample

values supplied were in the interval

[0 14], then the desired effective range of

[−1.0 1.0] would be achieved, and the range value 0.0 would be represented by

the sample value 7.
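The arithmetic behind this adjustment can be checked with a short Python sketch. The helper name `decode_sample` is hypothetical; it applies only the decoding step, for a single output value.

```python
def decode_sample(r: int, bits_per_sample: int,
                  decode_min: float, decode_max: float) -> float:
    """Map a sample r in [0, 2**bits - 1] linearly onto [decode_min, decode_max]."""
    return decode_min + r * (decode_max - decode_min) / (2 ** bits_per_sample - 1)

# Usual Decode [-1 1]: samples 7 and 8 straddle 0.0 without hitting it.
usual = [decode_sample(r, 4, -1.0, 1.0) for r in range(16)]

# Decode [-1 16/14] with samples restricted to 0..14: each step is exactly 1/7,
# so sample 7 decodes to 0.0 and sample 14 decodes to 1.0 (up to rounding).
shifted = [decode_sample(r, 4, -1.0, 16 / 14) for r in range(15)]
```

With the usual Decode array each step is 2/15, which places 0.0 halfway between two sample values; widening the upper Decode value to 16/14 makes the step 1/7, and the sample value 15 is then simply left unused.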

FIGURE 3.6 Mapping with the Decode array

(Figure: two panels, each mapping the sample values 0 to 15 onto the output range. The first panel is labeled /Decode [−1 1]; the second is labeled /Decode [−1 1.1429].)

The Size value for an input dimension can be 1, in which case all input values in that dimension will be mapped to the single allowed value. If $\mathit{Size}_i$ is less than 4 for some input dimension i, cubic spline interpolation is not possible and Order 3 will be ignored if specified.

3.9.2 Type 2 (Exponential Interpolation) Functions

Type 2 functions (PDF 1.3) include a set of parameters that define an exponential interpolation of one input value and n output values:

$$f(x) = y_0, \ldots, y_{n-1}$$

In addition to the entries in Table 3.25 on page 97, a type 2 function dictionary includes those listed in Table 3.27. (See implementation note 24 in Appendix H.)

TABLE 3.27 Additional entries specific to a type 2 function dictionary

KEY   TYPE     VALUE

C0    array    (Optional) An array of n numbers defining the function result when x = 0.0 (hence the “0” in the name). Default value: [0.0].

C1    array    (Optional) An array of n numbers defining the function result when x = 1.0 (hence the “1” in the name). Default value: [1.0].

N     number   (Required) The interpolation exponent. Each input value x will return n values, given by $y_j = \mathit{C0}_j + x^N \times (\mathit{C1}_j - \mathit{C0}_j)$, for 0 ≤ j < n.

Values of Domain must constrain x in such a way that if N is not an integer, all values of x must be nonnegative, and if N is negative, no value of x may be zero. Typically, Domain will be declared as [0.0 1.0], and N will be a positive number. The Range parameter is optional and can be used to clip the output to a desired range. Note that when N is 1, the function performs a linear interpolation between C0 and C1. This can also be expressed as a sampled function (type 0).
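The type 2 formula can be sketched directly in Python. The function name `eval_exponential` and its keyword parameters are hypothetical; the defaults mirror the default values of C0, C1, and the typical Domain given above.

```python
from typing import List, Optional, Sequence

def eval_exponential(x: float, n_exp: float,
                     c0: Sequence[float] = (0.0,),
                     c1: Sequence[float] = (1.0,),
                     domain: Sequence[float] = (0.0, 1.0),
                     range_: Optional[Sequence[float]] = None) -> List[float]:
    """Evaluate y_j = C0_j + x**N * (C1_j - C0_j) for each output j."""
    x = min(max(x, domain[0]), domain[1])          # clip the input to Domain
    ys = [a + x ** n_exp * (b - a) for a, b in zip(c0, c1)]
    if range_ is not None:                         # Range is optional
        ys = [min(max(y, range_[2 * j]), range_[2 * j + 1])
              for j, y in enumerate(ys)]
    return ys
```

With N equal to 1, the call `eval_exponential(0.5, 1, c0=(0.0, 1.0), c1=(1.0, 0.0))` interpolates linearly between the two arrays. The sketch relies on Domain to exclude the inputs the text rules out (negative x with a non-integer N, or zero x with a negative N) and does not check for them itself.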

3.9.3 Type 3 (Stitching) Functions

Type 3 functions (PDF 1.3) define a “stitching” of the subdomains of several 1-input functions to produce a single new 1-input function. Since the resulting stitching function is a 1-input function, the domain is given by a two-element array, [$\mathit{Domain}_0$ $\mathit{Domain}_1$].

In addition to the entries in Table 3.25 on page 97, a type 3 function dictionary includes those listed in Table 3.28. (See implementation note 25 in Appendix H.)

TABLE 3.28 Additional entries specific to a type 3 function dictionary

KEY         TYPE    VALUE

Functions   array   (Required) An array of k 1-input functions making up the stitching function. The output dimensionality of all functions must be the same, and compatible with the value of Range if Range is present.

Bounds      array   (Required) An array of k − 1 numbers that, in combination with Domain, define the intervals to which each function from the Functions array applies. Bounds elements must be in order of increasing value, and each value must be within the domain defined by Domain.

Encode      array   (Required) An array of 2 × k numbers that, taken in pairs, map each subset of the domain defined by Domain and the Bounds array to the domain of the corresponding function.

Domain must be of size 2 (that is, m = 1), and $\mathit{Domain}_0$ must be strictly less than $\mathit{Domain}_1$ unless k = 1. The domain is partitioned into k subdomains, as indicated by the dictionary's Bounds entry, which is an array of k − 1 numbers that obey the following relationships (with exceptions as noted below):

$$\mathit{Domain}_0 < \mathit{Bounds}_0 < \mathit{Bounds}_1 < \cdots < \mathit{Bounds}_{k-2} < \mathit{Domain}_1$$

The Bounds array describes a series of half-open intervals, closed on the left and open on the right (except the last, which is closed on the right as well). The value of the Functions entry is an array of k functions. The first function applies to x values in the first subdomain, $\mathit{Domain}_0 \le x < \mathit{Bounds}_0$; the second function applies to x values in the second subdomain, $\mathit{Bounds}_0 \le x < \mathit{Bounds}_1$; and so on. The last function applies to x values in the last subdomain, which includes the upper bound: $\mathit{Bounds}_{k-2} \le x \le \mathit{Domain}_1$. The value of k may be 1, in which case the Bounds array is empty and the single item in the Functions array applies to all x values, $\mathit{Domain}_0 \le x \le \mathit{Domain}_1$.

The Encode array contains 2 × k numbers. A value x from the ith subdomain is encoded as follows:

$$x' = \mathrm{Interpolate}(x, \mathit{Bounds}_{i-1}, \mathit{Bounds}_i, \mathit{Encode}_{2i}, \mathit{Encode}_{2i+1})$$

for 0 ≤ i < k. In this equation, $\mathit{Bounds}_{-1}$ means $\mathit{Domain}_0$, and $\mathit{Bounds}_{k-1}$ means $\mathit{Domain}_1$. If the last bound, $\mathit{Bounds}_{k-2}$, is equal to $\mathit{Domain}_1$, then x′ is defined to be $\mathit{Encode}_{2i}$.

The stitching function is designed to make it easy to combine several functions to

be used within one shading pattern, over different parts of the shading’s domain.

(Shading patterns are discussed in Section 4.6.3, “Shading Patterns.”) The same

effect could be achieved by creating a separate shading dictionary for each of the

functions, with adjacent domains. However, since each shading would have simi-

lar parameters, and because the overall effect is one shading, it is more con-

venient to have a single shading with multiple function deﬁnitions.

Also, function type 3 provides a general mechanism for inverting the domains of

1-input functions. For example, consider a function f with a

Domain of [0.0 1.0],

and a stitching function g with a

Domain of [0.0 1.0], a Functions array contain-

ing f, and an

Encode array of [1.0 0.0]. In effect, g(x) = f(1 − x).

3.9.4 Type 4 (PostScript Calculator) Functions

A type 4 function (PDF 1.3), also called a PostScript calculator function, is represented as a stream containing code written in a small subset of the PostScript language. While any function can be sampled (in a type 0 PDF function) and others can be described with exponential functions (type 2 in PDF), type 4 functions offer greater flexibility and potentially greater accuracy. For example, a tint transformation function for a hexachrome (six-component) DeviceN color space with an alternate color space of DeviceCMYK (see “DeviceN Color Spaces” on page 186) requires a 6-in, 4-out function. If such a function were sampled with m values for each input variable, the number of samples, $4 \times m^6$, could be prohibitively large. In practice, such functions are often written as short, simple PostScript functions. (See implementation note 26 in Appendix H.)

Type 4 functions also make it possible to include a wide variety of halftone spot

functions without the loss of accuracy that comes from sampling, and without

adding to the list of predeﬁned spot functions (see Section 6.4.2, “Spot Func-

tions”). All of the predeﬁned spot functions can be written as type 4 functions.

The language that can be used in a type 4 function contains expressions involving

integers, real numbers, and boolean values only. There are no composite data

structures such as strings or arrays, no procedures, and no variables or names.

Table 3.29 lists the operators that can be used in this type of function. (For more

information on these operators, see Appendix B or the PostScript Language Refer-

ence, Third Edition.) Although the semantics are those of the corresponding

PostScript operators, a PostScript interpreter is not required.

TABLE 3.29 Operators in type 4 functions

OPERATOR TYPE                   OPERATORS

Arithmetic operators            abs       cvi     floor   mod     sin
                                add       cvr     idiv    mul     sqrt
                                atan      div     ln      neg     sub
                                ceiling   exp     log     round   truncate
                                cos

Relational, boolean, and        and       false   le      not     true
bitwise operators               bitshift  ge      lt      or      xor
                                eq        gt      ne

Conditional operators           if        ifelse

Stack operators                 copy      exch    pop
                                dup       index   roll

The operand syntax for type 4 functions follows PDF conventions rather than PostScript conventions. The entire code stream defining the function is enclosed in braces { }. Braces also delimit expressions that are executed conditionally by the if and ifelse operators:

   boolean {expression} if
   boolean {$\mathit{expression}_1$} {$\mathit{expression}_2$} ifelse

Note that this is a purely syntactic construct; unlike in PostScript, no “procedure objects” are involved.

A type 4 function dictionary includes the entries in Table 3.25 on page 97, as well

as other stream attributes as appropriate (see Table 3.4 on page 35). Example 3.13

shows a type 4 function equivalent to the predeﬁned spot function

DoubleDot

(see Section 6.4.2, “Spot Functions”).

Example 3.13

10 0 obj
   << /FunctionType 4
      /Domain [-1.0 1.0 -1.0 1.0]
      /Range [-1.0 1.0]
      /Length 71
   >>
stream
{ 360 mul sin
  2 div
  exch 360 mul sin
  2 div
  add
}
endstream
endobj

The Domain and Range keys are both required. The input variables constitute the

initial operand stack; the items remaining on the operand stack after execution of

the function are the output variables. It is an error for the number of remaining

operands to differ from the number of output variables speciﬁed by

Range, or for

any of them to be objects other than numbers.

Implementations of type 4 functions must provide a stack with room for at least

100 entries. No implementation is required to provide a larger stack, and it is an

error to overﬂow the stack.
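The stack discipline described above can be illustrated with a very small evaluator. This is a sketch under several assumptions, not a conforming implementation: it handles only a handful of the operators in Table 3.29, has no support for if or ifelse, treats every number as a real, expects tokens (including the braces) to be separated by white space, and does not clip the results to Range. The names `run_calculator`, `OPERATORS`, and `MAX_STACK` are hypothetical.

```python
import math
from typing import Callable, Dict, List, Sequence

MAX_STACK = 100  # the minimum stack depth an implementation must provide

def _binary(fn):
    def op(stack):
        b = stack.pop()
        a = stack.pop()
        stack.append(fn(a, b))
    return op

def _unary(fn):
    def op(stack):
        stack.append(fn(stack.pop()))
    return op

def _exch(stack):
    stack[-1], stack[-2] = stack[-2], stack[-1]

OPERATORS: Dict[str, Callable[[list], None]] = {
    "add": _binary(lambda a, b: a + b),
    "sub": _binary(lambda a, b: a - b),
    "mul": _binary(lambda a, b: a * b),
    "div": _binary(lambda a, b: a / b),
    "neg": _unary(lambda a: -a),
    "abs": _unary(abs),
    "sin": _unary(lambda a: math.sin(math.radians(a))),  # angles are in degrees
    "cos": _unary(lambda a: math.cos(math.radians(a))),
    "dup": lambda stack: stack.append(stack[-1]),
    "pop": lambda stack: stack.pop(),
    "exch": _exch,
}

def run_calculator(code: str, inputs: Sequence[float], n_outputs: int) -> List[float]:
    tokens = code.split()
    if len(tokens) < 2 or tokens[0] != "{" or tokens[-1] != "}":
        raise SyntaxError("function body must be enclosed in braces")
    stack = list(inputs)  # the input values form the initial operand stack
    for token in tokens[1:-1]:
        if token in OPERATORS:
            try:
                OPERATORS[token](stack)
            except IndexError:
                raise RuntimeError("stack underflow") from None
        else:
            stack.append(float(token))  # ValueError for anything unrecognized
        if len(stack) > MAX_STACK:
            raise RuntimeError("stack overflow")
    if len(stack) != n_outputs:
        raise RuntimeError("number of results differs from the number of outputs")
    return stack

# The body of Example 3.13, evaluated at the point (0.25, 0.25).
double_dot = "{ 360 mul sin 2 div exch 360 mul sin 2 div add }"
result = run_calculator(double_dot, [0.25, 0.25], 1)
```

At (0.25, 0.25) both sine terms are sin 90°, so the result is approximately 1.0.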

Although any integers or real numbers that may appear in the stream fall under

the same implementation limits (deﬁned in Appendix C) as in other contexts, the

intermediate results in type 4 function computations do not. An implementation

may use a representation that exceeds those limits. Operations on real numbers,

for example, might use single-precision or double-precision ﬂoating-point num-

bers. (See implementation note 27 in Appendix H.)

Errors in Type 4 Functions

The code that reads a type 4 function (analogous to the PostScript scanner) must

detect and report syntax errors. It may also be able to detect some errors that will

occur when the function is used, although this is not always possible. Any errors

detected by the scanner are considered to be errors in the PDF ﬁle itself and are

handled like other errors in the ﬁle.

The code that executes a type 4 function (analogous to the PostScript interpreter)

must detect and report errors. PDF does not deﬁne a representation for the

errors; those details are provided by the application that processes the PDF ﬁle.

The following types of error can occur (among others):

• Stack overﬂow

• Stack underﬂow

• A type error (for example, applying not to a real number)

• A range error (for example, applying sqrt to a negative number)

• An undeﬁned result (for example, dividing by 0)

3.10 File Speciﬁcations

A PDF ﬁle can refer to the contents of another ﬁle by using a ﬁle speciﬁcation

(PDF 1.1), which can take either of two forms. A simple ﬁle speciﬁcation gives just

the name of the target ﬁle in a standard format, independent of the naming con-

ventions of any particular ﬁle system; a full ﬁle speciﬁcation includes information

related to one or more speciﬁc ﬁle systems. A simple ﬁle speciﬁcation may take

the form of either a string or a dictionary; a full ﬁle speciﬁcation can only be rep-

resented as a dictionary.

Although the ﬁle designated by a ﬁle speciﬁcation is normally external to the PDF

ﬁle referring to it, PDF 1.3 permits a copy of the external ﬁle to be embedded

within the PDF ﬁle itself, allowing its contents to be stored or transmitted along

with the PDF ﬁle. However, embedding a ﬁle does not change the presumption

that it is external to the PDF ﬁle. Consequently, in order for the PDF ﬁle to be

processed correctly, it may be necessary to copy the embedded ﬁles it contains

back into a local ﬁle system.

3.10.1 File Speciﬁcation Strings

The standard format for representing a simple file specification in string form divides the string into component substrings separated by the slash character (/). The slash is a generic component separator that is mapped to the appropriate platform-specific separator when generating a platform-dependent file name. Any of the components may be empty. If a component contains one or more literal slashes, each must be preceded by a backslash (\), which in turn must be preceded by another backslash to indicate that it is part of the string and not an escape character. For example, the string

   (in\\/out)

represents the file name

   in/out

The backslashes are removed in processing the string; they are needed only to distinguish the component values from the component separators. The component substrings are stored as bytes and are passed to the operating system without interpretation or conversion of any sort.
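Splitting a file specification into components can be sketched as follows. The function name `split_file_spec` is hypothetical, and the sketch assumes its argument is the string value after ordinary PDF string parsing, so the literal (in\\/out) arrives as the characters in\/out. A Python str is used for brevity, although the components are really bytes.

```python
from typing import List

def split_file_spec(spec: str) -> List[str]:
    """Split on unescaped slashes; a backslash-slash pair is a literal slash."""
    components: List[str] = []
    current: List[str] = []
    i = 0
    while i < len(spec):
        ch = spec[i]
        if ch == "\\" and i + 1 < len(spec) and spec[i + 1] == "/":
            current.append("/")  # escaped slash: keep it, drop the backslash
            i += 2
        elif ch == "/":
            components.append("".join(current))  # component separator
            current = []
            i += 1
        else:
            current.append(ch)
            i += 1
    components.append("".join(current))
    return components
```

An empty first component in the result corresponds to a leading slash in the specification.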

Absolute and Relative File Speciﬁcations

A simple ﬁle speciﬁcation that begins with a slash is an absolute ﬁle speciﬁcation.

The last component is the ﬁle name; the preceding components specify its con-

text. In some ﬁle speciﬁcations, the ﬁle name may be empty; for example, URL

(uniform resource locator) speciﬁcations can specify directories instead of ﬁles. A

ﬁle speciﬁcation that does not begin with a slash is a relative ﬁle speciﬁcation giv-

ing the location of the ﬁle relative to that of the PDF ﬁle containing it.

In the case of a URL ﬁle system, the rules of Internet RFC 1808, Relative Uniform

Resource Locators (see the Bibliography), are used to compute an absolute URL

from a relative ﬁle speciﬁcation and the speciﬁcation of the PDF ﬁle. Prior to this

process, the relative ﬁle speciﬁcation is converted to a relative URL by using the

escape mechanism of RFC 1738, Uniform Resource Locators, to represent any

bytes that would be either “unsafe” according to RFC 1738 or not representable in

7-bit U.S. ASCII. In addition, such URL-based relative ﬁle speciﬁcations are lim-

ited to paths as deﬁned in RFC 1808; the scheme, network location/login, frag-

ment identiﬁer, query information, and parameter sections are not allowed.

In the case of other ﬁle systems, a relative ﬁle speciﬁcation is converted to an ab-

solute ﬁle speciﬁcation by removing the ﬁle name component from the speciﬁca-

tion of the containing PDF ﬁle and appending the relative ﬁle speciﬁcation in its

place. For example, the relative ﬁle speciﬁcation

ArtFiles/Figure1.pdf

appearing in a PDF ﬁle whose speciﬁcation is

/HardDisk/PDFDocuments/AnnualReport/Summary.pdf

yields the absolute speciﬁcation

/HardDisk/PDFDocuments/AnnualReport/ArtFiles/Figure1.pdf

The special component .. (two periods) can be used in a relative ﬁle speciﬁcation

to move up a level in the ﬁle system hierarchy. When the component immediately

preceding

.. is not another .., the two cancel each other; both are eliminated from

the ﬁle speciﬁcation and the process is repeated. Thus in the example above, the

relative ﬁle speciﬁcation

../../ArtFiles/Figure1.pdf

would yield the absolute speciﬁcation

/HardDisk/ArtFiles/Figure1.pdf
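The resolution rule for file systems other than URLs can be sketched in Python. The function name `resolve_file_spec` is hypothetical, and the sketch ignores escaped slashes inside components for simplicity.

```python
def resolve_file_spec(base: str, relative: str) -> str:
    """Resolve a relative file specification against that of the containing PDF file."""
    if relative.startswith("/"):
        return relative  # already an absolute file specification
    # Drop the file name of the containing file, then append the relative part.
    parts = base.split("/")[:-1] + relative.split("/")
    resolved = []
    for part in parts:
        # A ".." cancels the preceding component unless that is itself ".."
        # (or the empty component that marks the leading slash).
        if part == ".." and resolved and resolved[-1] not in ("", ".."):
            resolved.pop()
        else:
            resolved.append(part)
    return "/".join(resolved)

base = "/HardDisk/PDFDocuments/AnnualReport/Summary.pdf"
first = resolve_file_spec(base, "ArtFiles/Figure1.pdf")
second = resolve_file_spec(base, "../../ArtFiles/Figure1.pdf")
```

The two calls reproduce the examples above, yielding /HardDisk/PDFDocuments/AnnualReport/ArtFiles/Figure1.pdf and /HardDisk/ArtFiles/Figure1.pdf respectively.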

Conversion to Platform-Dependent File Names

The conversion of a ﬁle speciﬁcation into a platform-dependent ﬁle name de-

pends on the speciﬁc ﬁle naming conventions of each platform. For example:

• For the Apple Macintosh, all components are separated by colons (:).

• For UNIX, all components are separated by slashes (/). An initial slash, if present, is preserved.

• For DOS, the initial component is either a physical or logical drive identifier or a network resource name as returned by the Microsoft Windows function WNetGetConnection, and is followed by a colon. A network resource name is constructed from the first two components; the first component is the server name and the second is the share name (volume name). All components are then separated by backslashes. It is possible to specify an absolute DOS path without a drive by making the first component empty. (Empty components are ignored by other platforms.)

Strings used to specify a ﬁle name are interpreted in the standard encoding for

the platform on which the document is being viewed. Table 3.30 shows examples

of ﬁle speciﬁcations on the most common platforms.

TABLE 3.30 Examples of file specifications

SYSTEM      SYSTEM-DEPENDENT PATH              WRITTEN FORM

Macintosh   Mac HD:PDFDocs:spec.pdf            (/Mac HD/PDFDocs/spec.pdf)

DOS         \pdfdocs\spec.pdf (no drive)       (//pdfdocs/spec.pdf)

            r:\pdfdocs\spec.pdf                (/r/pdfdocs/spec.pdf)

            pclib/eng:\pdfdocs\spec.pdf        (/pclib/eng/pdfdocs/spec.pdf)

UNIX        /user/fred/pdfdocs/spec.pdf        (/user/fred/pdfdocs/spec.pdf)

            pdfdocs/spec.pdf (relative)        (pdfdocs/spec.pdf)
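The conversion rules can be sketched in Python for the simpler rows of Table 3.30. This is a sketch under assumptions: the function name to_platform_name is hypothetical, the DOS branch treats the first component as a drive identifier only (network resource names, which depend on WNetGetConnection, are not handled), and backslash escapes inside components are ignored.

```python
def to_platform_name(spec, platform):
    """Convert a file specification string to a platform-dependent file name."""
    absolute = spec.startswith("/")
    parts = (spec[1:] if absolute else spec).split("/")
    if platform == "dos":
        if not absolute:
            return "\\".join(parts)
        drive, rest = parts[0], parts[1:]
        # An empty first component means an absolute path without a drive.
        prefix = drive + ":" if drive else ""
        return prefix + "\\" + "\\".join(rest)
    # Platforms other than DOS ignore empty components.
    parts = [p for p in parts if p]
    if platform == "mac":
        return ":".join(parts)
    if platform == "unix":
        # An initial slash, if present, is preserved.
        return ("/" if absolute else "") + "/".join(parts)
    raise ValueError("unknown platform: " + platform)

print(to_platform_name("/Mac HD/PDFDocs/spec.pdf", "mac"))
print(to_platform_name("/r/pdfdocs/spec.pdf", "dos"))
print(to_platform_name("//pdfdocs/spec.pdf", "dos"))
```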

When creating documents that are to be viewed on multiple platforms, care must be taken to ensure file name compatibility. Only a subset of the U.S. ASCII character set should be used in file specifications: the uppercase alphabetic characters (A–Z), the numeric characters (0–9), and the underscore (_). The period (.) has special meaning in DOS and Windows file names, and as the first character in a Macintosh pathname. In file specifications, the period should be used only to separate a base file name from a file extension.

Some ﬁle systems are case-insensitive, so names within a directory should remain

distinguishable if lowercase letters are changed to uppercase or vice versa. On

DOS and Windows 3.1 systems and on some CD-ROM ﬁle systems, ﬁle names

are limited to 8 characters plus a 3-character extension. File system software typi-

cally converts long names to short names by retaining the ﬁrst 6 or 7 characters of

the ﬁle name and the ﬁrst 3 characters after the last period, if any. Since charac-

ters beyond the sixth or seventh are often converted to other values unrelated to

the original value, ﬁle names must be distinguishable from the ﬁrst 6 characters.
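The portability guidelines above lend themselves to a simple check. The sketch below is illustrative only; the names PORTABLE_NAME, is_portable, and clashing_prefixes are hypothetical, and the pattern assumes the strictest reading of the text (uppercase letters, digits, and underscore, in an 8.3 layout).

```python
import re

# At most 8 characters, optionally followed by a period and at most 3 characters.
PORTABLE_NAME = re.compile(r"[A-Z0-9_]{1,8}(\.[A-Z0-9_]{1,3})?")

def is_portable(name):
    """True if the name uses only the recommended characters in 8.3 form."""
    return PORTABLE_NAME.fullmatch(name) is not None

def clashing_prefixes(names, prefix_len=6):
    """Return pairs of names that are not distinguishable from their first
    prefix_len characters, ignoring case."""
    seen = {}
    clashes = []
    for name in names:
        key = name.split(".")[0][:prefix_len].upper()
        if key in seen:
            clashes.append((seen[key], name))
        else:
            seen[key] = name
    return clashes

print(is_portable("SUNSET.EPS"), is_portable("Sunset.eps"))
print(clashing_prefixes(["REPORT_1.PDF", "REPORT_2.PDF", "SUMMARY.PDF"]))
```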

Multiple-Byte Strings in File Speciﬁcations

In PDF 1.2 or higher, a file specification may contain multiple-byte character codes, represented in hexadecimal form between angle brackets (< and >). Since the slash character <2F> is used as a component delimiter and the backslash <5C> is used as an escape character, any occurrence of either of these bytes in a multiple-byte character must be preceded by the ASCII code for the backslash character. For example, a file name containing the 2-byte character code <89 5C> must write it as <89 5C 5C>. When the viewer application encounters this sequence of bytes in a file name, it replaces the sequence with the original 2-byte code.
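The escaping rule can be illustrated with a short Python sketch. The function names escape_code and unescape_code are hypothetical, and the decoder is a simplification: it removes a backslash before any slash or backslash byte, without checking whether the byte really belongs to a multiple-byte character.

```python
SLASH, BACKSLASH = 0x2F, 0x5C

def escape_code(code: bytes) -> bytes:
    """Precede each slash or backslash byte of a multiple-byte code with <5C>."""
    out = bytearray()
    for b in code:
        if b in (SLASH, BACKSLASH):
            out.append(BACKSLASH)
        out.append(b)
    return bytes(out)

def unescape_code(data: bytes) -> bytes:
    """Drop the escaping <5C> bytes, recovering the original code."""
    out = bytearray()
    i = 0
    while i < len(data):
        if data[i] == BACKSLASH and i + 1 < len(data) and data[i + 1] in (SLASH, BACKSLASH):
            i += 1  # skip the escape byte and keep the byte it protects
        out.append(data[i])
        i += 1
    return bytes(out)

print(escape_code(bytes.fromhex("895C")).hex())
```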

3.10.2 File Speciﬁcation Dictionaries

The dictionary form of file specification provides more flexibility than the string form, allowing different files to be specified for different file systems or platforms, or for file systems other than the standard ones (Macintosh, DOS/Windows, and UNIX). Table 3.31 shows the entries in a file specification dictionary. Viewer applications running on a particular platform should use the appropriate platform-specific entry (Mac, DOS, or Unix) if available. If the required platform-specific entry is not present and there is no file system entry (FS), the generic F entry should be used as a simple file specification.
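The selection rule can be sketched as follows, modeling a file specification dictionary as a plain Python dict. The function name select_entry and the returned tuple layout are assumptions made for illustration; the FS value is passed back so that the caller can decide which file system interprets the chosen string.

```python
def select_entry(filespec, platform_key):
    """Pick the entry a viewer on the given platform ("Mac", "DOS", or "Unix")
    would use. Returns (key, value, file_system) or None if nothing applies."""
    if platform_key in filespec:
        key = platform_key          # the platform-specific entry wins if present
    elif "F" in filespec:
        key = "F"                   # otherwise fall back to the generic entry
    else:
        return None
    return key, filespec[key], filespec.get("FS")

spec = {"Type": "Filespec", "Mac": "Sunset.eps", "F": "sunset.eps"}
print(select_entry(spec, "Mac"))
print(select_entry(spec, "DOS"))
```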

TABLE 3.31 Entries in a file specification dictionary

KEY TYPE VALUE

Type name (Required if an EF or RF entry is present; recommended always) The type of PDF object that this dictionary describes; must be Filespec for a file specification dictionary.

FS name (Optional) The name of the file system to be used to interpret this file specification. If this entry is present, all other entries in the dictionary are interpreted by the designated file system. PDF defines only one standard file system, URL (see Section 3.10.4, “URL Specifications”); a viewer application or plug-in extension can register a different one (see Appendix E). Note that this entry is independent of the F, Mac, DOS, and Unix entries.

F string (Required if the Mac, DOS, and Unix entries are all absent) A file specification string of the form described in Section 3.10.1, “File Specification Strings,” or (if the file system is URL) a uniform resource locator, as described in Section 3.10.4, “URL Specifications.”

Mac string (Optional) A file specification string (see Section 3.10.1, “File Specification Strings”) representing a Macintosh file name.

DOS string (Optional) A file specification string (see Section 3.10.1, “File Specification Strings”) representing a DOS file name.

Unix string (Optional) A file specification string (see Section 3.10.1, “File Specification Strings”) representing a UNIX file name.

ID array (Optional) An array of two strings, each of which is a file identifier (see Section 8.3, “File Identifiers”) that is also included in the referenced file. The first identifier is established permanently when the file is created; the second is changed each time the file is updated. This improves a viewer application’s chances of finding the intended file and allows it to warn the user if the file has changed since the link was made.

V boolean (Optional; PDF 1.2) A flag indicating whether the file referenced by the file specification is volatile (changes frequently with time). If the value is true, viewer applications should never cache a copy of the file. For example, a movie annotation referencing a URL to a live video camera could set this flag to true, notifying the application that it should reacquire the movie each time it is played. Default value: false.

EF dictionary (Optional; PDF 1.3) A dictionary containing a subset of the file name entries F, Mac, DOS, and Unix. The value of each such key is an embedded file stream (see Section 3.10.3, “Embedded File Streams”) containing the corresponding file. This entry is required if RF is present. If this entry is present, the Type entry is required and the file specification dictionary must be indirectly referenced.

RF dictionary (Optional; PDF 1.3) A dictionary with the same structure as the EF dictionary, which must also be present. Each entry in the RF dictionary must also be present in the EF dictionary. Each value is a related files array (see “Related Files Arrays” on page 114) identifying files that are related to the corresponding file in the EF dictionary. If this entry is present, the Type entry is required and the file specification dictionary must be indirectly referenced.

3.10.3 Embedded File Streams

File speciﬁcations ordinarily refer to ﬁles external to the PDF ﬁle in which they

occur. To preserve the integrity of the PDF ﬁle, this requires that all external ﬁles

it refers to must accompany it when it is archived or transmitted. Embedded ﬁle

streams (PDF 1.3) address this problem by allowing the contents of the referenced

ﬁles to be embedded directly within the body of the PDF ﬁle itself. For example, if

the ﬁle contains OPI (Open Prepress Interface) dictionaries that refer to external-

ly stored high-resolution images (see Section 8.6.4, “Open Prepress Interface

(OPI)”), the image data can be incorporated into the PDF ﬁle with embedded ﬁle

streams. This makes the PDF ﬁle a self-contained unit that can be stored or trans-

mitted as a single entity. (The embedded ﬁles are included purely for conve-

nience, and need not be directly processed by any PDF consumer application.)

The stream dictionary describing an embedded file contains the standard entries for any stream, such as Length and Filter (see Table 3.4 on page 35), as well as the additional entries shown in Table 3.32.

TABLE 3.32 Additional entries in an embedded file stream dictionary

KEY TYPE VALUE

Type name (Optional) The type of PDF object that this dictionary describes; if present, must be EmbeddedFile for an embedded file stream.

Subtype name (Optional) The subtype of the embedded file. The value of this entry must be a first-class name, as defined in Appendix E. Names without a registered prefix must conform to the MIME media type names defined in Internet RFC 2046, Multipurpose Internet Mail Extensions (MIME), Part Two: Media Types (see the Bibliography), with the provision that characters not allowed in names must use the 2-character hexadecimal code format described in Section 3.2.4, “Name Objects.”

Params dictionary (Optional) An embedded file parameter dictionary containing additional, file-specific information (see Table 3.33).

TABLE 3.33 Entries in an embedded file parameter dictionary

KEY TYPE VALUE

Size integer (Optional) The size of the embedded file, in bytes.

CreationDate date (Optional) The date and time when the embedded file was created.

ModDate date (Optional) The date and time when the embedded file was last modified.

Mac dictionary (Optional) A subdictionary containing additional information specific to Macintosh files (see Table 3.34).

CheckSum string (Optional) A 16-byte string that is the checksum of the bytes of the uncompressed embedded file. The checksum is calculated by applying the standard MD5 message-digest algorithm (described in Internet RFC 1321, The MD5 Message-Digest Algorithm; see the Bibliography) to the bytes of the embedded file stream.
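As an illustration of the Size and CheckSum entries, the following Python sketch computes both for a file's contents. The function name embedded_file_params is hypothetical, the dictionary is modeled as a plain Python dict, and the digest is assumed to be taken over the uncompressed bytes, before any stream filter is applied.

```python
import hashlib

def embedded_file_params(data: bytes) -> dict:
    """Build the Size and CheckSum entries for an embedded file's contents."""
    return {
        "Size": len(data),                        # size in bytes
        "CheckSum": hashlib.md5(data).digest(),   # 16-byte MD5 digest
    }

params = embedded_file_params(b"hello")
print(params["Size"], params["CheckSum"].hex())
```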

For Macintosh ﬁles, the Mac entry in the embedded ﬁle parameter dictionary

holds a further subdictionary containing Macintosh-speciﬁc ﬁle information.

Table 3.34 shows the contents of this subdictionary.

TABLE 3.34 Entries in a Macintosh-speciﬁc ﬁle information dictionary

KEY TYPE VALUE

Subtype string (Optional) The embedded ﬁle’s ﬁle type.

Creator string (Optional) The embedded ﬁle’s creator signature.

ResFork stream (Optional) The binary contents of the embedded ﬁle’s resource fork.

Related Files Arrays

In some circumstances, a PDF ﬁle can refer to a group of related ﬁles, such as the

set of ﬁve ﬁles that make up a DCS 1.0 color-separated image. The ﬁle speciﬁca-

tion explicitly names only one of the ﬁles; the rest are identiﬁed by some system-

atic variation of that ﬁle name (such as by altering the extension). When such a

ﬁle is to be embedded in a PDF ﬁle, the related ﬁles must be embedded as well.

This is accomplished by including a related ﬁles array (PDF 1.3) as the value of the

RF entry in the ﬁle speciﬁcation dictionary. The array has 2 × n elements, which

are paired in the form

[ string stream
  string stream
  …
  string stream
]

The ﬁrst element of each pair is a string giving the name of one of the related ﬁles;

the second element is an embedded ﬁle stream holding the ﬁle’s contents.
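Reading such an array amounts to walking it two elements at a time. The sketch below assumes the array has already been parsed into a Python list; the function name related_file_pairs is hypothetical, and the tuples stand in for indirect references to embedded file streams.

```python
def related_file_pairs(array):
    """Split a related files array of 2 × n elements into (name, stream) pairs."""
    if len(array) % 2 != 0:
        raise ValueError("a related files array must have an even number of elements")
    # Even positions hold the file names, odd positions the embedded file streams.
    return list(zip(array[0::2], array[1::2]))

array = ["Sunset.eps", (21, 0), "Sunset.C", (22, 0)]
for name, stream_ref in related_file_pairs(array):
    print(name, stream_ref)
```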

In Example 3.14, objects 21, 31, and 41 are embedded ﬁle streams containing the

Macintosh ﬁle

Sunset.eps, the DOS ﬁle SUNSET.EPS, and the UNIX ﬁle

Sunset.eps, respectively. The ﬁle speciﬁcation dictionary’s RF entry speciﬁes an

array, object 20, identifying a set of embedded ﬁles related to the Macintosh ﬁle,

forming a DCS 1.0 set. The example shows only the ﬁrst two embedded ﬁle

streams in the set; an actual PDF ﬁle would of course include all of them.

Example 3.14

10 0 obj                            % File specification dictionary
  << /Type /Filespec
     /Mac (Sunset.eps)              % Name of the Macintosh file
     /DOS (SUNSET.EPS)
     /Unix (Sunset.eps)
     /EF << /Mac 21 0 R             % Embedded Macintosh file
            /DOS 31 0 R
            /Unix 41 0 R
         >>
     /RF << /Mac 20 0 R >>          % Related files array for the Macintosh file
  >>
endobj

20 0 obj                            % Related files array for the Macintosh file
  [ (Sunset.eps) 21 0 R             % Includes file Sunset.eps itself
    (Sunset.C) 22 0 R
    (Sunset.M) 23 0 R
    (Sunset.Y) 24 0 R
    (Sunset.K) 25 0 R
  ]
endobj

21 0 obj                            % Embedded file stream for file Sunset.eps
  << /Type /EmbeddedFile
     /Length …
     /Filter …
  >>
stream
… Data for Sunset.eps …
endstream
endobj

22 0 obj                            % Embedded file stream for file Sunset.C
  << /Type /EmbeddedFile
     /Length …
     /Filter …
  >>
stream
… Data for Sunset.C …
endstream
endobj

3.10.4 URL Speciﬁcations

When the FS entry in a file specification dictionary has the value URL, the value of the F entry in that dictionary is not a file specification string, but a uniform resource locator (URL) of the form defined in Internet RFC 1738, Uniform Resource Locators (see the Bibliography). Example 3.15 shows a URL specification.

Example 3.15

<< /FS /URL
   /F (ftp://www.beatles.com/Movies/AbbeyRoad.mov)
>>

The URL must adhere to the character-encoding requirements specified in RFC 1738. Because 7-bit U.S. ASCII is a strict subset of PDFDocEncoding, this value may also be considered to be in that encoding.

3.10.5 Maintenance of File Speciﬁcations

The techniques described in this section can be used to maintain the integrity of

the ﬁle speciﬁcations within a PDF ﬁle during operations such as the following:

• Updating the relevant ﬁle speciﬁcation when a referenced ﬁle is renamed

• Determining the complete collection of ﬁles that must be copied to a mirror

site

• When creating new links to external ﬁles, discovering existing ﬁle speciﬁcations

that refer to the same ﬁles and sharing them

• Finding the ﬁle speciﬁcations associated with embedded ﬁles to be packed or

unpacked

It is not possible, in general, to ﬁnd all ﬁle speciﬁcation strings in a PDF ﬁle, be-

cause there is no way to determine whether a given string is a ﬁle speciﬁcation

string. It is possible, however, to ﬁnd all ﬁle speciﬁcation dictionaries, provided

that they meet the following conditions:

• They are indirect objects.

• They contain a Type entry whose value is the name Filespec.

An application can then locate all of the file specification dictionaries by traversing the PDF file’s cross-reference table (see Section 3.4.3, “Cross-Reference Table”) and finding all dictionaries with Type keys whose value is Filespec. For this reason, it is highly recommended that all file specifications be expressed in dictionary form and meet the conditions stated above. Note that any file specification dictionary specifying embedded files (that is, one that contains an EF entry) must satisfy these conditions (see Table 3.31 on page 111).

Note: It may not be possible to locate ﬁle speciﬁcation dictionaries that are direct ob-

jects, since they are neither self-typed nor necessarily reachable via any standard

path of object references.
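The search described above can be sketched in Python, assuming the indirect objects listed in the cross-reference table have already been parsed into a dict keyed by object number. That representation and the function name find_filespecs are assumptions made for illustration; no particular PDF library is implied.

```python
def find_filespecs(indirect_objects):
    """Return the indirect objects that are self-typed file specification
    dictionaries, keyed by object number."""
    return {
        number: obj
        for number, obj in indirect_objects.items()
        if isinstance(obj, dict) and obj.get("Type") == "Filespec"
    }

objects = {
    10: {"Type": "Filespec", "Mac": "Sunset.eps"},
    20: ["Sunset.eps", (21, 0)],      # a related files array, not a dictionary
    30: {"F": "untyped.pdf"},         # lacks a Type entry, so it is not found
}
print(sorted(find_filespecs(objects)))
```

The third object shows why the Type entry is recommended: without it, a dictionary cannot be recognized as a file specification.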

Files may be embedded in a PDF file either directly, using the EF entry in a file specification dictionary, or indirectly, using related files arrays specified in the RF entry. If a file is embedded indirectly, its name is given by the string that precedes the embedded file stream in the related files array; if it is embedded directly, its name is obtained from the value of the corresponding entry in the file specification dictionary. In Example 3.14 on page 115, for instance, the EF dictionary has a DOS entry identifying object number 31 as an embedded file stream; the name of the embedded DOS file, SUNSET.EPS, is given by the DOS entry in the file specification dictionary.

A given external ﬁle may be referenced from more than one ﬁle speciﬁcation.

Therefore, when embedding a ﬁle with a given name, it is necessary to check for

other occurrences of the same name as the value associated with the correspond-

ing key in other ﬁle speciﬁcation dictionaries. This requires ﬁnding all embed-

dable ﬁle speciﬁcations and, for each matching key, checking for both of the

following conditions:

• The string value associated with the key matches the name of the ﬁle being em-

bedded.

• A value has not already been embedded for the ﬁle speciﬁcation. (If there is

already a corresponding key in the

EF dictionary, then a ﬁle has already been

embedded for that use of the ﬁle name.)

Note that there is no requirement that the ﬁles associated with a given ﬁle name

be unique. The same ﬁle name, such as

readme.txt, may be associated with differ-

ent embedded ﬁles in distinct ﬁle speciﬁcations.
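The two conditions can be checked mechanically once the file specification dictionaries have been collected. The sketch below models them as Python dicts; the function name embedding_candidates is hypothetical.

```python
def embedding_candidates(filespecs, key, file_name):
    """Return the file specification dictionaries in which file_name, given
    under key ("F", "Mac", "DOS", or "Unix"), has not been embedded yet."""
    return [
        fs for fs in filespecs
        if fs.get(key) == file_name          # the name matches the file being embedded
        and key not in fs.get("EF", {})      # and no file is embedded for that key yet
    ]

specs = [
    {"Type": "Filespec", "DOS": "SUNSET.EPS"},
    {"Type": "Filespec", "DOS": "SUNSET.EPS", "EF": {"DOS": (31, 0)}},
    {"Type": "Filespec", "DOS": "OTHER.EPS"},
]
print(embedding_candidates(specs, "DOS", "SUNSET.EPS"))
```

Only the first dictionary is returned: the second already has an embedded file for the DOS key, and the third names a different file.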

CHAPTER 4

Graphics

THE GRAPHICS OPERATORS used in PDF content streams describe the ap-

pearance of pages that are to be reproduced on a raster output device. The facili-

ties described in this chapter are intended for both printer and display

applications.

The graphics operators form six main groups:

• Graphics state operators manipulate the data structure called the graphics state,

the global framework within which the other graphics operators execute. The

graphics state includes the current transformation matrix (CTM), which maps

user space coordinates used within a PDF content stream into output device

coordinates. It also includes the current color, the clipping path, and many other

parameters that are implicit operands of the painting operators.

• Path construction operators specify paths, which deﬁne shapes, line trajectories,

and regions of various sorts. They include operators for beginning a new path,

adding line segments and curves to it, and closing it.

• Path-painting operators ﬁll a path with a color, paint a stroke along it, or use it

as a clipping boundary.

• Other painting operators paint certain self-describing graphics objects. These

include sampled images, geometrically deﬁned shadings, and entire content

streams that in turn contain sequences of graphics operators.

• Text operators select and paint character glyphs from fonts (descriptions of type-

faces for representing text characters). Because PDF treats glyphs as general

graphical shapes, many of the text operators could be grouped with the graph-

ics state or painting operators. However, the data structures and mechanisms

for dealing with glyph and font descriptions are sufﬁciently specialized that

Chapter 5 focuses on them.

• Marked-content operators associate higher-level logical information with ob-

jects in the content stream. This information does not affect the rendered ap-

pearance of the content; it is useful to applications that use PDF for document

interchange. Marked content is described in Section 8.4.2, “Marked Content.”

This chapter presents general information about device-independent graphics in

PDF: how a PDF content stream describes the abstract appearance of a page.

Rendering—the device-dependent part of graphics—is covered in Chapter 6. The

Bibliography lists a number of books that give details of these computer graphics

concepts and their implementation.

4.1 Graphics Objects

As discussed in Section 3.7.1, “Content Streams,” the data in a content stream is

interpreted as a sequence of operators and their operands, expressed as basic data

objects according to standard PDF syntax. A content stream can describe the ap-

pearance of a page, or it can be treated as a graphical element in certain other

contexts.

The operands and operators are written sequentially using postﬁx notation. This

notation resembles the sequential execution model of the PostScript language.

However, a PDF content stream is not a program to be interpreted; rather, it is a

static description of a sequence of graphics objects. There are speciﬁc rules, de-

scribed below, for writing the operands and operators that describe a graphics

object.

PDF provides ﬁve types of graphics object:

• A path object is an arbitrary shape made up of straight lines, rectangles, and

cubic Bézier curves. A path may intersect itself and may have disconnected

sections and holes. A path object ends with one or more painting operators

that specify whether the path is ﬁlled, stroked, used as a clipping path, or some

combination of these operations.

• A text object consists of one or more character strings that identify sequences of

glyphs to be painted. Like a path, text can be ﬁlled, stroked, or used as a clip-

ping path.

• An external object (XObject) is an object deﬁned outside the content stream and

referenced as a named resource (see Section 3.7.2, “Resource Dictionaries”).

The interpretation of an XObject depends on its type. An image XObject deﬁnes

a rectangular array of color samples to be painted; a form XObject is an entire

content stream to be treated as a single graphics object. (There is also a

PostScript XObject, whose use is not recommended.)

• An in-line image object is a means of expressing the data for a small image

directly in the content stream, using a special syntax.

• A shading object describes a geometric shape whose color is an arbitrary func-

tion of position within the shape. (A shading can also be treated as a color

when painting other graphics objects; it is not considered to be a graphics ob-

ject in that case.)

Each graphics object is painted on the page in sequence, obscuring any previously

painted objects that it overlaps, in accordance with the opaque painting model

introduced in Section 2.1.2, “Adobe Imaging Model.” Although this painting

behavior is often attributed to individual operators making up the object, it is

always the object as a whole that is painted. Figure 4.1 shows the ordering rules

for the operations that deﬁne graphics objects. Some operations are permitted

only in certain types of graphics object or in the intervals between graphics

objects (called the page description level in the ﬁgure). Every content stream

begins at the page description level, where changes can be made to the graphics

state, such as colors and text attributes, as discussed in the following sections.

In the figure, arrows indicate the operators that mark the beginning or end of each type of graphics object. Some operators are identified individually, others by general category. Table 4.1 summarizes these categories for all PDF operators. For example, the path construction operators m and re signal the beginning of a path object. Inside the path object, additional path construction operators are permitted, as are the clipping path operators W and W*, but not general graphics state operators such as w or J. A path-painting operator, such as S or f, ends the path object and returns to the page description level.

Note: A content stream whose operations violate these rules for describing graphics

objects can produce unpredictable behavior, even though it may display and print

correctly. Applications that attempt to extract graphics objects for editing or other

purposes depend on the objects’ being well formed. The rules for graphics objects are

also important for the proper interpretation of marked content (see Section 8.4.2,

“Marked Content”).
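The ordering rules for path objects can be expressed as a small state machine. The Python sketch below covers only the path-related rules described above (it accepts every other operator at the page description level without checking it), takes a list of operator names with operands already stripped, and uses the hypothetical function name first_path_violation.

```python
PATH_CONSTRUCTION = {"m", "l", "c", "v", "y", "h", "re"}
PATH_PAINTING = {"S", "s", "f", "F", "f*", "B", "B*", "b", "b*", "n"}
CLIPPING = {"W", "W*"}

def first_path_violation(operators):
    """Return the index of the first operator that breaks the path object
    rules, len(operators) if a path is left unfinished, or None if all is well."""
    state = "page"                        # page description level
    for index, op in enumerate(operators):
        if state == "page":
            if op in ("m", "re"):
                state = "path"            # m or re begins a path object
            elif op in PATH_CONSTRUCTION | PATH_PAINTING | CLIPPING:
                return index              # path operator outside a path object
        elif state == "path":
            if op in PATH_PAINTING:
                state = "page"            # painting ends the path object
            elif op in CLIPPING:
                state = "clip"            # W or W* must be followed by painting
            elif op not in PATH_CONSTRUCTION:
                return index              # e.g. w or J inside a path object
        else:                             # state == "clip"
            if op in PATH_PAINTING:
                state = "page"
            else:
                return index
    return None if state == "page" else len(operators)

print(first_path_violation(["m", "l", "l", "h", "S"]))
print(first_path_violation(["m", "l", "w", "S"]))
```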

FIGURE 4.1 Graphics objects

[Figure: diagram of the page description level and the graphics objects entered from it. The labels recoverable from the figure are listed below.]

Page description level
Allowed operators: general graphics state, special graphics state, color, text state, marked-content

Path object (begun by m or re; ended by path-painting operators)
Allowed operators: path construction

Clipping path object (entered by W or W*; ended by path-painting operators)
Allowed operators: none

Text object (begun by BT; ended by ET)
Allowed operators: general graphics state, color, text state, text-showing, text-positioning, marked-content

In-line image object (begun by BI; ended by EI)
Allowed operators: ID

External object (Do; immediate)
Allowed operators: none

Shading object (immediate)
Allowed operators: none

TABLE 4.1 Operator categories

CATEGORY                 OPERATORS                                      TABLE   PAGE

General graphics state   w, J, j, M, d, ri, i, gs                       4.7     142

Special graphics state   q, Q, cm                                       4.7     142

Path construction        m, l, c, v, y, h, re                           4.9     149

Path painting            S, s, f, F, f*, B, B*, b, b*, n                4.10    152

Clipping paths           W, W*                                          4.11    156

Text objects             BT, ET                                         5.4     286

Text state               Tc, Tw, Tz, TL, Tf, Tr, Ts                     5.2     280

Text positioning         Td, TD, Tm, T*                                 5.5     287

Text showing             Tj, TJ, ', "                                   5.6     289

Type 3 fonts             d0, d1                                         5.10    303

Color                    cs, CS, sc, scn, SC, SCN, g, G, rg, RG, k, K   4.21    198

Shading patterns         sh                                             4.24    214

In-line images           BI, ID, EI                                     4.38    260

XObjects                 Do                                             4.34    243

Marked content           BMC, BDC, EMC, MP, DP                          8.5     480

Compatibility            BX, EX                                         3.19    84

A graphics object also implicitly includes all graphics state parameters that affect

its behavior. For instance, a path object depends on the value of the current color

parameter at the moment the path object is deﬁned. The effect is as if this param-

eter were speciﬁed as part of the deﬁnition of the path object. However, the oper-

ators that are invoked at the page description level to set graphics state

parameters are not considered to belong to any particular graphics object. Graph-

ics state parameters need to be speciﬁed only when they change. A graphics

object may depend on parameters that were deﬁned much earlier.

Similarly, the individual character strings within a text object implicitly include

the graphics state parameters on which they depend. Most of these parameters

may be set either inside or outside the text object. The effect is as if they were sep-

arately speciﬁed for each text string.

The important point is that there is no semantic signiﬁcance to the exact arrange-

ment of graphics state operators. An application that reads and writes a PDF con-

tent stream is not required to preserve this arrangement, but is free to change it to

any other arrangement that achieves the same values of the relevant graphics state

parameters for each graphics object. An application should not infer any higher-

level logical semantics from the arrangement of tokens constituting a graphics

object. A separate mechanism, marked content, allows such higher-level informa-

tion to be explicitly associated with the graphics objects; see Section 8.4.2,

“Marked Content.”

4.2 Coordinate Systems

Coordinate systems deﬁne the canvas on which all painting occurs. They deter-

mine the position, orientation, and size of the text, graphics, and images that

appear on a page. This section describes each of the coordinate systems used in

PDF, how they are related, and how transformations among them are speciﬁed.

4.2.1 Coordinate Spaces

Paths and positions are deﬁned in terms of pairs of coordinates on the Cartesian

plane. A coordinate pair is a pair of real numbers x and y that locate a point hori-

zontally and vertically within a two-dimensional coordinate space. A coordinate

space is determined by the following properties with respect to the current page:

• The location of the origin

• The orientation of the x and y axes

• The lengths of the units along each axis

PDF deﬁnes several coordinate spaces in which the coordinates specifying graph-

ics objects are interpreted. The following sections describe these spaces and the

relationships among them.

Transformations among coordinate spaces are deﬁned by transformation

matrices, which can specify any linear mapping of two-dimensional coordinates,

including translation, scaling, rotation, reﬂection, and skewing. Transformation

matrices are discussed in Sections 4.2.2, “Common Transformations,” and 4.2.3,

“Transformation Matrices.”

Device Space

The contents of a page ultimately appear on a raster output device such as a dis-

play or a printer. Such devices vary greatly in the built-in coordinate systems they

use to address pixels within their imageable areas. A particular device’s coordi-

nate system is called its device space. The origin of the device space on different

devices can fall in different places on the output page; on displays, the origin can

vary depending on the window system. Because the paper or other output me-

dium moves through different printers and imagesetters in different directions,

the axes of their device spaces may be oriented differently; for instance, vertical

(y) coordinates may increase from the top of the page to the bottom on some de-

vices and from bottom to top on others. Finally, different devices have different

resolutions; some even have resolutions that differ in the horizontal and vertical

directions.

If coordinates in a PDF ﬁle were speciﬁed in device space, the ﬁle would be

device-dependent and would appear differently on different devices. For exam-

ple, images speciﬁed in the typical device spaces of a 72-pixel-per-inch display

and a 600-dot-per-inch printer would differ in size by more than a factor of 8; an

8-inch line segment on the display would appear less than 1 inch long on the

printer. Figure 4.2 shows how the same graphics object, speciﬁed in device space,

can appear drastically different when rendered on different output devices.
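The arithmetic behind this example is easy to check. In the sketch below, the function name rendered_length_inches is hypothetical; it simply divides a length in device units by the device resolution.

```python
def rendered_length_inches(device_units, dots_per_inch):
    """Physical length of a segment whose length is given in device units."""
    return device_units / dots_per_inch

# An 8-inch line on a 72-pixel-per-inch display spans 8 * 72 = 576 device units.
line = 8 * 72
print(rendered_length_inches(line, 72))    # 8.0 inches on the display
print(rendered_length_inches(line, 600))   # 0.96 inch on a 600-dpi printer
print(600 / 72)                            # size ratio, a little over 8
```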

FIGURE 4.2 Device space

[Figure: the same object specified in device space, shown in the device space for a 72-dpi screen and in the device space for a 300-dpi printer.]

User Space

To avoid the device-dependent effects of specifying objects in device space, PDF

deﬁnes a device-independent coordinate system that always bears the same rela-

tionship to the current page, regardless of the output device on which printing or

displaying will occur. This device-independent coordinate system is called user

space.

The user space coordinate system is initialized to a default state for each page of a

document. Initially, the origin is located at the lower-left corner of the output

page or display window, with the positive x axis extending horizontally to the

right and the positive y axis extending vertically upward, as in standard mathe-

matical practice. The length of a unit along both the x and y axes is 1⁄72 inch.

This coordinate system is the default user space, in which all points on a page have

positive x and y coordinate values.

Note: The unit size in default user space (1⁄72 inch) is approximately the same as a

point, a unit widely used in the printing industry. It is not exactly the same, how-

ever; there is no universal deﬁnition of a point.

Conceptually, user space is an inﬁnite plane. Only a small portion of this plane

corresponds to the imageable area of the output device: a rectangular area above

and to the right of the origin in default user space. The region of default user

space that is viewed or printed can be different for each page, and is described in

Section 8.6.1, “Page Boundaries.”

The default user space origin coincides with the lower-left corner of the physical

output medium. Portions of the physical medium may not be imageable on some

output devices; for example, many laser printers cannot place marks at the

extreme edges of the physical sheet of paper. Thus, in particular, it may not be

possible to place marks at or near the default user space origin. However, the cor-

respondence of physical corner to default origin ensures that marks within the

imageable portion of the output page will be consistently positioned with respect

to the edges of the medium.

Note: Because coordinates in user space (as in any other coordinate space) may be

speciﬁed as either integers or real numbers, the unit size in default user space does

not constrain positions to any arbitrary grid. The resolution of coordinates in user

space is not related in any way to the resolution of pixels in device space.

The transformation from user space to device space is deﬁned by the current

transformation matrix (CTM), an element of the PDF graphics state (see

Section 4.3, “Graphics State”). A PDF viewer application can adjust the CTM for

the native resolution of a particular output device, maintaining the device-

independence of the PDF page description itself. Figure 4.3 shows how this allows

an object speciﬁed in user space to appear the same regardless of the device on

which it is rendered.
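As a rough illustration of this idea, the following Python sketch builds a CTM for devices of different resolutions and checks that one user space point lands at the same physical position on each. It assumes, purely for simplicity, that device space has its origin at the lower-left corner with y increasing upward and that one device unit is one pixel; real devices may use other conventions. The function names are hypothetical.

```python
def device_ctm(dpi):
    """CTM mapping default user space (1/72 inch units) to device pixels.

    Assumption: device origin at lower left, y increasing upward.
    """
    s = dpi / 72.0
    return (s, 0.0, 0.0, s, 0.0, 0.0)  # six-element form [a b c d e f]


def to_device(ctm, x, y):
    """Map a user space point to device space."""
    a, b, c, d, e, f = ctm
    return (a * x + c * y + e, b * x + d * y + f)


def physical_inches(dpi, x, y):
    """Physical position, in inches, of a user space point on a device."""
    dx, dy = to_device(device_ctm(dpi), x, y)
    return (dx / dpi, dy / dpi)


# The same user space point (1 inch right, 2 inches up) on two devices
screen = physical_inches(72, 72, 144)
printer = physical_inches(300, 72, 144)
```

The device coordinates differ (about 72 by 144 pixels on the screen, about 300 by 600 on the printer), but the physical position is the same up to floating-point rounding.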

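FIGURE 4.3 User space (the CTM maps user space to device space for a 72-dpi screen and to device space for a 300-dpi printer)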

The default user space provides a consistent, dependable starting place for PDF page descriptions regardless of the output device used. If necessary, a PDF content stream may then modify user space to be more suitable to its needs by applying the coordinate transformation operator, cm (see Section 4.3.3, “Graphics State Operators”). Thus what may appear to be absolute coordinates in a content stream are not absolute with respect to the current page, because they are expressed in a coordinate system that may slide around and shrink or expand. Coordinate system transformation not only enhances device-independence but is a useful tool in its own right. For example, a content stream originally composed to occupy an entire page can be incorporated without change as an element of another page by shrinking the coordinate system in which it is drawn.
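The page-within-a-page case can be sketched as follows. This is a minimal illustration, not a prescribed method: the helper names, the page and box sizes, and the choice to preserve the aspect ratio are all assumptions made for the example.

```python
def fit_matrix(src_w, src_h, x, y, w, h):
    """Matrix that shrinks a src_w-by-src_h page into the box whose
    lower-left corner is (x, y) and whose size is w-by-h, keeping the
    aspect ratio (an assumption of this sketch)."""
    s = min(w / src_w, h / src_h)
    return (s, 0.0, 0.0, s, x, y)


def cm_line(m):
    """Format the six numbers as operands of the cm operator."""
    return " ".join(f"{v:g}" for v in m) + " cm"


# Place a 612-by-792 page at half size with its corner at (100, 100)
m = fit_matrix(612, 792, 100, 100, 306, 396)
wrapped = "q\n" + cm_line(m) + "\n% original page content goes here\nQ"
```

Wrapping the embedded content in q and Q (see Section 4.3.1) keeps the shrunken coordinate system from leaking into the rest of the enclosing page.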

Other Coordinate Spaces

In addition to device space and user space, PDF uses a variety of other coordinate

spaces for specialized purposes:

• The coordinates of text are speciﬁed in text space. The transformation from text

space to user space is deﬁned by a text matrix in combination with several text-

related parameters in the graphics state (see Section 5.3.1, “Text-Positioning

Operators”).

• Character glyphs in a font are deﬁned in glyph space (see Section 5.1.3, “Glyph

Positioning and Metrics”). The transformation from glyph space to text space

is deﬁned by the font matrix. For most types of font, this matrix is predeﬁned to

map 1000 units of glyph space to 1 unit of text space; for Type 3 fonts, the font

matrix is given explicitly in the font dictionary (see Section 5.5.4, “Type 3

Fonts”).

• All sampled images are deﬁned in image space. The transformation from image

space to user space is predeﬁned and cannot be changed. All images are 1 unit

wide by 1 unit high in user space, regardless of the number of samples in the

image. To be painted, an image must be mapped to the desired region of the

page by temporarily altering the current transformation matrix (CTM).

Note: In PostScript, unlike PDF, the relationship between image space and user

space can be speciﬁed explicitly. The ﬁxed transformation prescribed in PDF corre-

sponds to the convention that is recommended for use in PostScript.

• A form XObject (discussed in Section 4.9, “Form XObjects”) is a self-contained

content stream that can be treated as a graphical element within another con-

tent stream. The space in which it is deﬁned is called form space. The transfor-

mation from form space to user space is speciﬁed by a matrix contained in the

form XObject.

• PDF 1.2 deﬁnes a type of color known as a pattern, discussed in Section 4.6,

“Patterns.” A pattern is deﬁned either by a content stream that is invoked re-

peatedly to tile an area or by a shading whose color is a function of position.

The space in which a pattern is deﬁned is called pattern space. The transforma-

tion from pattern space to user space is speciﬁed by a matrix contained in the

pattern.
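To make the image space item above concrete: because every image occupies the unit square in user space, painting it in a given rectangle amounts to concatenating a matrix that maps the unit square onto that rectangle. The sketch below assumes an unrotated rectangle; the function names and sample numbers are hypothetical.

```python
def image_matrix(x, y, width, height):
    """Matrix mapping the unit square onto the rectangle whose
    lower-left corner is (x, y), in six-element form [a b c d e f]."""
    return (width, 0, 0, height, x, y)


def apply(m, px, py):
    """Transform the point (px, py) by the matrix m."""
    a, b, c, d, e, f = m
    return (a * px + c * py + e, b * px + d * py + f)


m = image_matrix(100, 200, 150, 80)
# Corners of the unit square, counterclockwise from the origin
corners = [apply(m, px, py) for px, py in [(0, 0), (1, 0), (1, 1), (0, 1)]]
```

The four corners come out as the corners of the target rectangle, independent of how many samples the image contains.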

Relationships among Coordinate Spaces

Figure 4.4 shows the relationships among the coordinate spaces described above.

Each arrow in the ﬁgure represents a transformation from one coordinate space

to another. PDF allows modiﬁcations to many of these transformations.

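FIGURE 4.4 Relationships among coordinate systems (glyph space maps to text space; text, form, image, and pattern space map to user space; user space maps to device space)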

Because PDF coordinate spaces are deﬁned relative to one another, changes made

to one transformation can affect the appearance of objects deﬁned in several

coordinate spaces. For example, a change in the CTM, which deﬁnes the trans-

formation from user space to device space, will affect forms, text, images, and

patterns, since they are all “upstream” from user space.

4.2.2 Common Transformations

A transformation matrix specifies the relationship between two coordinate spaces.

By modifying a transformation matrix, objects can be scaled, rotated, translated,

or transformed in other ways.

A transformation matrix in PDF is specified by six numbers, usually in the form of an array containing six elements. In its most general form, this array is denoted [a b c d e f]; it can represent any linear transformation from one coordinate system to another. This section lists the arrays that specify the most common transformations; Section 4.2.3, “Transformation Matrices,” discusses more mathematical details of transformations, including information on specifying transformations that are combinations of those listed here.

• Translations are specified as [1 0 0 1 t_x t_y], where t_x and t_y are the distances to translate the origin of the coordinate system in the horizontal and vertical dimensions, respectively.

• Scaling is obtained by [s_x 0 0 s_y 0 0]. This scales the coordinates so that 1 unit in the horizontal and vertical dimensions of the new coordinate system is the same size as s_x and s_y units, respectively, in the previous coordinate system.

• Rotations are produced by [cos θ sin θ −sin θ cos θ 0 0], which has the effect of rotating the coordinate system axes by an angle θ counterclockwise.

• Skew is specified by [1 tan α tan β 1 0 0], which skews the x axis by an angle α and the y axis by an angle β.

Figure 4.5 shows examples of each transformation. The directions of translation,

rotation, and skew shown in the ﬁgure correspond to positive values of the array

elements.
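The four arrays listed above can be written as small constructor functions. This is only a sketch: the function names are hypothetical, and taking angles in degrees is a convenience chosen here, not something the array form itself prescribes.

```python
import math


def translate(tx, ty):
    """[1 0 0 1 tx ty]: move the origin by (tx, ty)."""
    return (1, 0, 0, 1, tx, ty)


def scale(sx, sy):
    """[sx 0 0 sy 0 0]: scale the horizontal and vertical units."""
    return (sx, 0, 0, sy, 0, 0)


def rotate(theta_deg):
    """[cos sin -sin cos 0 0]: rotate the axes counterclockwise."""
    t = math.radians(theta_deg)
    return (math.cos(t), math.sin(t), -math.sin(t), math.cos(t), 0, 0)


def skew(alpha_deg, beta_deg):
    """[1 tan(alpha) tan(beta) 1 0 0]: skew the x and y axes."""
    return (1, math.tan(math.radians(alpha_deg)),
            math.tan(math.radians(beta_deg)), 1, 0, 0)
```

For example, rotate(90) is approximately (0, 1, −1, 0, 0, 0), with small floating-point residue in place of the exact zeros.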

FIGURE 4.5 Effects of coordinate transformations (panels: translation, scaling, rotation, skewing)

If several transformations are combined, the order in which they are applied is significant. For example, first scaling and then translating the x axis is not the same as first translating and then scaling it. In general, to obtain the expected results, transformations should be done in the following order:

1. Translate

2. Rotate

3. Scale or skew

Figure 4.6 shows the effect of the order in which transformations are applied. The figure shows two sequences of transformations applied to a coordinate system. After each successive transformation, an outline of the letter n is drawn.

FIGURE 4.6 Effect of transformation order

The transformations shown in the ﬁgure are as follows:

• A translation of 10 units in the x direction and 20 units in the y direction

• A rotation of 30 degrees

• A scaling by a factor of 3 in the x direction

(Figure 4.6 panels. First sequence: Original; Step 1: Translation; Step 2: Rotation; Step 3: Scaling. Second sequence: Original; Step 1: Scaling; Step 2: Rotation; Step 3: Translation.)

In the ﬁgure, the axes are shown with a dash pattern having a 2-unit dash and a

2-unit gap. In addition, the original (untransformed) axes are shown in a lighter

color for reference. Notice that the scale-rotate-translate ordering results in a

distortion of the coordinate system, leaving the x and y axes no longer perpendic-

ular, while the recommended translate-rotate-scale ordering does not.
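The effect of ordering can be checked numerically. The sketch below applies the three transformations from the figure in both orders, treating each step as a matrix concatenated onto the current one, and then tests whether the resulting x and y axes are still perpendicular. The helper names are hypothetical; the multiplication convention (row vectors, each new matrix multiplied in front of the existing one) is the one used for PDF matrices in Section 4.2.3.

```python
import math


def mul(m, n):
    """Product m x n of two matrices in six-element form [a b c d e f]."""
    a1, b1, c1, d1, e1, f1 = m
    a2, b2, c2, d2, e2, f2 = n
    return (a1 * a2 + b1 * c2, a1 * b2 + b1 * d2,
            c1 * a2 + d1 * c2, c1 * b2 + d1 * d2,
            e1 * a2 + f1 * c2 + e2, e1 * b2 + f1 * d2 + f2)


def concat_all(steps):
    """Apply the steps in sequence, starting from the identity matrix."""
    ctm = (1, 0, 0, 1, 0, 0)
    for m in steps:
        ctm = mul(m, ctm)  # each new matrix premultiplies the current one
    return ctm


def axes_dot(ctm):
    """Dot product of the transformed x and y axis directions;
    zero means the axes are still perpendicular."""
    a, b, c, d, _, _ = ctm
    return a * c + b * d


t = math.radians(30)
T = (1, 0, 0, 1, 10, 20)                                     # translation
R = (math.cos(t), math.sin(t), -math.sin(t), math.cos(t), 0, 0)  # rotation
S = (3, 0, 0, 1, 0, 0)                                       # scaling in x

recommended = concat_all([T, R, S])      # translate, rotate, scale
reversed_order = concat_all([S, R, T])   # scale, rotate, translate
```

With these numbers, axes_dot(recommended) is zero up to rounding, while axes_dot(reversed_order) is clearly nonzero, which matches the distortion described for the scale-rotate-translate ordering.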

4.2.3 Transformation Matrices

This section discusses the mathematics of transformation matrices. It is not

necessary to read this section in order to use the transformations described previ-

ously; the information is presented for the beneﬁt of readers who want to gain a

deeper understanding of the theoretical basis of coordinate transformations.

To understand the mathematics of coordinate transformations in PDF, it is vital

to remember two points:

• Transformations alter coordinate systems, not graphics objects. All objects painted

before a transformation is applied are unaffected by the transformation. Ob-

jects painted after the transformation is applied will be interpreted in the trans-

formed coordinate system.

• Transformation matrices specify the transformation from the new (transformed)

coordinate system to the original (untransformed) coordinate system. All coor-

dinates used after the transformation are expressed in the transformed coordi-

nate system. PDF applies the transformation matrix to ﬁnd the equivalent

coordinates in the untransformed coordinate system.

Note: Many computer graphics textbooks consider transformations of graphics

objects rather than of coordinate systems. Although either approach is correct and

self-consistent, some details of the calculations differ depending on which point of

view is taken.

PDF represents coordinates in a two-dimensional space. The point (x, y) in such a space can be expressed in vector form as [x y 1]. The constant third element of this vector (1) is needed so that the vector can be used with 3-by-3 matrices in the calculations described below.

The transformation between two coordinate systems is represented by a 3-by-3 transformation matrix written as

$$
\begin{bmatrix} a & b & 0 \\ c & d & 0 \\ e & f & 1 \end{bmatrix}
$$

Because a transformation matrix has only six elements that can be changed, it is usually specified in PDF as the six-element array [a b c d e f].

Coordinate transformations are expressed as matrix multiplications:

$$
\begin{bmatrix} x' & y' & 1 \end{bmatrix} = \begin{bmatrix} x & y & 1 \end{bmatrix} \times \begin{bmatrix} a & b & 0 \\ c & d & 0 \\ e & f & 1 \end{bmatrix}
$$

Because PDF transformation matrices specify the conversion from the transformed coordinate system to the original (untransformed) coordinate system, x′ and y′ in this equation are the coordinates in the untransformed coordinate system, while x and y are the coordinates in the transformed system. Carrying out the multiplication, we have

$$
\begin{aligned}
x' &= a \times x + c \times y + e \\
y' &= b \times x + d \times y + f
\end{aligned}
$$

If a series of transformations is carried out, the matrices representing each of the individual transformations can be multiplied together to produce a single equivalent matrix representing the composite transformation.

Matrix multiplication is not commutative; the order in which matrices are multiplied is significant. Consider a sequence of two transformations: a scaling transformation applied to the user space coordinate system, followed by a conversion from the resulting scaled user space to device space. Let M_S be the matrix specifying the scaling and M_C the current transformation matrix, which transforms user space to device space. Recalling that coordinates are always specified in the transformed space, the correct order of transformations must first convert the scaled coordinates to default user space and then the default user space coordinates to device space. This can be expressed as

$$
X_D = X_U \times M_C = (X_S \times M_S) \times M_C = X_S \times (M_S \times M_C)
$$

where

X_D denotes the coordinates in device space

X_U denotes the coordinates in default user space

X_S denotes the coordinates in scaled user space

This shows that when a new transformation is concatenated with an existing one, the matrix representing it must be multiplied before (premultiplied with) the existing transformation matrix.

This result is true in general for PDF: when a sequence of transformations is carried out, the matrix representing the combined transformation (M′) is calculated by premultiplying the matrix representing the additional transformation (M_T) with the one representing all previously existing transformations (M):

$$
M' = M_T \times M
$$
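The premultiplication rule can be checked with a few lines of Python. The sketch below uses the six-element form and the row-vector convention described in this section; the function names and the sample matrices are hypothetical, chosen with integer entries so that the arithmetic is exact.

```python
def mul(m, n):
    """Product m x n of two matrices in six-element form [a b c d e f]."""
    a1, b1, c1, d1, e1, f1 = m
    a2, b2, c2, d2, e2, f2 = n
    return (a1 * a2 + b1 * c2, a1 * b2 + b1 * d2,
            c1 * a2 + d1 * c2, c1 * b2 + d1 * d2,
            e1 * a2 + f1 * c2 + e2, e1 * b2 + f1 * d2 + f2)


def apply(m, x, y):
    """[x' y' 1] = [x y 1] x M, written out term by term."""
    a, b, c, d, e, f = m
    return (a * x + c * y + e, b * x + d * y + f)


M_S = (2, 0, 0, 2, 0, 0)      # additional transformation: scale by 2
M_C = (1, 0, 0, 1, 10, 20)    # stand-in for an existing CTM

X_S = (1, 1)                  # a point in scaled user space
X_U = apply(M_S, *X_S)        # the same point in default user space
X_D = apply(M_C, *X_U)        # the same point in device space

combined = mul(M_S, M_C)      # M' = M_T x M, new matrix in front
wrong_order = mul(M_C, M_S)   # the other order gives a different matrix
```

Applying combined to X_S gives the same result as the two-step conversion, while wrong_order does not, which is the non-commutativity the text describes.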

4.3 Graphics State

A PDF viewer application maintains an internal data structure called the graphics

state that holds current graphics control parameters. These parameters deﬁne the

global framework within which the graphics operators execute. For example, the

f (ﬁll) operator implicitly uses the current color parameter, and the S (stroke) op-

erator additionally uses the current line width parameter from the graphics state.

The graphics state is initialized at the beginning of each page, using the default

values speciﬁed in Tables 4.2 and 4.3. Table 4.2 lists those graphics state parame-

ters that are device-independent and are appropriate to specify in page descrip-

tions. The parameters listed in Table 4.3 control details of the rendering (scan

conversion) process and are device-dependent; a page description that is intend-

ed to be device-independent should not modify these parameters.

TABLE 4.2 Device-independent parameters of the graphics state

PARAMETER TYPE VALUE

CTM array The current transformation matrix, which maps positions from user coordinates to device coordinates (see Section 4.2, “Coordinate Systems”). This matrix is modified by each application of the coordinate transformation operator, cm. Initial value: a matrix that transforms default user coordinates to device coordinates.

clipping path (internal) A path deﬁning the current clipping boundary against which all out-

put is to be cropped (see Section 4.4.3, “Clipping Path Operators”).

Initial value: the boundary of the entire imageable portion of the

output page.

color space name or array The current color space in which color values are to be interpreted (see Section 4.5, “Color Spaces”). There are two separate color space parameters: one for stroking and one for all other painting operations. Initial value: DeviceGray.

color (various) The current color to use during painting operations (see Section 4.5,

“Color Spaces”). The type and interpretation of this parameter

depend on the current color space; for most color spaces, a color

value consists of one to four numbers. There are two separate color

parameters: one for stroking and one for all other painting opera-

tions. Initial value: black.

text state (various) A set of eight graphics state parameters that pertain only to the

painting of text. These include parameters that select the font, scale

the glyphs to an appropriate size, and accomplish other effects. The

text state parameters are described in Section 5.2, “Text State

Parameters and Operators.”

line width number The thickness, in user space units, of paths to be stroked (see “Line

Width” on page 139). Initial value: 1.0.

line cap integer A code specifying the shape of the endpoints for any open path that

is stroked (see “Line Cap Style” on page 139). Initial value: 0, for

square butt caps.

line join integer A code specifying the shape of joints between connected segments

of a stroked path (see “Line Join Style” on page 140). Initial value:

0, for mitered joins.

miter limit number The maximum length of mitered line joins for stroked paths (see “Miter Limit” on page 140). This parameter limits the length of “spikes” produced when line segments join at sharp angles. Initial value: 10.0, for a miter cutoff below approximately 11.5 degrees.

dash pattern array and number A description of the dash pattern to be used when paths are stroked (see “Line Dash Pattern” on page 141). Initial value: a solid line.

rendering intent name The rendering intent to use when converting CIE-based colors to device colors (see “Rendering Intents” on page 179). Default value: RelativeColorimetric.

stroke adjustment boolean (PDF 1.2) A ﬂag specifying whether to compensate for possible

rasterization effects when stroking a path with a line width that is

small relative to the pixel resolution of the output device (see

Section 6.5.4, “Automatic Stroke Adjustment”). Note that this is

considered a device-independent parameter, even though the

details of its effects are device-dependent. Initial value: false.

TABLE 4.3 Device-dependent parameters of the graphics state

PARAMETER TYPE VALUE

overprint boolean (PDF 1.2) A ﬂag specifying (on output devices that support the

overprint control feature) whether painting in one set of colorants

should cause the corresponding areas of other colorants to be

erased (false) or left unchanged (true); see Section 4.5.6, “Overprint

Control.” In PDF 1.3, there are two separate overprint parameters:

one for stroking and one for all other painting operations. Initial

value: false.

overprint mode number (PDF 1.3) A code specifying whether a color component value of 0 in a DeviceCMYK color space should erase that component (0) or leave it unchanged (1) when overprinting (see Section 4.5.6, “Overprint Control”). Initial value: 0.

black generation function or name (PDF 1.2) A function that calculates the level of black colorant to use when converting RGB colors to CMYK (see Section 6.2.3, “Conversion from DeviceRGB to DeviceCMYK”). Initial value: installation-dependent.

undercolor removal function or name (PDF 1.2) A function that calculates the reduction in the levels of cyan, magenta, and yellow colorants to compensate for the amount of black added by black generation (see Section 6.2.3, “Conversion from DeviceRGB to DeviceCMYK”). Initial value: installation-dependent.

transfer function, array, or name (PDF 1.2) A function that adjusts device gray or color component levels to compensate for nonlinear response in a particular output device (see Section 6.3, “Transfer Functions”). Initial value: installation-dependent.

halftone dictionary, stream, or name (PDF 1.2) A halftone screen for gray and color rendering, specified as a halftone dictionary or stream (see Section 6.4, “Halftones”). Initial value: installation-dependent.

ﬂatness number The precision with which curves are to be rendered on the output

device (see Section 6.5.1, “Flatness Tolerance”). The value of this

parameter gives the maximum error tolerance, measured in output

device pixels; smaller numbers give smoother curves at the expense

of more computation and memory use. Initial value: 1.0.

smoothness number (PDF 1.3) The precision with which color gradients are to be

rendered on the output device (see Section 6.5.2, “Smoothness Tol-

erance”). The value of this parameter gives the maximum error

tolerance, expressed as a fraction of the range of each color compo-

nent; smaller numbers give smoother color transitions at the

expense of more computation and memory use. Initial value:

installation-dependent.

Some graphics state parameters are set with specific PDF operators, some are set by including a particular entry in a graphics state parameter dictionary, and some can be specified either way. The current line width, for example, can be set either with the w operator or (in PDF 1.3) with the LW entry in a graphics state parameter dictionary, whereas the current color is set only with specific operators and the current halftone is set only with a graphics state parameter dictionary. It is expected that all future graphics state parameters will be specified with new entries in the graphics state parameter dictionary rather than with new operators.

In general, the operators that set graphics state parameters simply store them un-

changed for later use by the painting operators. However, some parameters have

special properties or behavior:

• Most parameters must be of the correct type or have values that fall within a

certain range.

• Parameters that are numeric values, such as color, line width, and miter limit, are forced into valid range, if necessary. However, they are not adjusted to reflect capabilities of the raster output device, such as resolution or number of distinguishable colors. Painting operators perform such adjustments, but the adjusted values are not stored back into the graphics state.

• Paths are internal objects that are not directly represented in PDF.

Note: As indicated in Tables 4.2 and 4.3, some of the parameters—color space, color,

and overprint—have two values, one used for stroking (of path and text objects) and

one for all other painting operations. The two parameter values can be set indepen-

dently, allowing for operations such as combined ﬁlling and stroking of the same path

with different colors. Except where noted, a term such as current color should be in-

terpreted to refer to whichever color parameter applies to the operation being per-

formed. When necessary, the individual color parameters are distinguished explicitly

as the stroking color and the nonstroking color.

4.3.1 Graphics State Stack

A well-structured PDF document typically contains many graphical elements

that are essentially independent of each other and sometimes nested to multiple

levels. The graphics state stack allows these elements to make local changes to the

graphics state without disturbing the graphics state of the surrounding environ-

ment. The stack is a LIFO (last in, ﬁrst out) data structure in which the contents

of the graphics state can be saved and later restored using the following operators:

• The q operator pushes a copy of the entire graphics state onto the stack.

• The Q operator restores the entire graphics state to its former value by popping

it from the stack.

These operators can be used to encapsulate a graphical element so that it can modify parameters of the graphics state and later restore them to their previous values. Occurrences of the q and Q operators must be balanced within a given content stream (or within the sequence of streams specified in a page dictionary’s Contents array).
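The save and restore behavior can be modeled with an ordinary LIFO stack. The class below is a minimal sketch, not how any particular viewer is implemented; the class name, the dictionary representation of the graphics state, and the choice to raise an error on an unbalanced Q are all assumptions of the example.

```python
import copy


class GraphicsStateStack:
    """Toy model of the graphics state and its save/restore stack."""

    def __init__(self, initial):
        self.current = dict(initial)
        self._stack = []

    def q(self):
        """Push a copy of the entire current graphics state."""
        self._stack.append(copy.deepcopy(self.current))

    def Q(self):
        """Restore the most recently saved graphics state."""
        if not self._stack:
            raise ValueError("unbalanced Q: graphics state stack is empty")
        self.current = self._stack.pop()

    def depth(self):
        return len(self._stack)


gs = GraphicsStateStack({"line_width": 1.0, "line_cap": 0})
gs.q()                              # outer element begins
gs.current["line_width"] = 4.0
gs.q()                              # nested element begins
gs.current["line_cap"] = 1
gs.Q()                              # nested change to line_cap is undone
inner = dict(gs.current)
gs.Q()                              # outer change to line_width is undone
```

After the final Q the state is back to its initial values and the stack is empty, which is what balanced use of q and Q guarantees.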

4.3.2 Details of Graphics State Parameters

This section gives details of several of the device-independent graphics state

parameters listed in Table 4.2 on page 135.

Line Width

The line width parameter speciﬁes the thickness of the line used to stroke a path.

It is a nonnegative number expressed in user space units; stroking a path entails

painting all points whose perpendicular distance from the path in user space is

less than or equal to half the line width. The effect produced in device space de-

pends on the current transformation matrix (CTM) in effect at the time the path

is stroked. If the CTM speciﬁes scaling by different factors in the x and y dimen-

sions, the thickness of stroked lines in device space will vary according to their

orientation. The actual line width achieved can differ from the requested width

by as much as 2 device pixels, depending on the positions of lines with respect to

the pixel grid. Automatic stroke adjustment can be used to ensure uniform line

width; see Section 6.5.4, “Automatic Stroke Adjustment.”
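The dependence of stroke thickness on orientation can be estimated geometrically. The sketch below is an illustration under stated assumptions, not a formula given in this section: it treats a stroked straight segment as a strip of user space, uses the fact that a linear map scales areas by the absolute value of its determinant, and divides by the stretch along the segment's direction. It ignores pixel rounding and stroke adjustment, and it assumes a nondegenerate CTM. The function name is hypothetical.

```python
import math


def device_line_width(ctm, line_width, angle_deg):
    """Approximate device space thickness of a straight stroke whose
    direction in user space makes angle_deg with the x axis."""
    a, b, c, d, _, _ = ctm
    t = math.radians(angle_deg)
    ux, uy = math.cos(t), math.sin(t)
    # Direction of the segment after transformation by the CTM
    dx, dy = ux * a + uy * c, ux * b + uy * d
    area_scale = abs(a * d - b * c)
    return line_width * area_scale / math.hypot(dx, dy)


ctm = (2, 0, 0, 1, 0, 0)   # scales x by 2 and leaves y unchanged
horizontal = device_line_width(ctm, 1.0, 0)
vertical = device_line_width(ctm, 1.0, 90)
```

With this CTM a horizontal stroke keeps a thickness of about 1 device unit while a vertical stroke becomes about 2 units thick.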

A line width of 0 denotes the thinnest line that can be rendered at device resolu-

tion: 1 device pixel wide. However, some devices cannot reproduce 1-pixel lines,

and on high-resolution devices, they are nearly invisible. Since the results of ren-

dering such “zero-width” lines are device-dependent, their use is not recom-

mended.

Line Cap Style

The line cap style speciﬁes the shape to be used at the ends of open subpaths (and

dashes, if any) when they are stroked. Table 4.4 shows the possible values.

TABLE 4.4 Line cap styles

STYLE APPEARANCE DESCRIPTION

0 Butt cap. The stroke is squared off at the endpoint of

the path. There is no projection beyond the end of

the path.

1 Round cap. A semicircular arc with a diameter equal

to the line width is drawn around the endpoint and

ﬁlled in.

2 Projecting square cap. The stroke continues beyond

the endpoint of the path for a distance equal to half

the line width and is then squared off.

Line Join Style

The line join style speciﬁes the shape to be used at the corners of paths that are

stroked. Table 4.5 shows the possible values. Join styles are signiﬁcant only at

points where consecutive segments of a path connect at an angle; segments that

meet or intersect fortuitously receive no special treatment.

TABLE 4.5 Line join styles

STYLE APPEARANCE DESCRIPTION

0 Miter join. The outer edges of the strokes for the two

segments are extended until they meet at an angle, as

in a picture frame. If the segments meet at too sharp

an angle (as deﬁned by the miter limit parameter—

see “Miter Limit,” below), a bevel join is used instead.

1 Round join. A circle with a diameter equal to the line

width is drawn around the point where the two

segments meet and is ﬁlled in, producing a rounded

corner.

Note: If path segments shorter than half the line width

meet at a sharp angle, an unintended “wrong side” of

the circle may appear.

2 Bevel join. The two segments are ﬁnished with butt

caps (see “Line Cap Style” on page 139) and the

resulting notch beyond the ends of the segments is

ﬁlled with a triangle.

Miter Limit

When two line segments meet at a sharp angle and mitered joins have been spec-

iﬁed as the line join style, it is possible for the miter to extend far beyond the

thickness of the line stroking the path. The miter limit imposes a maximum on

the ratio of the miter length to the line width (see Figure 4.7). When the limit is

exceeded, the join is converted from a miter to a bevel.

FIGURE 4.7 Miter length (labels: miter length, line width)

The ratio of miter length to line width is directly related to the angle φ between the segments in user space by the formula

$$
\frac{\mathit{miterLength}}{\mathit{lineWidth}} = \frac{1}{\sin(\varphi / 2)}
$$

For example, a miter limit of 1.414 converts miters to bevels for φ less than 90 degrees, a limit of 2.0 converts them for φ less than 60 degrees, and a limit of 10.0 converts them for φ less than approximately 11.5 degrees.
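The relationship between miter limit and cutoff angle can be worked through numerically. The sketch below evaluates the ratio of miter length to line width as 1/sin(φ/2) and inverts it to find the cutoff angle for a given limit; the function names are hypothetical, and angles are taken in degrees for readability.

```python
import math


def miter_ratio(phi_deg):
    """miterLength / lineWidth for segments meeting at angle phi."""
    return 1.0 / math.sin(math.radians(phi_deg) / 2.0)


def cutoff_angle(miter_limit):
    """Angle in degrees below which a miter is converted to a bevel.
    Assumes miter_limit >= 1, since the ratio can never be smaller."""
    return math.degrees(2.0 * math.asin(1.0 / miter_limit))


def join_for(phi_deg, miter_limit):
    """Join actually produced when the miter join style is selected."""
    return "miter" if miter_ratio(phi_deg) <= miter_limit else "bevel"
```

For instance, cutoff_angle(2.0) is 60 degrees up to rounding and cutoff_angle(10.0) is roughly 11.48 degrees, consistent with the examples in the text.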

Line Dash Pattern

The line dash pattern controls the pattern of dashes and gaps used to stroke paths.

It is speciﬁed by a dash array and a dash phase. The dash array’s elements are

numbers that specify the lengths of alternating dashes and gaps; the dash phase

speciﬁes the distance into the dash pattern at which to start the dash. The ele-

ments of both the dash array and the dash phase are expressed in user space units.

Before beginning to stroke a path, the dash array is cycled through, adding up the

lengths of dashes and gaps. When the accumulated length equals the value speci-

ﬁed by the dash phase, stroking of the path begins, using the dash array cyclically

from that point onward. Table 4.6 shows examples of line dash patterns. As can

be seen from the table, an empty dash array and zero phase can be used to restore

the dash pattern to a solid line.

TABLE 4.6 Examples of line dash patterns

DASH ARRAY AND PHASE APPEARANCE DESCRIPTION

[ ] 0 No dash; solid, unbroken lines

[3] 0 3 units on, 3 units off, …

[2] 1 1 on, 2 off, 2 on, 2 off, …

[2 1] 0 2 on, 1 off, 2 on, 1 off, …

[3 5] 6 2 off, 3 on, 5 off, 3 on, 5 off, …

[2 3] 11 1 on, 3 off, 2 on, 3 off, 2 on, …

Dashed lines wrap around curves and corners just as solid stroked lines do. The

ends of each dash are treated with the current line cap style, and corners within

dashes are treated with the current line join style. A stroking operation takes no

measures to coordinate the dash pattern with features of the path; it simply dis-

penses dashes and gaps along the path in the pattern deﬁned by the dash array.

When a path consisting of several subpaths is stroked, each subpath is treated in-

dependently—that is, the dash pattern is restarted and the dash phase is reapplied

to it at the beginning of each subpath.
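The way the dash array and dash phase interact can be simulated directly. The sketch below cycles through the dash array, consumes the phase, and then reports the first few runs of the pattern; it reproduces the descriptions in Table 4.6. The function name and the (state, length) representation are hypothetical, and a nonnegative phase is assumed.

```python
from itertools import count


def dash_runs(dash_array, phase, n):
    """Return the first n runs as ("on" | "off", length) pairs."""
    if not dash_array:
        return [("on", float("inf"))]      # empty array: solid line
    if sum(dash_array) <= 0:
        raise ValueError("dash array must contain a positive length")
    if n <= 0:
        return []
    runs = []
    remaining_phase = phase
    for k in count():
        length = dash_array[k % len(dash_array)]
        state = "on" if k % 2 == 0 else "off"   # dashes and gaps alternate
        if remaining_phase >= length:
            remaining_phase -= length           # still skipping the phase
            continue
        runs.append((state, length - remaining_phase))
        remaining_phase = 0
        if len(runs) == n:
            return runs


example = dash_runs([3, 5], 6, 3)
```

Since each subpath is treated independently, a stroker built along these lines would call dash_runs afresh, with the same phase, at the start of every subpath.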

4.3.3 Graphics State Operators

Table 4.7 shows the operators that set the values of parameters in the graphics

state. (See also the color operators listed in Table 4.21 on page 198 and the text

state operators in Table 5.2 on page 280.)

TABLE 4.7 Graphics state operators

OPERANDS OPERATOR DESCRIPTION

— q Save the current graphics state on the graphics state stack (see “Graphics

State Stack” on page 138).

— Q Restore the graphics state by removing the most recently saved state from

the stack and making it the current state (see “Graphics State Stack” on

page 138).

a b c d e f cm Modify the CTM by concatenating the specified matrix (see Section 4.2.1, “Coordinate Spaces”). Although the operands specify a matrix, they are written as six separate numbers, not as an array.

lineWidth w Set the line width in the graphics state (see “Line Width” on page 139).

lineCap J Set the line cap style in the graphics state (see “Line Cap Style” on

page 139).

lineJoin j Set the line join style in the graphics state (see “Line Join Style” on

page 140).

miterLimit M Set the miter limit in the graphics state (see “Miter Limit” on page 140).

dashArray dashPhase d Set the line dash pattern in the graphics state (see “Line Dash Pattern” on

page 141).

intent ri (PDF 1.1) Set the color rendering intent in the graphics state (see “Rendering Intents” on page 179).

flatness i Set the flatness tolerance in the graphics state (see Section 6.5.1, “Flatness Tolerance”). flatness is a number in the range 0 to 100; a value of 0 specifies the output device’s default flatness tolerance.

dictName gs (PDF 1.2) Set the specified parameters in the graphics state. dictName is the name of a graphics state parameter dictionary in the ExtGState subdictionary of the current resource dictionary (see the next section).
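The operand-then-operator layout of Table 4.7 can be illustrated by generating a short fragment of content stream text. The helpers below are hypothetical and only format numbers and arrays; they do not validate operand types or ranges.

```python
def num(v):
    """Format a number without exponent notation or trailing zeros."""
    s = ("%f" % v).rstrip("0").rstrip(".")
    return s if s not in ("", "-0") else "0"


def operator_line(operator, *operands):
    """Write the operands first, then the operator."""
    parts = []
    for v in operands:
        if isinstance(v, (list, tuple)):
            parts.append("[" + " ".join(num(x) for x in v) + "]")
        else:
            parts.append(num(v))
    parts.append(operator)
    return " ".join(parts)


lines = [
    operator_line("q"),
    operator_line("cm", 1, 0, 0, 1, 10, 20),  # six numbers, not an array
    operator_line("w", 2.5),
    operator_line("J", 1),
    operator_line("j", 1),
    operator_line("M", 10),
    operator_line("d", [3, 5], 6),            # dash array, then dash phase
    operator_line("Q"),
]
stream = "\n".join(lines)
```

Note the contrast between cm, whose matrix is written as six bare numbers, and d, whose first operand is a genuine array.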

4.3.4 Graphics State Parameter Dictionaries

While some parameters in the graphics state can be set with individual operators, as shown in Table 4.7, others cannot. The latter can only be set with the generic graphics state operator gs (PDF 1.2). The operand supplied to this operator is the name of a graphics state parameter dictionary whose contents specify the values of one or more graphics state parameters. This name is looked up in the ExtGState subdictionary of the current resource dictionary. (The name ExtGState, for “extended graphics state,” is a vestige of earlier versions of PDF.)

Note: The graphics state parameter dictionary is also used by type 2 patterns, which

do not have a content stream in which the graphics state operators could be invoked

(see Section 4.6.3, “Shading Patterns”).

Each entry in the parameter dictionary specifies the value of an individual graphics state parameter, as shown in Table 4.8. It is not necessary for all entries to be present for every invocation of the gs operator; the parameter dictionary supplied may include any desired combination of parameter entries. The results of gs are cumulative; parameter values established in previous invocations will persist until explicitly overridden. Note that some parameters appear in both Tables 4.7 and 4.8; these parameters can be set either with individual graphics state operators or with gs. It is expected that any future extensions to the graphics state will be implemented by adding new keys to the graphics state parameter dictionary, rather than by introducing new graphics state operators.
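The cumulative behavior of gs can be sketched as a dictionary update that touches only the entries present. This is a simplified model covering a handful of keys; the field names on the right-hand side of the mapping and the function name are hypothetical, and the initial values are those listed in Table 4.2.

```python
# Hypothetical mapping from parameter dictionary keys to state fields
KEY_TO_PARAM = {
    "LW": "line_width",
    "LC": "line_cap",
    "LJ": "line_join",
    "ML": "miter_limit",
    "SA": "stroke_adjustment",
}


def apply_gs(state, ext_gstate):
    """Return a new state with only the supplied entries overridden."""
    new_state = dict(state)
    for key, value in ext_gstate.items():
        if key in KEY_TO_PARAM:        # entries such as Type are skipped
            new_state[KEY_TO_PARAM[key]] = value
    return new_state


state = {"line_width": 1.0, "line_cap": 0, "line_join": 0,
         "miter_limit": 10.0, "stroke_adjustment": False}
state = apply_gs(state, {"Type": "ExtGState", "LW": 3.0})
state = apply_gs(state, {"LC": 1})     # LW set earlier persists
```

After both invocations the line width is still 3.0 and the line cap is 1; every parameter not mentioned in either dictionary keeps its earlier value.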

TABLE 4.8 Entries in a graphics state parameter dictionary

KEY TYPE DESCRIPTION

Type name (Optional) The type of PDF object that this dictionary describes; must be

ExtGState for a graphics state parameter dictionary.

Font array (Optional; PDF 1.3) An array of the form [font size], where font is an indirect reference to a font dictionary and size is a number expressed in text space units. These two objects correspond to the operands of the Tf operator (see Section 5.2, “Text State Parameters and Operators”); however, the first operand is an indirect object reference instead of a resource name.

LW number (Optional; PDF 1.3) The line width (see “Line Width” on page 139).

LC integer (Optional; PDF 1.3) The line cap style (see “Line Cap Style” on page 139).

LJ integer (Optional; PDF 1.3) The line join style (see “Line Join Style” on page 140).

ML number (Optional; PDF 1.3) The miter limit (see “Miter Limit” on page 140).

D array (Optional; PDF 1.3) The line dash pattern, expressed as an array of the form

[dashArray dashPhase], where dashArray is itself an array and dashPhase is an

integer (see “Line Dash Pattern” on page 141).

RI name (Optional; PDF 1.3) The name of the rendering intent (see “Rendering In-

tents” on page 179).

SA boolean (Optional) A ﬂag specifying whether to apply automatic stroke adjustment

(see Section 6.5.4, “Automatic Stroke Adjustment”).

OP boolean (Optional) A flag specifying whether to apply overprint (see
Section 4.5.6, “Overprint Control”). In PDF 1.2 and earlier, there is a single
overprint parameter that applies to all painting operations. In PDF 1.3, there
are two separate overprint parameters: one for stroking and one for all other
painting operations. Specifying an OP entry sets both parameters unless there is
also an op entry in the same graphics state parameter dictionary, in which case
the OP entry sets only the overprint parameter for stroking.

op boolean (Optional; PDF 1.3) A ﬂag specifying whether to apply overprint (see

Section 4.5.6, “Overprint Control”) for painting operations other than strok-

ing. If this entry is absent, the

OP entry, if any, sets this parameter.

OPM integer (Optional; PDF 1.3) The overprint mode (see Section 4.5.6, “Overprint Con-

trol”).

BG function (Optional) The black-generation function, which maps the interval [0.0 1.0]

to the interval [0.0 1.0] (see Section 6.2.3, “Conversion from DeviceRGB to

DeviceCMYK”).

BG2 function or name (Optional; PDF 1.3) Same as BG except that the value may also be the name

Default, denoting the black-generation function that was in effect at the start

of the page. If both

BG and BG2 are present, BG2 takes precedence.

UCR function (Optional) The undercolor-removal function, which maps the interval

[0.0 1.0] to the interval [−1.0 1.0] (see Section 6.2.3, “Conversion from

DeviceRGB to DeviceCMYK”).

UCR2 function or name (Optional; PDF 1.3) Same as UCR except that the value may also be the name

Default, denoting the undercolor-removal function that was in effect at the

start of the page. If both

UCR and UCR2 are present, UCR2 takes precedence.

TR function, array, or name (Optional) The transfer function, which maps the
interval [0.0 1.0] to the interval [0.0 1.0] (see Section 6.3, “Transfer
Functions”). The value is either a single function (which applies to all process
colorants) or an array of four functions (which apply to the process colorants
individually). The name Identity may be used to represent the identity function.

TR2 function, array, or name (Optional; PDF 1.3) Same as TR except that the
value may also be the name Default, denoting the transfer function that was in
effect at the start of the page. If both TR and TR2 are present, TR2 takes
precedence.

HT dictionary, stream, or name (Optional) The halftone dictionary or stream
(see Section 6.4, “Halftones”) or the name Default, denoting the halftone that
was in effect at the start of the page.

FL number (Optional; PDF 1.3) The ﬂatness tolerance (see Section 6.5.1, “Flatness Toler-

ance”).

SM number (Optional; PDF 1.3) The smoothness tolerance (see Section 6.5.2, “Smooth-

ness Tolerance”).
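The interaction between the OP and op entries described in Table 4.8 can be expressed as a small function. The sketch below assumes the two PDF 1.3 overprint parameters are held as two booleans and that the parameter dictionary is a plain dict; apply_overprint is a hypothetical name, not part of any library.

```python
def apply_overprint(stroking, nonstroking, params):
    """Return the (stroking, nonstroking) overprint flags after applying params.

    'params' is a hypothetical dict form of a graphics state parameter
    dictionary; only its OP and op entries are examined.
    """
    if "OP" in params:
        stroking = params["OP"]
        if "op" not in params:
            # With no op entry, OP sets both parameters.
            nonstroking = params["OP"]
    if "op" in params:
        nonstroking = params["op"]
    return stroking, nonstroking


print(apply_overprint(False, False, {"OP": True}))               # (True, True)
print(apply_overprint(False, False, {"OP": True, "op": False}))  # (True, False)
print(apply_overprint(False, False, {"op": True}))               # (False, True)
```

If neither entry is present, both parameters are left unchanged, consistent with the cumulative behavior of gs.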

Example 4.1 shows two graphics state parameter dictionaries. In the first,
automatic stroke adjustment is turned on, and the dictionary includes a transfer
function that inverts its value, f(x) = 1 − x. In the second, overprint is
turned off, and the dictionary includes a parabolic transfer function,
f(x) = (2x − 1)^2, with a sample of 21 values. The domain of the transfer
function, [0.0 1.0], is mapped to [0 20], and the range of the sample values,
[0 255], is mapped to the range of the transfer function, [0.0 1.0].

Example 4.1

10 0 obj                        % Page object
  << /Type /Page
     /Parent 5 0 R
     /Resources 20 0 R
     /Contents 40 0 R
  >>
endobj

20 0 obj                        % Resource dictionary for page
  << /ProcSet [/PDF /Text]
     /Font << /F1 25 0 R >>
     /ExtGState << /GS1 30 0 R
                   /GS2 35 0 R
                >>
  >>
endobj

30 0 obj                        % First graphics state parameter dictionary
  << /Type /ExtGState
     /SA true
     /TR 31 0 R
  >>
endobj

31 0 obj                        % First transfer function
  << /FunctionType 0
     /Domain [0.0 1.0]
     /Range [0.0 1.0]
     /Size 2
     /BitsPerSample 8
     /Length 7
     /Filter /ASCIIHexDecode
  >>
stream
01 00 >
endstream
endobj

35 0 obj                        % Second graphics state parameter dictionary
  << /Type /ExtGState
     /OP false
     /TR 36 0 R
  >>
endobj

36 0 obj                        % Second transfer function
  << /FunctionType 0
     /Domain [0.0 1.0]
     /Range [0.0 1.0]
     /Size 21
     /BitsPerSample 8
     /Length 63
     /Filter /ASCIIHexDecode
  >>
stream
FF CE A3 7C 5B 3F 28 16 0A 02 00 02 0A 16 28 3F 5B 7C A3 CE FF >
endstream
endobj
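The mapping described for the second transfer function (domain [0.0 1.0] to sample positions [0 20], sample values [0 255] to range [0.0 1.0]) can be checked with a short Python sketch. It is an illustration only: decode_ascii_hex and sampled_function are hypothetical helper names, a one-input function is assumed, and linear interpolation between adjacent samples is assumed.

```python
def decode_ascii_hex(text):
    """Decode ASCIIHexDecode data up to the '>' end-of-data marker."""
    hex_digits = "".join(text.split(">")[0].split())
    if len(hex_digits) % 2:
        hex_digits += "0"  # an odd final digit is treated as if followed by 0
    return bytes.fromhex(hex_digits)


def sampled_function(samples, domain=(0.0, 1.0), rng=(0.0, 1.0), bits=8):
    """Build f(x) from one-dimensional samples, interpolating linearly."""
    max_sample = (1 << bits) - 1
    last = len(samples) - 1

    def f(x):
        x = min(max(x, domain[0]), domain[1])  # clip the input to the domain
        pos = (x - domain[0]) / (domain[1] - domain[0]) * last  # [0, Size - 1]
        i = min(int(pos), last - 1)
        frac = pos - i
        raw = samples[i] + frac * (samples[i + 1] - samples[i])
        return rng[0] + raw / max_sample * (rng[1] - rng[0])

    return f


data = "FF CE A3 7C 5B 3F 28 16 0A 02 00 02 0A 16 28 3F 5B 7C A3 CE FF >"
samples = decode_ascii_hex(data)
f = sampled_function(samples)
print(len(samples), f(0.0), f(0.5), f(1.0))  # 21 1.0 0.0 1.0
```

The sampled values track the parabola (2x − 1)^2 to within the 8-bit quantization; for example, f(0.25) evaluates to 63/255, close to the exact value 0.25.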

4.4 Path Construction and Painting

Paths deﬁne shapes, trajectories, and regions of all sorts. They are used to draw

lines, deﬁne the shapes of ﬁlled areas, and specify boundaries for clipping other

graphics. The graphics state includes a clipping path that deﬁnes the clipping

boundary for the current page. At the beginning of each page, the clipping path is

initialized to include the entire page.

A path is composed of straight and curved line segments, which may connect to

one another or may be disconnected. A pair of segments are said to connect only if

they are deﬁned consecutively, with the second segment starting where the ﬁrst

one ends. Thus the order in which the segments of a path are deﬁned is signiﬁ-

cant. Nonconsecutive segments that meet or intersect fortuitously are not consid-

ered to connect.

A path is made up of one or more disconnected subpaths, each comprising a

sequence of connected segments. The topology of the path is unrestricted: it may

be concave or convex, may contain multiple subpaths representing disjoint areas,

and may intersect itself in arbitrary ways. There is an operator,

h, that explicitly

connects the end of a subpath back to its starting point; such a subpath is said to

be closed. A subpath that has not been explicitly closed is open.

As discussed in Section 4.1, “Graphics Objects,” a path object is deﬁned by a

sequence of operators to construct the path, followed by one or more operators

to paint the path or to use it as a clipping path. PDF path operators fall into three

categories:

• Path construction operators (Section 4.4.1) deﬁne the geometry of a path. A

path is constructed by sequentially applying one or more of these operators.

• Path-painting operators (Section 4.4.2) end a path object, usually causing the

object to be painted on the current page in any of a variety of ways.

• Clipping path operators (Section 4.4.3), invoked immediately prior to a path-

painting operator, cause the path object also to be used for clipping of sub-

sequent graphics objects.

4.4.1 Path Construction Operators

A page description begins with an empty path and builds up its deﬁnition by in-

voking one or more path construction operators to add segments to it. The path

construction operators may be invoked in any sequence, but the ﬁrst one invoked

must be

m or re to begin a new subpath. The path deﬁnition concludes with the

application of a path-painting operator such as

S, f, or b (see Section 4.4.2, “Path-

Painting Operators”); this may optionally be preceded by one of the clipping path

operators

W or W* (Section 4.4.3, “Clipping Path Operators”). Note that the path

construction operators in themselves do not place any marks on the page; only

the painting operators do that. A path deﬁnition is not complete until a path-

painting operator has been applied to it.

The path currently under construction is called the current path. In PDF (unlike

PostScript), the current path is not part of the graphics state and is not saved and

restored along with the other graphics state parameters. PDF paths are strictly in-

ternal objects with no explicit representation. Once a path has been painted, it is

no longer deﬁned; there is then no current path until a new one is begun with the

m or re operator.

The trailing endpoint of the segment most recently added to the current path is

referred to as the current point. If the current path is empty, the current point is

undeﬁned. Most operators that add a segment to the current path start at the cur-

rent point; if the current point is undeﬁned, they generate an error.

Table 4.9 shows the path construction operators. All operands are numbers de-

noting coordinates in user space.

TABLE 4.9 Path construction operators

OPERANDS OPERATOR DESCRIPTION

x y m Begin a new subpath by moving the current point to coordinates (x, y),
omitting any connecting line segment. If the previous path construction operator
in the current path was also m, the new m overrides it; no vestige of the
previous m operation remains in the path.

x y l (lowercase L) Append a straight line segment from the current point to
the point (x, y). The new current point is (x, y).

x1 y1 x2 y2 x3 y3 c Append a cubic Bézier curve to the current path. The curve
extends from the current point to the point (x3, y3), using (x1, y1) and
(x2, y2) as the Bézier control points (see “Cubic Bézier Curves,” below). The
new current point is (x3, y3).

x2 y2 x3 y3 v Append a cubic Bézier curve to the current path. The curve extends
from the current point to the point (x3, y3), using the current point and
(x2, y2) as the Bézier control points (see “Cubic Bézier Curves,” below). The
new current point is (x3, y3).

x1 y1 x3 y3 y Append a cubic Bézier curve to the current path. The curve extends
from the current point to the point (x3, y3), using (x1, y1) and (x3, y3) as the
Bézier control points (see “Cubic Bézier Curves,” below). The new current point
is (x3, y3).

— h Close the current subpath by appending a straight line segment

from the current point to the starting point of the subpath. This operator
terminates the current subpath; appending another segment to the current path
will begin a new subpath, even if the new segment begins at the endpoint reached
by the h operation. If the current subpath is already closed or the current path
is empty, h does nothing.

x y width height re Append a rectangle to the current path as a complete
subpath, with lower-left corner (x, y) and dimensions width and height in user
space. The operation

    x y width height re

is equivalent to

    x y m
    (x + width) y l
    (x + width) (y + height) l
    x (y + height) l
    h
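The equivalence given for the re operator in Table 4.9 can be written out as a small Python function. This is a sketch for illustration; expand_re is a hypothetical name, and each operator is represented as a tuple of the operator name followed by its operands.

```python
def expand_re(x, y, width, height):
    """Return the operator sequence equivalent to 'x y width height re'."""
    return [
        ("m", x, y),                   # begin the subpath at the lower-left corner
        ("l", x + width, y),           # bottom edge
        ("l", x + width, y + height),  # right edge
        ("l", x, y + height),          # top edge
        ("h",),                        # close the subpath along the left edge
    ]


for op, *operands in expand_re(10, 20, 100, 50):
    print(*operands, op)
```

The loop prints the sequence in content stream order, operands first and the operator last.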

Cubic Bézier Curves

Curved path segments are specified as cubic Bézier curves. Such curves are
defined by four points: the two endpoints (the current point P_0 and the final
point P_3) and two control points P_1 and P_2. Given the coordinates of the four
points, the curve is generated by varying the parameter t from 0.0 to 1.0 in the
following equation:

    R(t) = (1 - t)^3 P_0 + 3t(1 - t)^2 P_1 + 3t^2 (1 - t) P_2 + t^3 P_3

When t = 0.0, the value of the function R(t) coincides with the current point
P_0; when t = 1.0, R(t) coincides with the final point P_3. Intermediate values
of t generate intermediate points along the curve. The curve does not, in
general, pass through the two control points P_1 and P_2.

Cubic Bézier curves have two desirable properties:

• The curve can be very quickly split into smaller pieces for rapid rendering.

• The curve is contained within the convex hull of the four points defining the
curve, most easily visualized as the polygon obtained by stretching a rubber
band around the outside of the four points. This property allows rapid testing
of whether the curve lies completely outside the visible region, and hence does
not have to be rendered.

The Bibliography lists several books that describe cubic Bézier curves in more
depth.

The most general PDF operator for constructing curved path segments is c, which
specifies the coordinates of points P_1, P_2, and P_3 explicitly, as shown in
Figure 4.8. (The starting point, P_0, is defined implicitly by the current
point.) Two more operators, v and y, each specify one of the two control points
implicitly (see Figure 4.9). In each case, one control point and the final point
of the curve are supplied as operands; the other control point is implied, as
follows:

• For the v operator, the first control point coincides with the initial point
of the curve.

• For the y operator, the second control point coincides with the final point
of the curve.

FIGURE 4.8 Cubic Bézier curve generated by the c operator

FIGURE 4.9 Cubic Bézier curves generated by the v and y operators
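The cubic Bézier equation and the implied control points of the v and y operators can be sketched in Python as follows. The function names bezier_point and control_points are illustrative assumptions; points are (x, y) tuples in user space.

```python
def bezier_point(p0, p1, p2, p3, t):
    """Evaluate R(t) for endpoints p0, p3 and control points p1, p2."""
    s = 1.0 - t
    return tuple(
        s**3 * a + 3 * t * s**2 * b + 3 * t**2 * s * c + t**3 * d
        for a, b, c, d in zip(p0, p1, p2, p3)
    )


def control_points(op, current, operands):
    """Return (P0, P1, P2, P3) for the operator 'c', 'v', or 'y'."""
    pts = [tuple(operands[i:i + 2]) for i in range(0, len(operands), 2)]
    if op == "c":
        p1, p2, p3 = pts
    elif op == "v":
        p2, p3 = pts
        p1 = current  # first control point coincides with the initial point
    elif op == "y":
        p1, p3 = pts
        p2 = p3       # second control point coincides with the final point
    else:
        raise ValueError(f"not a curve operator: {op}")
    return current, p1, p2, p3


pts = control_points("c", (0, 0), [0, 1, 1, 1, 1, 0])
print(bezier_point(*pts, 0.5))  # (0.5, 0.75)
```

At t = 0 and t = 1 the function returns the two endpoints, while the control points (0, 1) and (1, 1) in this usage are not on the curve.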

4.4.2 Path-Painting Operators

The path-painting operators end a path object, causing it to be painted on the
current page in the manner that the operator specifies. The principal
path-painting operators are S (for stroking) and f (for filling). Variants of
these operators combine stroking and filling in a single operation or apply
different rules for determining the area to be filled. Table 4.10 lists all the
path-painting operators.

TABLE 4.10 Path-painting operators

OPERANDS OPERATOR DESCRIPTION

— S Stroke the path.

— s Close and stroke the path. This operator has the same effect as the sequence h S.

— f Fill the path, using the nonzero winding number rule to determine the region to ﬁll

(see “Nonzero Winding Number Rule” on page 154).

— F Equivalent to f; included only for compatibility. Although applications that read

PDF ﬁles must be able to accept this operator, those that generate PDF ﬁles should

use

f instead.

— f* Fill the path, using the even-odd rule to determine the region to ﬁll (see “Even-Odd

Rule” on page 155).

— B Fill and then stroke the path, using the nonzero winding number rule to determine

the region to ﬁll. This produces the same result as constructing two identical path

objects, painting the ﬁrst with

f and the second with S. Note, however, that the ﬁll-

ing and stroking portions of the operation consult different values of several graph-

ics state parameters, such as the color.

— B* Fill and then stroke the path, using the even-odd rule to determine the region to ﬁll.

This operator produces the same result as

B, except that the path is ﬁlled as if with

f* instead of f.

— b Close, ﬁll, and then stroke the path, using the nonzero winding number rule to

determine the region to ﬁll. This operator has the same effect as the sequence

h B.

— b* Close, ﬁll, and then stroke the path, using the even-odd rule to determine the

region to ﬁll. This operator has the same effect as the sequence

h B*.

— n End the path object without ﬁlling or stroking it. This operator is a “path-painting

no-op,” used primarily for the side effect of changing the clipping path (see

Section 4.4.3, “Clipping Path Operators”).

Stroking

The S operator paints a line along the current path. The stroked line follows each

straight or curved segment in the path, centered on the segment with sides paral-

lel to it. Each of the path’s subpaths is treated separately.

The results of the S operator depend on the current settings of various

parameters in the graphics state. See Section 4.3, “Graphics State,” for further

information on these parameters.

• The width of the stroked line is determined by the line width parameter (“Line

Width” on page 139).

• The color or pattern of the line is determined by the color and color space

parameters for stroking.

• The line can be painted either solid or with a dash pattern, as speciﬁed by the

dash pattern parameter (“Line Dash Pattern” on page 141).

• If a subpath is open, the unconnected ends are treated according to the line cap

parameter, which may be butt, rounded, or square (“Line Cap Style” on

page 139).

• Wherever two consecutive segments are connected, the joint between them is

treated according to the line join parameter, which may be mitered, rounded, or

beveled (“Line Join Style” on page 140). Mitered joins are also subject to the

miter limit parameter (“Miter Limit” on page 140).

Note: Points at which unconnected segments happen to meet or intersect receive no

special treatment. In particular, “closing” a subpath with an explicit

l operator

rather than with

h may result in a messy corner, because line caps will be applied

instead of a line join.

• The stroke adjustment parameter (PDF 1.2) speciﬁes that coordinates and line

widths be adjusted automatically to produce strokes of uniform thickness

despite rasterization effects (Section 6.5.4, “Automatic Stroke Adjustment”).

If a subpath is degenerate (consists of a single-point closed path or of two or

more points at the same coordinates), the

S operator paints it only if round line

caps have been speciﬁed, producing a ﬁlled circle centered at the single point. If

butt or projecting square line caps have been speciﬁed,

S produces no output, be-

cause the orientation of the caps would be indeterminate. A single-point open

subpath (speciﬁed by a trailing

m operator) produces no output.

Filling

The f operator uses the current nonstroking color to paint the entire region
enclosed by the current path. If the path consists of several disconnected
subpaths, f paints the insides of all subpaths, considered together. Any
subpaths that are open are implicitly closed before being filled.

If a subpath is degenerate (consists of a single-point closed path or of two or

more points at the same coordinates),

f paints the single device pixel lying under

that point; the result is device-dependent and not generally useful. A single-point

open subpath (speciﬁed by a trailing

m operator) produces no output.

For a simple path, it is intuitively clear what region lies inside. However, for a

more complex path—for example, a path that intersects itself or has one subpath

that encloses another—the interpretation of “inside” is not always obvious. The

path machinery uses one of two rules for determining which points lie inside a

path: the nonzero winding number rule and the even-odd rule, both discussed in

detail below.

The nonzero winding number rule is more versatile than the even-odd rule and is

the standard rule the

f operator uses. Similarly, the W operator uses this rule to

determine the inside of the current clipping path. The even-odd rule is occasion-

ally useful for special effects or for compatibility with other graphics systems; the

f* and W* operators invoke this rule.

Nonzero Winding Number Rule

The nonzero winding number rule determines whether a given point is inside a

path by conceptually drawing a ray from that point to inﬁnity in any direction

and then examining the places where a segment of the path crosses the ray. Start-

ing with a count of 0, the rule adds 1 each time a path segment crosses the ray

from left to right and subtracts 1 each time a segment crosses from right to left.

After counting all the crossings, if the result is 0 then the point is outside the path;

otherwise it is inside.

Note: The method just described does not specify what to do if a path segment coin-

cides with or is tangent to the chosen ray. Since the direction of the ray is arbitrary,

the rule simply chooses a ray that does not encounter such problem intersections.

For simple convex paths, the nonzero winding number rule deﬁnes the inside and

outside as one would intuitively expect. The more interesting cases are those in-

volving complex or self-intersecting paths like the ones shown in Figure 4.10. For

a path consisting of a ﬁve-pointed star, drawn with ﬁve connected straight line

segments intersecting each other, the rule considers the inside to be the entire

area enclosed by the star, including the pentagon in the center. For a path com-

posed of two concentric circles, the areas enclosed by both circles are considered

to be inside, provided that both are drawn in the same direction. If the circles are

drawn in opposite directions, only the “doughnut” shape between them is inside,

according to the rule; the “doughnut hole” is outside.

FIGURE 4.10 Nonzero winding number rule
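The nonzero winding number rule can be sketched in Python for paths made of straight segments. The sketch rests on several assumptions: each subpath is a list of (x, y) vertices that is implicitly closed, the ray is cast in the +x direction, squares stand in for the concentric circles, and the sign convention may be the reverse of the one described in the text, which does not affect the test against zero. The names winding_number and star are illustrative.

```python
import math


def winding_number(point, subpaths):
    """Signed count of path crossings of a ray cast from 'point' toward +x."""
    px, py = point
    wn = 0
    for sub in subpaths:
        for (x0, y0), (x1, y1) in zip(sub, sub[1:] + sub[:1]):
            side = (x1 - x0) * (py - y0) - (px - x0) * (y1 - y0)
            if y0 <= py < y1 and side > 0:    # upward edge crossing the ray
                wn += 1
            elif y1 <= py < y0 and side < 0:  # downward edge crossing the ray
                wn -= 1
    return wn


def star(radius=1.0):
    """Five-pointed star drawn through every second vertex of a pentagon."""
    verts = [(radius * math.cos(math.radians(90 + 72 * k)),
              radius * math.sin(math.radians(90 + 72 * k))) for k in range(5)]
    return [verts[i] for i in (0, 2, 4, 1, 3)]


outer = [(0, 0), (4, 0), (4, 4), (0, 4)]
inner_same = [(1, 1), (3, 1), (3, 3), (1, 3)]
inner_opposite = inner_same[::-1]

print(winding_number((0.0, 0.0), [star()]) != 0)               # True
print(winding_number((2, 2), [outer, inner_same]) != 0)        # True
print(winding_number((2, 2), [outer, inner_opposite]) != 0)    # False
```

The center of the star and the center of two same-direction squares are inside; the center becomes outside only when the inner subpath is drawn in the opposite direction.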

Even-Odd Rule

An alternative to the nonzero winding number rule is the even-odd rule. This rule

determines the “insideness” of a point by drawing a ray from that point in any

direction and simply counting the number of path segments that cross the ray,

regardless of direction. If this number is odd, the point is inside; if even, the point

is outside. This yields the same results as the nonzero winding number rule for

paths with simple shapes, but produces different results for more complex

shapes.

Figure 4.11 shows the effects of applying the even-odd rule to complex paths. For

the ﬁve-pointed star, the rule considers the triangular points to be inside the path,

but not the pentagon in the center. For the two concentric circles, only the

“doughnut” shape between the two circles is considered inside, regardless of the

directions in which the circles are drawn.

FIGURE 4.11 Even-odd rule
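The even-odd rule can be sketched in the same style. As before, this is an illustration under assumptions: subpaths are implicitly closed lists of (x, y) vertices joined by straight segments, the ray is cast toward +x, squares stand in for the concentric circles, and the names crossings, inside_even_odd, and star are hypothetical.

```python
import math


def crossings(point, subpaths):
    """Count path segments that cross a ray cast from 'point' toward +x."""
    px, py = point
    count = 0
    for sub in subpaths:
        for (x0, y0), (x1, y1) in zip(sub, sub[1:] + sub[:1]):
            if (y0 <= py < y1) or (y1 <= py < y0):
                # x coordinate at which the segment meets the ray's height
                x_at = x0 + (py - y0) * (x1 - x0) / (y1 - y0)
                if x_at > px:
                    count += 1
    return count


def inside_even_odd(point, subpaths):
    """A point is inside if the number of crossings is odd."""
    return crossings(point, subpaths) % 2 == 1


def star(radius=1.0):
    """Five-pointed star drawn through every second vertex of a pentagon."""
    verts = [(radius * math.cos(math.radians(90 + 72 * k)),
              radius * math.sin(math.radians(90 + 72 * k))) for k in range(5)]
    return [verts[i] for i in (0, 2, 4, 1, 3)]


outer = [(0, 0), (4, 0), (4, 4), (0, 4)]
inner = [(1, 1), (3, 1), (3, 3), (1, 3)]

print(inside_even_odd((0.0, 0.8), [star()]))   # True: a triangular point
print(inside_even_odd((0.0, 0.0), [star()]))   # False: the central pentagon
print(inside_even_odd((2, 2), [outer, inner])) # False: the "doughnut hole"
```

Reversing the direction of the inner square does not change the result, because the rule ignores the direction of each crossing.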

4.4.3 Clipping Path Operators

The graphics state contains a clipping path that limits the regions of the page

affected by painting operators. The closed subpaths of this path deﬁne the area

that can be painted. Marks falling inside this area will be applied to the page;

those falling outside it will not. (Precisely what is considered to be “inside” a path

is discussed under “Filling,” above.)

The initial clipping path includes the entire page. A clipping path operator
(W or W*, shown in Table 4.11) may appear after the last path construction
operator and before the path-painting operator that terminates a path object.
Although the clipping path operator appears before the painting operator, it
does not alter the clipping path at the point where it appears. Rather, it
modifies the effect of the succeeding painting operator. After the path has been
painted, the clipping path in the graphics state is set to the intersection of
the current clipping path and the newly constructed path.

TABLE 4.11 Clipping path operators

OPERANDS OPERATOR DESCRIPTION

— W Modify the current clipping path by intersecting it with the current path, using the

nonzero winding number rule to determine which regions lie inside the clipping

path.

— W* Modify the current clipping path by intersecting it with the current path, using the

even-odd rule to determine which regions lie inside the clipping path.

Note: In addition to path objects, text objects can also be used for clipping; see

Section 5.2.5, “Text Rendering Mode.”

The n operator (see Table 4.10 on page 152) is a “no-op” path-painting operator;

it causes no marks to be placed on the page, but it can be used with a clipping

path operator to establish a new clipping path. That is, after a path has been con-

structed, the sequence

W n will intersect that path with the current clipping path

to establish a new clipping path.

There is no way to enlarge the current clipping path or to set a new clipping path

without reference to the current one. However, since the clipping path is part of

the graphics state, its effect can be localized to speciﬁc graphics objects by en-

closing the modiﬁcation of the clipping path and the painting of those objects

between a pair of

q and Q operators (see Section 4.3.1, “Graphics State Stack”).

Execution of the

Q operator causes the clipping path to revert to the value that

was saved by the

q operator, before the clipping path was modiﬁed.
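The way clipping can only shrink the clipping path, and the way q and Q localize its effect, can be modeled with a deliberately simplified Python sketch. It assumes that every clipping path is an axis-aligned rectangle (x0, y0, x1, y1), which real paths are not; GraphicsState, clip_to, and intersect are hypothetical names, and clip_to stands for a sequence such as "x y width height re W n".

```python
def intersect(a, b):
    """Intersect two rectangles (x0, y0, x1, y1); None is an empty region."""
    if a is None or b is None:
        return None
    x0, y0 = max(a[0], b[0]), max(a[1], b[1])
    x1, y1 = min(a[2], b[2]), min(a[3], b[3])
    return (x0, y0, x1, y1) if x0 < x1 and y0 < y1 else None


class GraphicsState:
    """Toy graphics state holding only the clipping path."""

    def __init__(self, page_box):
        self.clip = page_box  # the initial clipping path is the entire page
        self._stack = []

    def q(self):
        self._stack.append(self.clip)  # save the current clipping path

    def Q(self):
        self.clip = self._stack.pop()  # revert to the saved clipping path

    def clip_to(self, rect):
        # The new clipping path is the intersection with the current one.
        self.clip = intersect(self.clip, rect)


gs = GraphicsState((0, 0, 612, 792))
gs.q()
gs.clip_to((100, 100, 300, 300))
gs.clip_to((200, 0, 612, 792))
print(gs.clip)  # (200, 100, 300, 300)
gs.Q()
print(gs.clip)  # (0, 0, 612, 792)
```

No operation in this model enlarges the clipping region; the only way back to a larger region is Q restoring a value saved earlier by q.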

4.5 Color Spaces

PDF includes powerful facilities for specifying the colors of graphics objects to be

painted on the current page. The color facilities are divided into two parts:

• Color speciﬁcation. A PDF ﬁle can specify abstract colors in a device-

independent way. Colors can be described in any of a variety of color systems,

or color spaces. Some color spaces are related to device color representation

(grayscale, RGB, CMYK), others to human visual perception (CIE-based). Cer-

tain special features are also modeled as color spaces: patterns, color mapping,

separations, and high-ﬁdelity and multitone color.

• Color rendering. The viewer application reproduces colors on the raster output

device by a multiple-step process that includes some combination of color con-

version, gamma correction, halftoning, and scan conversion. Some aspects of

this process use information that is speciﬁed in PDF. However, unlike the facil-

ities for color speciﬁcation, the color rendering facilities are device-dependent

and ordinarily should not be included in a page description.

Figures 4.12 and 4.13 on pages 158 and 159 illustrate the division between PDF’s

(device-independent) color speciﬁcation and (device-dependent) color render-

ing facilities. This section describes the color speciﬁcation features, covering

everything that most PDF documents need in order to specify colors. The facili-

ties for controlling color rendering are described in Chapter 6; a PDF document

should use these facilities only to conﬁgure or calibrate an output device or to

achieve special device-dependent effects.

FIGURE 4.12 Color specification
[Diagram omitted: for the CIE-based (CalGray, CalRGB, Lab, ICCBased), device
(DeviceGray, DeviceRGB, DeviceCMYK), and special (Indexed, Pattern, Separation,
DeviceN) color spaces, it shows the operators that supply color values and the
resulting color values.]

FIGURE 4.13 Color rendering
[Diagram omitted: conversion from CIE-based to device color space; conversion
from the input device color space to the device’s process color model (UCR, BG);
transfer functions (TR) and halftones (HT), each per component.]

4.5.1 Color Values

As described in Section 4.4.2, “Path-Painting Operators,” marks placed on the

page by operators such as

f and S have a color that is determined by the current

color parameter of the graphics state. A color value consists of one or more color

components, which are usually numbers. For example, a gray level can be speciﬁed

by a single number ranging from 0.0 (black) to 1.0 (white). Full color values can

be speciﬁed in any of several ways; a common method uses three numeric values

to specify red, green, and blue components.

Color values are interpreted according to the current color space, another parame-

ter of the graphics state. A PDF content stream ﬁrst selects a color space by invok-

ing the

cs operator (for the nonstroking color) or the CS operator (for the

stroking color). It then selects color values within that color space with the

sc op-

erator (nonstroking) or the

SC operator (stroking). There are also convenience

operators—

g, G, rg, RG, k, and K—that select both a color space and a color value

within it in a single step. Table 4.21 on page 198 lists all the color-setting opera-

tors.

Sampled images (see Section 4.8, “Images”) specify the color values of individual

samples with respect to a color space designated by the image object itself. While

these values are independent of the current color space and color parameters in

the graphics state, all later stages of color processing treat them in exactly the

same way as color values speciﬁed with the

sc or SC operator.

4.5.2 Types of Color Space

Color spaces can be classiﬁed into color space families. Spaces within a family

share the same general characteristics; they are distinguished by parameter values

supplied at the time the space is speciﬁed. The families, in turn, fall into three

broad categories:

• Device color spaces directly specify colors or shades of gray that the output

device is to produce. They provide a variety of color speciﬁcation methods,

including gray level, RGB (red-green-blue), and CMYK (cyan-magenta-yellow-

black), corresponding to the color space families

DeviceGray, DeviceRGB, and

DeviceCMYK. Since each of these families consists of just a single color space

with no parameters, they are sometimes loosely referred to as the

DeviceGray,

DeviceRGB, and DeviceCMYK color spaces.

• CIE-based color spaces are based on an international standard for color
specification created by the Commission Internationale de l’Éclairage
(International Commission on Illumination). These spaces allow colors to be
specified in a way that is independent of the characteristics of any particular
output device. Color space families in this category include CalGray, CalRGB,
Lab, and ICCBased. Individual color spaces within these families are specified
by means of dictionaries containing the parameter values needed to define the
space.

• Special color spaces add features or properties to an underlying color space.

They include facilities for patterns, color mapping, separations, and high-

ﬁdelity and multitone color. The corresponding color space families are

Pattern, Indexed, Separation, and DeviceN. Individual color spaces within

these families are speciﬁed by means of additional parameters.

Table 4.12 summarizes the color space families supported by PDF. (See imple-

mentation note 28 in Appendix H.)

TABLE 4.12 Color space families

DEVICE                  CIE-BASED            SPECIAL

DeviceGray (PDF 1.1)    CalGray (PDF 1.1)    Indexed (PDF 1.1)
DeviceRGB (PDF 1.1)     CalRGB (PDF 1.1)     Pattern (PDF 1.2)
DeviceCMYK (PDF 1.1)    Lab (PDF 1.1)        Separation (PDF 1.2)
                        ICCBased (PDF 1.3)   DeviceN (PDF 1.3)

A color space is deﬁned by an array object whose ﬁrst element is a name object

identifying the color space family. The remaining array elements, if any, are

parameters that further characterize the color space; their number and types vary

according to the particular family. For families that do not require parameters,

the color space can be speciﬁed simply by the family name itself instead of an

array.

There are two principal ways in which a color space can be speciﬁed:

• Within a content stream, the cs or CS operator establishes the color space
parameter in the graphics state. The operand is always a name object, which
either identifies one of the color spaces that need no additional parameters
(DeviceGray, DeviceRGB, DeviceCMYK, or some cases of Pattern) or is used as a
key in the ColorSpace subdictionary of the current resource dictionary (see
Section 3.7.2, “Resource Dictionaries”). In the latter case, the value of the
dictionary entry is in turn a color space array or name. A color space array is
never permitted in-line within a content stream.

• Outside a content stream, certain objects, such as image XObjects, specify a

color space as an explicit parameter, often associated with the key

ColorSpace.

In this case, the color space array or name is always deﬁned directly as a PDF

object, not by an entry in the

ColorSpace resource subdictionary. This conven-

tion also applies when color spaces are deﬁned in terms of other color spaces.
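The lookup performed for the operand of cs or CS can be sketched in Python. This is a simplified illustration: resolve_color_space is a hypothetical name, the resource dictionary is modeled as nested dicts, color space arrays as lists, and the distinction between the cases of Pattern that need parameters and those that do not is ignored.

```python
# Names usable directly as an operand, without a resource lookup.
PARAMETERLESS = {"DeviceGray", "DeviceRGB", "DeviceCMYK", "Pattern"}


def resolve_color_space(name, resources):
    """Resolve the name operand of cs or CS to a color space name or array."""
    if name in PARAMETERLESS:
        return name
    try:
        # Otherwise the name is a key in the ColorSpace resource subdictionary.
        return resources["ColorSpace"][name]
    except KeyError:
        raise KeyError(f"color space {name!r} is not in the ColorSpace resources") from None


resources = {"ColorSpace": {"CS0": ["Indexed", "DeviceRGB", 255, b"..."]}}
print(resolve_color_space("DeviceRGB", resources))  # DeviceRGB
print(resolve_color_space("CS0", resources)[0])     # Indexed
```

The array for CS0 lives in the resource dictionary rather than in the content stream, which matches the rule that a color space array is never written in-line.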

The following operators set the color space and color value parameters in the

graphics state:

• cs sets the nonstroking color space; CS sets the stroking color space.

• sc and scn set the nonstroking color; SC and SCN set the stroking color.

Depending on the color space, these operators require one or more operands,

each specifying one component of the color value.

• g, rg, and k set the nonstroking color space implicitly and the nonstroking

color as speciﬁed by the operands;

G, RG, and K do the same for the stroking

color space and color.
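The rule for interpreting the name operand of cs and CS can be sketched in Python as follows. This is only an illustration under stated assumptions: names are modeled as plain strings without the leading slash, the resource dictionary is an ordinary dict, and the helper name `resolve_cs_operand` is hypothetical rather than part of PDF or any library.

```python
# Hypothetical model: names are plain strings without the leading slash and
# the resource dictionary is an ordinary dict. This is not a real PDF API.
PARAMETERLESS = {"DeviceGray", "DeviceRGB", "DeviceCMYK", "Pattern"}


def resolve_cs_operand(name, resources):
    """Return the color space selected by the name operand of cs or CS."""
    if name in PARAMETERLESS:
        # The name itself identifies a family that needs no parameters.
        return name
    color_spaces = resources.get("ColorSpace", {})
    if name not in color_spaces:
        raise KeyError(f"/{name} is not a key in the ColorSpace subdictionary")
    # The entry's value is in turn a color space array or name.
    return color_spaces[name]


resources = {
    "ColorSpace": {
        "CS0": ["CalGray", {"WhitePoint": [0.9505, 1.0, 1.089]}],
    }
}
print(resolve_cs_operand("DeviceRGB", resources))
print(resolve_cs_operand("CS0", resources))
```

Because the lookup always goes through the resource dictionary, a content stream never carries a color space array itself, which matches the restriction stated above.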

4.5.3 Device Color Spaces

The device color spaces enable a page description to specify color values that are

directly related to their representation on an output device. Color values in these

spaces map directly (or via simple conversions) to the application of device color-

ants, such as quantities of ink or intensities of display phosphors. This enables a

PDF document to control colors precisely for a particular device, but the results

may not be consistent between different devices.

Output devices form colors either by adding light sources together or by sub-

tracting light from an illuminating source. Computer displays and ﬁlm recorders

typically add colors, while printing inks typically subtract them. These two ways

of forming colors give rise to two complementary forms of color speciﬁcation,

the additive RGB speciﬁcation and the subtractive CMYK speciﬁcation. The cor-

responding device color spaces are as follows:

• DeviceGray controls the intensity of achromatic light, on a scale from black to

white.

• DeviceRGB controls the intensities of red, green, and blue light, the three addi-

tive primary colors used in displays.

• DeviceCMYK controls the concentrations of cyan, magenta, yellow, and black

inks, the four subtractive process colors used in printing.

Although the notion of explicit color spaces is a PDF 1.1 feature, the operators for

specifying colors in the device color spaces—

g, G, rg, RG, k, and K—are available

in all versions of PDF. Beginning with PDF 1.1, colors specified in device color

spaces can optionally be remapped systematically into other color spaces; see

“Default Color Spaces” on page 177.

DeviceGray Color Space

Black, white, and intermediate shades of gray are special cases of full color. A

grayscale value is represented by a single number in the range 0.0 to 1.0, where

0.0 corresponds to black, 1.0 to white, and intermediate values to different gray

levels. Example 4.2 shows alternative ways to select the

DeviceGray color space

and a speciﬁc gray level within that space for nonstroking operations.

Example 4.2

/DeviceGray cs % Set DeviceGray color space

gray sc % Set gray level

gray g % Set both in one operation

The cs and sc operators select the color space and color value separately; g sets

them in combination. (The

CS, SC, and G operators perform the same functions

for stroking operations.) When the speciﬁed color space is

DeviceGray, the cs or

CS operator sets the corresponding color value to 0.0.

DeviceRGB Color Space

Colors in the DeviceRGB color space are speciﬁed according to the additive RGB

(red-green-blue) color model, in which color values are deﬁned by three compo-

nents representing the intensities of the additive primary colors red, green, and

blue. Each component is speciﬁed by a number in the range 0.0 to 1.0, where 0.0

denotes the complete absence of a primary component and 1.0 denotes maxi-

mum intensity. If all three components have equal intensity, the perceived result

theoretically is a pure gray on the scale from black to white. If the intensities are

not all equal, the result is some color other than a pure gray.

Example 4.3 shows alternative ways to select the

DeviceRGB color space and a

speciﬁc color within that space for nonstroking operations.

Example 4.3

/DeviceRGB cs % Set DeviceRGB color space

red green blue sc % Set color

red green blue rg % Set both in one operation

The cs and sc operators select the color space and color value separately; rg sets

them in combination. (The

CS, SC, and RG operators perform the same functions

for stroking operations.) When the speciﬁed color space is

DeviceRGB, the cs or

CS operator sets the red, green, and blue components of the corresponding color

value to 0.0.

DeviceCMYK Color Space

The DeviceCMYK color space allows colors to be speciﬁed according to the sub-

tractive CMYK (cyan-magenta-yellow-black) model typical of printers and other

paper-based output devices. In theory, each of the three standard process colorants

used in printing (cyan, magenta, and yellow) absorbs one of the additive primary

colors (red, green, and blue, respectively). Black, a fourth standard process color-

ant, absorbs all of the additive primaries in equal amounts. The four components

in a

DeviceCMYK color value represent the concentrations of these process color-

ants. Each component is speciﬁed by a number in the range 0.0 to 1.0, where 0.0

denotes the complete absence of a process colorant (that is, absorbs none of the

corresponding additive primary) and 1.0 denotes maximum concentration (ab-

sorbs as much as possible of the additive primary). Note that the sense of these

numbers is opposite to that of RGB color components.

Example 4.4 shows alternative ways to select the DeviceCMYK color space and a

speciﬁc color within that space for nonstroking operations.

Example 4.4

/DeviceCMYK cs % Set DeviceCMYK color space

cyan magenta yellow black sc % Set color

cyan magenta yellow black k % Set both in one operation

The cs and sc operators select the color space and color value separately; k sets

them in combination. (The

CS, SC, and K operators perform the same functions

for stroking operations.) When the speciﬁed color space is

DeviceCMYK, the cs or

CS operator sets the cyan, magenta, and yellow components of the corresponding

color value to 0.0 and the black component to 1.0.
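The behavior of the device color operators described in this section can be summarized in a small Python sketch. It is a hedged illustration, not viewer code: the class name `ColorState`, the way operands are passed, and the choice to model stroking and nonstroking state with the same class are all assumptions. The initial color values are the ones stated above for each device color space.

```python
# Initial color value established by cs or CS for each device color space.
INITIAL_COLOR = {
    "DeviceGray": (0.0,),
    "DeviceRGB": (0.0, 0.0, 0.0),
    "DeviceCMYK": (0.0, 0.0, 0.0, 1.0),
}
# Operators that set a device color space implicitly together with a color.
SHORTHAND = {"g": "DeviceGray", "rg": "DeviceRGB", "k": "DeviceCMYK"}


class ColorState:
    """Hypothetical model of one color space / color value pair."""

    def __init__(self, space="DeviceGray"):
        self.space = space
        self.color = INITIAL_COLOR[space]

    def _set_color(self, operands):
        expected = len(INITIAL_COLOR[self.space])
        if len(operands) != expected:
            raise ValueError(f"{self.space} takes {expected} component(s)")
        self.color = tuple(float(v) for v in operands)

    def apply(self, operator, *operands):
        # CS, SC, G, RG, and K act like their lowercase forms, but on the
        # stroking parameters; one instance of this class models either set.
        op = operator.lower()
        if op == "cs":
            space = operands[0]
            self.color = INITIAL_COLOR[space]  # reset to the initial value
            self.space = space
        elif op == "sc":
            self._set_color(operands)
        elif op in SHORTHAND:
            self.space = SHORTHAND[op]
            self._set_color(operands)
        else:
            raise ValueError(f"unsupported operator {operator!r}")


fill = ColorState()
fill.apply("cs", "DeviceCMYK")
print(fill.space, fill.color)
fill.apply("rg", 1, 0.5, 0)
print(fill.space, fill.color)
```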

4.5.4 CIE-Based Color Spaces

Calibrated color in PDF is deﬁned in terms of an international standard used in

the graphic arts, television, and printing industries. CIE-based color spaces en-

able a page description to specify color values in a way that is related to human

visual perception. The goal is for the same color speciﬁcation to produce consis-

tent results on different output devices, within the limitations of each device.

PDF 1.1 supports three CIE-based color space families, named

CalGray, CalRGB,

and

Lab; PDF 1.3 adds a fourth, named ICCBased.

Note: In PDF 1.1, a color space family named

CalCMYK was partially deﬁned, with

the expectation that its deﬁnition would be completed in a future version. However,

this is no longer being considered. PDF 1.3 supports calibrated four-component color

spaces by means of ICC proﬁles (see “ICCBased Color Spaces” on page 173). PDF

consumer applications should ignore

CalCMYK color space attributes and render

colors speciﬁed in this family as if they had been speciﬁed using

DeviceCMYK.

The details of the CIE colorimetric system and the theory on which it is based are

beyond the scope of this book; see the Bibliography for sources of further in-

formation. The semantics of CIE-based color spaces are deﬁned in terms of the

relationship between the space’s components and the tristimulus values X, Y, and

Z of the CIE 1931 XYZ space. The

CalRGB and Lab color spaces (PDF 1.1) are

special cases of three-component CIE-based color spaces, known as CIE-based

ABC color spaces. These spaces are deﬁned in terms of a two-stage, nonlinear

transformation of the CIE 1931 XYZ space. The formulation of such color spaces

models a simple zone theory of color vision, consisting of a nonlinear trichro-

matic ﬁrst stage combined with a nonlinear opponent-color second stage. This

formulation allows colors to be digitized with minimum loss of ﬁdelity, an im-

portant consideration in sampled images.

Color values in a CIE-based ABC color space have three components, arbitrarily

named A, B, and C. The ﬁrst stage transforms these components by ﬁrst forcing

their values to a speciﬁed range, then applying decoding functions, and ﬁnally

multiplying the results by a 3-by-3 matrix, producing three intermediate com-

ponents arbitrarily named L, M, and N. The second stage transforms these inter-

mediate components in a similar fashion, producing the ﬁnal X, Y, and Z

components of the CIE 1931 XYZ space (see Figure 4.14).

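FIGURE 4.14 Component transformations in a CIE-based ABC color space [diagram not reproduced; stages in order: A, B, C → Decode ABC → Matrix ABC → L, M, N → Decode LMN → Matrix LMN → X, Y, Z]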

Color spaces in the CIE-based families are deﬁned by an array

[name dictionary]

where name is the name of the family and dictionary is a dictionary containing

parameters that further characterize the space. The entries in this dictionary have

speciﬁc interpretations that vary depending on the color space; some entries are

required and some are optional.

When any CIE-based color space is established, its initial color value has all com-

ponents set to 0.0 (unless the range of valid values for a given component does

not include 0.0, in which case the nearest valid value is substituted).

Note: The model and terminology used here—CIE-based ABC (above) and CIE-

based A (below)—are derived from the PostScript language, which supports these

classes of spaces in their full generality. PDF supports speciﬁc useful cases of CIE-

based ABC and CIE-based A spaces; most others can be represented as

ICCBased

spaces.

CalGray Color Spaces

A CalGray color space (PDF 1.1) is a special case of a single-component CIE-

based color space, known as a CIE-based A color space. This type of space is the

one-dimensional (and usually achromatic) analog of CIE-based ABC spaces.

Color values in a CIE-based A space have a single component, arbitrarily named

A. Figure 4.15 illustrates the transformations of the A component to X, Y, and Z

components of the CIE 1931 XYZ space.

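FIGURE 4.15 Component transformations in a CIE-based A color space [diagram not reproduced; stages in order: A → Decode A → Matrix A → L, M, N → Decode LMN → Matrix LMN → X, Y, Z]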

A CalGray color space is a CIE-based A color space with only one transformation

stage instead of two. In this type of space, A represents the gray component of a

calibrated gray space. This component must be in the range 0.0 to 1.0. The de-

coding function (denoted by “Decode A” in Figure 4.15) is a gamma function

whose coefﬁcient is speciﬁed by the

Gamma entry in the color space dictionary

(see Table 4.13). The transformation matrix denoted by “Matrix A” in the ﬁgure

is derived from the dictionary’s

WhitePoint entry, as described below. Since there

is no second transformation stage, “Decode LMN” and “Matrix LMN” are im-

plicitly taken to be identity transformations.

TABLE 4.13 Entries in a CalGray color space dictionary

KEY         TYPE    VALUE

WhitePoint  array   (Required) An array of three numbers [$X_W$ $Y_W$ $Z_W$] specifying the tristimulus value, in the CIE 1931 XYZ space, of the diffuse white point; see “CalRGB Color Spaces,” below, for further discussion. The numbers $X_W$ and $Z_W$ must be positive, and $Y_W$ must be equal to 1.0.

BlackPoint  array   (Optional) An array of three numbers [$X_B$ $Y_B$ $Z_B$] specifying the tristimulus value, in the CIE 1931 XYZ space, of the diffuse black point; see “CalRGB Color Spaces,” below, for further discussion. All three of these numbers must be nonnegative. Default value: [0.0 0.0 0.0].

Gamma       number  (Optional) A number G defining the gamma for the gray (A) component. G must be positive and will generally be greater than or equal to 1. Default value: 1.

The transformation defined by the Gamma and WhitePoint entries is

$$X = L = X_W \times A^{G}$$

$$Y = M = Y_W \times A^{G}$$

$$Z = N = Z_W \times A^{G}$$

In other words, the A component is first decoded by the gamma function, and the result is multiplied by the components of the white point to obtain the L, M, and N components of the intermediate representation. Since there is no second stage, these are also the X, Y, and Z components of the final representation.

The following examples illustrate various interesting and useful special cases of CalGray spaces. Example 4.5 establishes a space consisting of the Y dimension of the CIE 1931 XYZ space with the CCIR XA/11–recommended D65 white point.

Example 4.5

[ /CalGray

<< /WhitePoint [0.9505 1.0000 1.0890] >>

]

Example 4.6 establishes a calibrated gray space with the CCIR XA/11–recommended D65 white point and opto-electronic transfer function.


Example 4.6

[ /CalGray

<< /WhitePoint [0.9505 1.0000 1.0890]

/Gamma 2.222

>>

]
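The CalGray transformation described above (gamma decoding followed by scaling with the white point) can be sketched in Python. This is an illustrative sketch only: the function name `calgray_to_xyz` is hypothetical, and clamping A to the range 0.0 to 1.0 is an assumption, since the text only requires A to lie in that range.

```python
def calgray_to_xyz(a, white_point, gamma=1.0):
    """Map a CalGray component A to CIE 1931 XYZ.

    white_point is the WhitePoint array [X_W, Y_W, Z_W]; gamma is the
    Gamma entry (default 1, as in Table 4.13).
    """
    a = min(max(a, 0.0), 1.0)      # assumption: out-of-range values are clamped
    decoded = a ** gamma           # "Decode A": the gamma function
    x_w, y_w, z_w = white_point
    # "Matrix A": scale the decoded value by the white point components.
    return (x_w * decoded, y_w * decoded, z_w * decoded)


d65 = (0.9505, 1.0000, 1.0890)
print(calgray_to_xyz(0.5, d65))               # Example 4.5: Gamma defaults to 1
print(calgray_to_xyz(0.5, d65, gamma=2.222))  # Example 4.6
```

With A = 1.0 the result is the white point itself, whatever the gamma, which is a quick sanity check for any implementation.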

CalRGB Color Spaces

A CalRGB color space is a CIE-based ABC color space with only one transforma-

tion stage instead of two. In this type of space, A, B, and C represent calibrated

red, green, and blue color values. These three color components must be in the

range 0.0 to 1.0; component values falling outside that range will be adjusted to

the nearest valid value without error indication. The decoding functions

(denoted by “Decode ABC” in Figure 4.14 on page 166) are gamma functions

whose coefﬁcients are speciﬁed by the

Gamma entry in the color space dictionary

(see Table 4.14). The transformation matrix denoted by “Matrix ABC” in

Figure 4.14 is deﬁned by the dictionary’s

Matrix entry. Since there is no second

transformation stage, “Decode LMN” and “Matrix LMN” are implicitly taken to

be identity transformations.

TABLE 4.14 Entries in a CalRGB color space dictionary

KEY         TYPE   VALUE

WhitePoint  array  (Required) An array of three numbers [$X_W$ $Y_W$ $Z_W$] specifying the tristimulus value, in the CIE 1931 XYZ space, of the diffuse white point; see below for further discussion. The numbers $X_W$ and $Z_W$ must be positive, and $Y_W$ must be equal to 1.0.

BlackPoint  array  (Optional) An array of three numbers [$X_B$ $Y_B$ $Z_B$] specifying the tristimulus value, in the CIE 1931 XYZ space, of the diffuse black point; see below for further discussion. All three of these numbers must be nonnegative. Default value: [0.0 0.0 0.0].

Gamma       array  (Optional) An array of three numbers [$G_R$ $G_G$ $G_B$] specifying the gamma for the red, green, and blue (A, B, and C) components of the color space. Default value: [1.0 1.0 1.0].

Matrix      array  (Optional) An array of nine numbers [$X_A$ $Y_A$ $Z_A$ $X_B$ $Y_B$ $Z_B$ $X_C$ $Y_C$ $Z_C$] specifying the linear interpretation of the decoded A, B, and C components of the color space with respect to the final XYZ representation. Default value: the identity matrix [1 0 0 0 1 0 0 0 1].


The WhitePoint and BlackPoint entries in the color space dictionary control the

overall effect of the CIE-based gamut mapping function described in Section 6.1,

“CIE-Based Color to Device Color.” Typically, the colors speciﬁed by

WhitePoint

and BlackPoint are mapped to the nearly lightest and nearly darkest achromatic

colors that the output device is capable of rendering in a way that preserves color

appearance and visual contrast.

WhitePoint is assumed to represent the diffuse achromatic highlight, not a specu-

lar highlight. Specular highlights, achromatic or otherwise, are often reproduced

lighter than the diffuse highlight.

BlackPoint is assumed to represent the diffuse

achromatic shadow; its value is typically limited by the dynamic range of the

input device. In images produced by a photographic system, the values of

WhitePoint and BlackPoint vary with exposure, system response, and artistic intent;

hence, their values are image-dependent.

The transformation defined by the Gamma and Matrix entries in the CalRGB color space dictionary is

$$X = L = X_A \times A^{G_R} + X_B \times B^{G_G} + X_C \times C^{G_B}$$

$$Y = M = Y_A \times A^{G_R} + Y_B \times B^{G_G} + Y_C \times C^{G_B}$$

$$Z = N = Z_A \times A^{G_R} + Z_B \times B^{G_G} + Z_C \times C^{G_B}$$

In other words, the A, B, and C components are first decoded individually by the gamma functions. The results are treated as a three-element vector and multiplied by Matrix (a 3-by-3 matrix) to obtain the L, M, and N components of the intermediate representation. Since there is no second stage, these are also the X, Y, and Z components of the final representation.

Example 4.7 shows an example of a CalRGB color space for the CCIR XA/11–recommended D65 white point with 1.8 gammas and Sony Trinitron phosphor chromaticities.

Example 4.7

[ /CalRGB

<< /WhitePoint [0.9505 1.0000 1.0890]

/Gamma [1.8000 1.8000 1.8000]

/Matrix [ 0.4497 0.2446 0.0252

0.3163 0.6720 0.1412

0.1845 0.0833 0.9227

]

>>

]

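As a rough illustration of the CalRGB transformation, the following Python sketch applies the Gamma and Matrix entries of Example 4.7 to a color value. The function name `calrgb_to_xyz` and the flat nine-element layout of the matrix argument (mirroring the order of the Matrix array in Table 4.14) are assumptions made for this sketch.

```python
def calrgb_to_xyz(abc, gamma=(1.0, 1.0, 1.0),
                  matrix=(1, 0, 0, 0, 1, 0, 0, 0, 1)):
    """Map CalRGB components (A, B, C) to CIE 1931 XYZ.

    matrix is ordered as in the Matrix entry:
    [X_A Y_A Z_A  X_B Y_B Z_B  X_C Y_C Z_C].
    """
    # Out-of-range components are adjusted to the nearest valid value,
    # then decoded individually by the gamma functions.
    a, b, c = (min(max(v, 0.0), 1.0) ** g for v, g in zip(abc, gamma))
    x_a, y_a, z_a, x_b, y_b, z_b, x_c, y_c, z_c = matrix
    return (
        x_a * a + x_b * b + x_c * c,
        y_a * a + y_b * b + y_c * c,
        z_a * a + z_b * b + z_c * c,
    )


# Values taken from Example 4.7.
GAMMA = (1.8, 1.8, 1.8)
MATRIX = (0.4497, 0.2446, 0.0252,
          0.3163, 0.6720, 0.1412,
          0.1845, 0.0833, 0.9227)

print(calrgb_to_xyz((1.0, 1.0, 1.0), GAMMA, MATRIX))
print(calrgb_to_xyz((0.5, 0.5, 0.5), GAMMA, MATRIX))
```

Feeding in (1.0, 1.0, 1.0) gives the sum of the three matrix rows, which agrees with the WhitePoint of Example 4.7 to within the rounding of the published values.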

In some cases, the parameters of a CalRGB color space may be specified in terms of the CIE 1931 chromaticity coordinates $(x_R, y_R)$, $(x_G, y_G)$, $(x_B, y_B)$ of the red, green, and blue phosphors, respectively, and the chromaticity $(x_W, y_W)$ of the diffuse white point corresponding to some linear RGB value (R, G, B), where usually R = G = B = 1.0. Note that standard CIE notation uses lowercase letters to specify chromaticity coordinates and uppercase letters to specify tristimulus values. Given this information, Matrix and WhitePoint can be found as follows:

$$z = y_W \times \bigl( (x_G - x_B) \times y_R - (x_R - x_B) \times y_G + (x_R - x_G) \times y_B \bigr)$$

$$Y_A = \frac{y_R}{R} \times \frac{(x_G - x_B) \times y_W - (x_W - x_B) \times y_G + (x_W - x_G) \times y_B}{z}$$

$$X_A = Y_A \times \frac{x_R}{y_R} \qquad Z_A = Y_A \times \left( \frac{1 - x_R}{y_R} - 1 \right)$$

$$Y_B = -\frac{y_G}{G} \times \frac{(x_R - x_B) \times y_W - (x_W - x_B) \times y_R + (x_W - x_R) \times y_B}{z}$$

$$X_B = Y_B \times \frac{x_G}{y_G} \qquad Z_B = Y_B \times \left( \frac{1 - x_G}{y_G} - 1 \right)$$

$$Y_C = \frac{y_B}{B} \times \frac{(x_R - x_G) \times y_W - (x_W - x_G) \times y_R + (x_W - x_R) \times y_G}{z}$$

$$X_C = Y_C \times \frac{x_B}{y_B} \qquad Z_C = Y_C \times \left( \frac{1 - x_B}{y_B} - 1 \right)$$

$$X_W = X_A \times R + X_B \times G + X_C \times B$$

$$Y_W = Y_A \times R + Y_B \times G + Y_C \times B$$

$$Z_W = Z_A \times R + Z_B \times G + Z_C \times B$$

Lab Color Spaces

A Lab color space is a CIE-based ABC color space with two transformation stages (see Figure 4.14 on page 166). In this type of space, A, B, and C represent the L*, a*, and b* components of a CIE 1976 L*a*b* space. The range of the first (L*) component is always 0 to 100. The ranges of the second and third (a* and b*) components are defined by the Range entry in the color space dictionary (see Table 4.15).

TABLE 4.15 Entries in a Lab color space dictionary

KEY         TYPE   VALUE

WhitePoint  array  (Required) An array of three numbers [$X_W$ $Y_W$ $Z_W$] specifying the tristimulus value, in the CIE 1931 XYZ space, of the diffuse white point; see “CalRGB Color Spaces” on page 169 for further discussion. The numbers $X_W$ and $Z_W$ must be positive, and $Y_W$ must be equal to 1.0.

BlackPoint  array  (Optional) An array of three numbers [$X_B$ $Y_B$ $Z_B$] specifying the tristimulus value, in the CIE 1931 XYZ space, of the diffuse black point; see “CalRGB Color Spaces” on page 169 for further discussion. All three of these numbers must be nonnegative. Default value: [0.0 0.0 0.0].

Range       array  (Optional) An array of four numbers [$a_{min}$ $a_{max}$ $b_{min}$ $b_{max}$] specifying the range of valid values for the a* and b* (B and C) components of the color space; that is, $a_{min} \le a^* \le a_{max}$ and $b_{min} \le b^* \le b_{max}$. Component values falling outside the specified range will be adjusted to the nearest valid value without error indication. Default value: [−100 100 −100 100].

A Lab color space does not specify explicit decoding functions or matrix coefficients for either stage of the transformation from L*a*b* space to XYZ space (denoted by “Decode ABC,” “Matrix ABC,” “Decode LMN,” and “Matrix LMN” in Figure 4.14 on page 166). Instead, these parameters have constant implicit values. The first transformation stage is defined by the equations

$$L = \frac{L^* + 16}{116} + \frac{a^*}{500}$$

$$M = \frac{L^* + 16}{116}$$

$$N = \frac{L^* + 16}{116} - \frac{b^*}{200}$$

The second transformation stage is given by

$$X = X_W \times g(L)$$

$$Y = Y_W \times g(M)$$

$$Z = Z_W \times g(N)$$

where the function g(x) is defined as

$$g(x) = x^3 \quad \text{if } x \ge \frac{6}{29}$$

$$g(x) = \frac{108}{841} \times \left( x - \frac{4}{29} \right) \quad \text{otherwise}$$

Example 4.8 defines the CIE 1976 L*a*b* space with the CCIR XA/11–recommended D65 white point. The a* and b* components, although theoretically unbounded, are defined to lie in the useful range −128 to +127.

Example 4.8

[/Lab

<< /WhitePoint [0.9505 1.0000 1.0890]

/Range [-128 127 -128 127]

>>

]

ICCBased Color Spaces

ICCBased color spaces (PDF 1.3) are based on a cross-platform color profile as defined by the International Color Consortium (ICC). Unlike the CalGray, CalRGB, and Lab color spaces, which are characterized by entries in the color space dictionary, an ICCBased color space is characterized by a sequence of bytes in a standard format. Details of the profile format can be found in the ICC specification (see the Bibliography).

An ICCBased color space is specified as an array:

[/ICCBased stream]


The stream contains the ICC proﬁle. Besides the usual entries common to all

streams (see Table 3.4 on page 35), the proﬁle stream has the additional entries

listed in Table 4.16.

TABLE 4.16 Entries in an ICC profile stream dictionary

KEY        TYPE           VALUE

N          integer        (Required) The number of color components in the color space described by the ICC profile data. This number must match the number of components actually in the ICC profile. In PDF 1.3, N must be 1, 3, or 4.

Alternate  array or name  (Optional) An alternate color space to be used in case the one specified in the stream data is not supported (for example, by viewer applications designed for earlier versions of PDF). The alternate space may be any valid color space, except a Pattern color space, that has the number of components specified by N. If this entry is omitted and the viewer application does not understand the ICC profile data, the color space used will be DeviceGray, DeviceRGB, or DeviceCMYK, depending on whether the value of N is 1, 3, or 4, respectively.

                          Note that there is no conversion of color values, such as a tint transformation, when using the alternate color space. Color values that are within the range of the ICCBased color space might not be within the range of the alternate color space. In this case, the nearest values within the range of the alternate space will be substituted.

Range      array          (Optional) An array of 2 × N numbers [$min_0$ $max_0$ $min_1$ $max_1$ …] specifying the minimum and maximum valid values of the corresponding color components. These values must match the information in the ICC profile. Default value: [0.0 1.0 0.0 1.0 …].

The ICC specification is an evolving standard. The ICCBased color spaces supported

in PDF 1.3 are based on version 3.3 of the ICC specification. Earlier versions

of the ICC specification are also supported. (The version number is

available in the ICC profile’s header.)

PDF 1.3 supports only the proﬁle types shown in Table 4.17; other types may be

supported in the future. (In particular, note that XYZ and 16-bit L*a*b* proﬁles

are not supported.) Each of the indicated ﬁelds must have one of the values listed

for that ﬁeld in the second column of the table. (Proﬁles must satisfy both the cri-

teria shown in the table.) The terminology is taken from the ICC speciﬁcations.

TABLE 4.17 ICC profile types

HEADER FIELD   REQUIRED VALUE

deviceClass    icSigInputClass ('scnr')
               icSigDisplayClass ('mntr')
               icSigOutputClass ('prtr')
               icSigColorSpaceClass ('spac')

colorSpace     icSigGrayData ('GRAY')
               icSigRgbData ('RGB ')
               icSigCmykData ('CMYK')
               icSigLabData ('Lab ')

The terminology used in PDF color spaces and ICC color proﬁles is similar, but

sometimes the same terms are used with different meanings. For example, the

default value for each component in an

ICCBased color space is 0. The range of

each color component is a function of the color space speciﬁed by the proﬁle and

is indicated in the ICC speciﬁcation. The ranges for several ICC color spaces are

shown in Table 4.18.

TABLE 4.18 Ranges for typical ICC color spaces

ICC COLOR SPACE COMPONENT RANGES

Gray [0.0 1.0]

RGB [0.0 1.0]

CMYK [0.0 1.0]

L*a*b* L*: [0 100]; a* and b*: [−128 127]
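The L*a*b* row of Table 4.18 uses the same component ranges as Example 4.8. As a reminder of what such component values mean, here is a minimal Python sketch of the two-stage transformation given under “Lab Color Spaces”. It is not a definitive implementation: the function name `lab_to_xyz`, its argument layout, and the `clamp` helper are assumptions made for illustration, and an ICC profile defines its own conversion, which this sketch does not model.

```python
def clamp(value, low, high):
    """Adjust value to the nearest valid value in [low, high]."""
    return min(max(value, low), high)


def g(x):
    """Second-stage decoding function of a Lab color space."""
    if x >= 6 / 29:
        return x ** 3
    return (108 / 841) * (x - 4 / 29)


def lab_to_xyz(l_star, a_star, b_star, white_point,
               lab_range=(-100, 100, -100, 100)):
    """Map (L*, a*, b*) to CIE 1931 XYZ for a Lab color space.

    white_point is the WhitePoint array; lab_range is the Range array
    [a_min a_max b_min b_max] with the default from Table 4.15.
    """
    a_min, a_max, b_min, b_max = lab_range
    l_star = clamp(l_star, 0, 100)      # L* always ranges from 0 to 100
    a_star = clamp(a_star, a_min, a_max)
    b_star = clamp(b_star, b_min, b_max)

    # First stage: L*, a*, b* to the intermediate L, M, N components.
    m = (l_star + 16) / 116
    l = m + a_star / 500
    n = m - b_star / 200

    # Second stage: apply g and scale by the white point.
    x_w, y_w, z_w = white_point
    return (x_w * g(l), y_w * g(m), z_w * g(n))


d65 = (0.9505, 1.0000, 1.0890)
print(lab_to_xyz(100, 0, 0, d65))
print(lab_to_xyz(50, 20, -30, d65, lab_range=(-128, 127, -128, 127)))
```

The two branches of g meet at x = 6/29, so the mapping is continuous; L* = 100 with a* = b* = 0 reproduces the white point.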

Since the ICCBased color space is being used as a source color space, only the “to

CIE” profile information (AToB in ICC terminology) is used; the “from CIE”

(BToA) information is ignored when present. Additionally, an ICC proﬁle may

specify a rendering intent; however, a PDF viewer application ignores this infor-

mation. The rendering intent is speciﬁed in PDF by a separate parameter; see

“Rendering Intents” on page 179.

The representations of ICCBased color spaces are less compact than CalGray,

CalRGB, and Lab, but can represent a wider range of color spaces. In those cases

where a given color space can be expressed by more than one of the CIE-based

color space families, the resulting colors are expected to be rendered similarly,

regardless of the method selected for representation.

One particular color space is the so-called “standard RGB” or sRGB, deﬁned in

the International Electrotechnical Commission (IEC) document Colour Measure-

ment and Management in Multimedia Systems and Equipment (see the Bibliogra-

phy). In PDF, the sRGB color space can be expressed precisely only as an

ICCBased color space, although it can be approximated by a CalRGB color space.
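To make the CalRGB approximation concrete, the sketch below applies the chromaticity formulas from “CalRGB Color Spaces” to derive Matrix and WhitePoint values. Several things here are assumptions rather than part of this book: the function name `calrgb_from_chromaticities`, and the chromaticities used in the usage lines, which are the commonly cited sRGB primaries and D65 white point. A matching Gamma entry would still have to be chosen separately, since the sRGB transfer curve is not a pure power function.

```python
def calrgb_from_chromaticities(red, green, blue, white, rgb=(1.0, 1.0, 1.0)):
    """Derive (Matrix, WhitePoint) from chromaticity coordinates.

    red, green, blue, and white are (x, y) pairs; rgb is the linear RGB
    value that corresponds to the diffuse white point.
    """
    (x_r, y_r), (x_g, y_g), (x_b, y_b), (x_w, y_w) = red, green, blue, white
    r, g, b = rgb

    z = y_w * ((x_g - x_b) * y_r - (x_r - x_b) * y_g + (x_r - x_g) * y_b)

    Y_A = y_r / r * ((x_g - x_b) * y_w - (x_w - x_b) * y_g
                     + (x_w - x_g) * y_b) / z
    X_A = Y_A * x_r / y_r
    Z_A = Y_A * ((1 - x_r) / y_r - 1)

    Y_B = -y_g / g * ((x_r - x_b) * y_w - (x_w - x_b) * y_r
                      + (x_w - x_r) * y_b) / z
    X_B = Y_B * x_g / y_g
    Z_B = Y_B * ((1 - x_g) / y_g - 1)

    Y_C = y_b / b * ((x_r - x_g) * y_w - (x_w - x_g) * y_r
                     + (x_w - x_r) * y_g) / z
    X_C = Y_C * x_b / y_b
    Z_C = Y_C * ((1 - x_b) / y_b - 1)

    matrix = (X_A, Y_A, Z_A, X_B, Y_B, Z_B, X_C, Y_C, Z_C)
    white_point = (
        X_A * r + X_B * g + X_C * b,
        Y_A * r + Y_B * g + Y_C * b,
        Z_A * r + Z_B * g + Z_C * b,
    )
    return matrix, white_point


# Assumed inputs: commonly cited sRGB primaries and D65 white chromaticity.
matrix, white_point = calrgb_from_chromaticities(
    red=(0.64, 0.33), green=(0.30, 0.60), blue=(0.15, 0.06),
    white=(0.3127, 0.3290),
)
print([round(v, 4) for v in matrix])
print([round(v, 4) for v in white_point])
```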

Example 4.9 shows an

ICCBased color space for a typical three-component RGB

space. The proﬁle’s data has been encoded in hexadecimal representation for

readability; in actual practice, a lossless decompression ﬁlter such as

FlateDecode

should be used.

Example 4.9

10 0 obj % Color space

[/ICCBased 15 0 R]

endobj

15 0 obj % ICC proﬁle stream

<< /N 3

/Alternate /DeviceRGB

/Length 1605

/Filter /ASCIIHexDecode

>>

stream

00 00 02 0C 61 70 70 6C 02 00 00 00 6D 6E 74 72

52 47 42 20 58 59 5A 20 07 CB 00 02 00 16 00 0E

00 22 00 2C 61 63 73 70 41 50 50 4C 00 00 00 00

61 70 70 6C 00 00 04 01 00 00 00 00 00 00 00 02

00 00 00 00 00 00 F6 D4 00 01 00 00 00 00 D3 2B

00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00

00 00 00 09 64 65 73 63 00 00 00 F0 00 00 00 71

72 58 59 5A 00 00 01 64 00 00 00 14 67 58 59 5A

00 00 01 78 00 00 00 14 62 58 59 5A 00 00 01 8C

00 00 00 14 72 54 52 43 00 00 01 A0 00 00 00 0E

67 54 52 43 00 00 01 B0 00 00 00 0E 62 54 52 43

00 00 01 C0 00 00 00 0E 77 74 70 74 00 00 01 D0

00 00 00 14 63 70 72 74 00 00 01 E4 00 00 00 27

64 65 73 63 00 00 00 00 00 00 00 17 41 70 70 6C

65 20 31 33 22 20 52 47 42 20 53 74 61 6E 64 61

72 64 00 00 00 00 00 00 00 00 00 00 00 17 41 70

70 6C 65 20 31 33 22 20 52 47 42 20 53 74 61 6E

64 61 72 64 00 00 00 00 00 00 00 00 00 00 00 00

00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00

00 58 59 5A 58 59 5A 20 00 00 00 00 00 00 63 0A

00 00 35 0F 00 00 03 30 58 59 5A 20 00 00 00 00

00 00 53 3D 00 00 AE 37 00 00 15 76 58 59 5A 20

00 00 00 00 00 00 40 89 00 00 1C AF 00 00 BA 82

63 75 72 76 00 00 00 00 00 00 00 01 01 CC 63 75

63 75 72 76 00 00 00 00 00 00 00 01 01 CC 58 59

58 59 5A 20 00 00 00 00 00 00 F3 1B 00 01 00 00

00 01 67 E7 74 65 78 74 00 00 00 00 20 43 6F 70

79 72 69 67 68 74 20 41 70 70 6C 65 20 43 6F 6D

70 75 74 65 72 73 20 31 39 39 34 00

endstream

endobj
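The constraints in Tables 4.16 and 4.17 can be checked against the first bytes of the profile in Example 4.9. The Python sketch below is illustrative only. It assumes the ICC header layout (profile size at byte offset 0, device class at offset 12, color space at offset 16), which comes from the ICC specification rather than from this book, and the function names `check_icc_header` and `fallback_color_space` are hypothetical.

```python
import struct

# Signatures permitted by Table 4.17.
ALLOWED_CLASSES = {b"scnr", b"mntr", b"prtr", b"spac"}
# Number of components implied by each permitted colorSpace signature.
COMPONENTS = {b"GRAY": 1, b"RGB ": 3, b"CMYK": 4, b"Lab ": 3}
# Device space used when Alternate is absent and the profile is not understood.
FALLBACK = {1: "DeviceGray", 3: "DeviceRGB", 4: "DeviceCMYK"}


def check_icc_header(profile, n):
    """Validate the deviceClass and colorSpace header fields against N."""
    if len(profile) < 20:
        raise ValueError("profile data too short for an ICC header")
    # Assumed layout: size, preferred CMM, version, device class, color space.
    size, _cmm, _version, device_class, color_space = struct.unpack(
        ">I4s4s4s4s", profile[:20])
    if device_class not in ALLOWED_CLASSES:
        raise ValueError(f"unsupported deviceClass {device_class!r}")
    if color_space not in COMPONENTS:
        raise ValueError(f"unsupported colorSpace {color_space!r}")
    if COMPONENTS[color_space] != n:
        raise ValueError("N does not match the profile's color space")
    return {"size": size, "device_class": device_class,
            "color_space": color_space}


def fallback_color_space(stream_dict):
    """Color space to use when the ICC profile data is not understood."""
    return stream_dict.get("Alternate", FALLBACK[stream_dict["N"]])


# First 20 bytes of the profile data shown in Example 4.9.
header = bytes.fromhex(
    "00 00 02 0C 61 70 70 6C 02 00 00 00 6D 6E 74 72 52 47 42 20")
print(check_icc_header(header, n=3))
print(fallback_color_space({"N": 3}))
```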

Default Color Spaces

Specifying colors in a device color space (DeviceGray, DeviceRGB, or DeviceCMYK)

makes them device-dependent. By setting default color spaces (PDF 1.1), a

PDF document can request that such colors be systematically transformed into

device-independent CIE-based color spaces. This capability can be useful in a va-

riety of circumstances, such as the following:

• A document originally intended for one output device is redirected to a differ-

ent device.

• A document is intended to be compatible with viewer applications designed for

earlier versions of PDF, and thus cannot specify CIE-based colors directly.

• Color corrections or rendering intents need to be applied to device colors (see

“Rendering Intents” on page 179).

A color space is selected for painting each graphics object. This is either the cur-

rent color space parameter in the graphics state or a color space given as an entry

in an image XObject, in-line image, or shading dictionary. Regardless of how the

color space is speciﬁed, it may be subject to remapping as described below.

When a device color space is selected, the

ColorSpace subdictionary of the cur-

rent resource dictionary (see Section 3.7.2, “Resource Dictionaries”) is checked

for the presence of an entry designating a corresponding default color space

(DefaultGray, DefaultRGB, or DefaultCMYK, corresponding to DeviceGray,

DeviceRGB, or DeviceCMYK, respectively). If such an entry is present, its value is

used as the color space for the operation currently being performed. (If the view-

er application does not recognize this color space, no remapping will occur; the

original device color space will be used.)

Color values in the original device color space are passed unchanged to the

default color space, which must have the same number of components as the

original space. The default color space should be chosen to be compatible with

the original, taking into account the components’ ranges and whether the com-

ponents are additive or subtractive. If a color value lies outside the range of the

default color space, it will be adjusted to the nearest valid value.

Note: Any color space other than a

Lab, Indexed, or Pattern color space may be used

as a default color space, provided that it is compatible with the original device color

space as described above.

If the selected space is a special color space based on an underlying device color

space, the default color space will be used in place of the underlying space. This

applies to the following:

• The base color space of an Indexed color space

• The underlying color space of a Pattern color space

• The alternate color space of a Separation or DeviceN color space (but only if

the alternate color space is actually selected)

See Section 4.5.5, “Special Color Spaces,” for details on these color spaces.
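The remapping rule for a directly selected device color space can be sketched as follows. This is a simplified illustration, not viewer code: it ignores the special color spaces listed above, models the resource dictionary as a plain dict, and the names `remap_device_space`, `clamp_color`, and the `is_supported` callback are hypothetical.

```python
DEFAULT_KEY = {
    "DeviceGray": "DefaultGray",
    "DeviceRGB": "DefaultRGB",
    "DeviceCMYK": "DefaultCMYK",
}


def remap_device_space(space, resources, is_supported=lambda cs: True):
    """Return the color space actually used when `space` is selected."""
    key = DEFAULT_KEY.get(space)
    if key is None:
        return space                    # not a device color space
    default = resources.get("ColorSpace", {}).get(key)
    if default is None or not is_supported(default):
        return space                    # no remapping occurs
    return default


def clamp_color(color, ranges):
    """Pass values through unchanged, adjusting only out-of-range ones."""
    return tuple(min(max(v, lo), hi) for v, (lo, hi) in zip(color, ranges))


resources = {
    "ColorSpace": {
        "DefaultRGB": ["CalRGB", {"WhitePoint": [0.9505, 1.0, 1.089]}],
    }
}
print(remap_device_space("DeviceRGB", resources))
print(remap_device_space("DeviceGray", resources))
print(clamp_color((1.2, -0.1, 0.5), [(0.0, 1.0)] * 3))
```

Note that the color values themselves are not converted; only the interpretation of the unchanged numbers differs once the default color space takes effect.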

Note: There is no conversion of color values, such as a tint transformation,

when using the default color space. Color values that are within the range of the

device color space might not be within the range of the default color space (particu-

larly if the default is an

ICCBased color space). In this case, the nearest values within

the range of the default space will be used. For this reason, a

Lab color space is not

permitted as the

DefaultRGB color space.

Rendering Intents

Although CIE-based color speciﬁcations are theoretically device-independent,

they are subject to practical limitations in the color reproduction capabilities of

the output device. Such limitations may sometimes require compromises to be

made among various properties of a color speciﬁcation when rendering colors for

a given device. Specifying a rendering intent (PDF 1.1) allows a PDF ﬁle to set

priorities regarding which of these properties to preserve and which to sacriﬁce.

For example, the PDF ﬁle might request that colors falling within the output

device’s gamut (the range of colors it can reproduce) be rendered exactly while

sacriﬁcing the accuracy of out-of-gamut colors, or that a scanned image such as a

photograph be rendered in a perceptually “pleasing” manner at the cost of strict

colorimetric accuracy.

Rendering intents are speciﬁed with the

ri operator and with the Intent entry in

image dictionaries. The value is a name identifying the desired rendering intent.

Table 4.19 lists the standard rendering intents recognized in the initial release of

PDF viewer applications from Adobe Systems. These have been deliberately

chosen to correspond closely to the rendering intents deﬁned by the International

Color Consortium (ICC), an industry organization that has developed standards

for device-independent color. Note, however, that the exact set of rendering in-

tents supported may vary from one output device to another; a particular device

may not support all possible intents, or may support additional ones beyond

those listed in the table. If the viewer application does not recognize the speciﬁed

name, it uses the

RelativeColorimetric intent by default. (See implementation

note 29 in Appendix H.)
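The fallback rule for unrecognized names can be sketched in Python. This is only an illustration, not viewer code: the helper name `resolve_intent` and the `recognized` parameter are assumptions made for the example; only the four intent names and the RelativeColorimetric default come from the text.

```python
# Standard rendering intent names, as listed in Table 4.19.
STANDARD_INTENTS = frozenset({
    "AbsoluteColorimetric",
    "RelativeColorimetric",
    "Saturation",
    "Perceptual",
})


def resolve_intent(name, recognized=STANDARD_INTENTS):
    """Hypothetical helper: return the intent a viewer would actually use.

    `recognized` stands for whatever set of names a particular viewer or
    output device understands; it may differ from the standard set.
    """
    if name in recognized:
        return name
    # Unrecognized names fall back to the default stated in the text.
    return "RelativeColorimetric"
```

A name such as `Perceptual` is returned unchanged, while any name outside the recognized set resolves to `RelativeColorimetric`.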

GraphicsCHAPTER 4

180

TABLE 4.19 Rendering intents

NAME DESCRIPTION

AbsoluteColorimetric Colors are represented solely with respect to the light source; no

correction is made for the output medium’s white point (such as

the color of unprinted paper). Thus, for example, a monitor’s

white point, which is bluish compared to that of a printer’s

paper, would be reproduced with a blue cast. In-gamut colors

are reproduced exactly; out-of-gamut colors are mapped to the

nearest value within the reproducible gamut. This style of

reproduction has the advantage of providing exact color

matches from one output medium to another. It has the

disadvantage of causing colors with Y values between the

medium’s white point and 1.0 to be out of gamut. A typical use

might be for logos and solid colors that require exact

reproduction across different media.

RelativeColorimetric Colors are represented with respect to the combination of the

light source and the output medium’s white point (such as the

color of unprinted paper). Thus, for example, a monitor’s white

point would be reproduced on a printer by simply leaving the

paper unmarked, ignoring color differences between the two

media. In-gamut colors are reproduced exactly; out-of-gamut

colors are mapped to the nearest value within the reproducible

gamut. This style of reproduction has the advantage of adapting

for the varying white points of different output media. It has the

disadvantage of not providing exact color matches from one

medium to another. A typical use might be for vector graphics.

Saturation Colors are represented in a manner that preserves or emphasizes

saturation. Reproduction of in-gamut colors may or may not be

colorimetrically accurate. A typical use might be for business

graphics, where saturation is the most important attribute of the

color.

Perceptual Colors are represented in a manner that provides a pleasing

perceptual appearance. This generally means that both in-gamut

and out-of-gamut colors are modiﬁed from their precise

colorimetric values in order to preserve color relationships. A

typical use might be for scanned images.

4.5.5 Special Color Spaces

Special color spaces add features or properties to an underlying color space.

There are four special color space families:

Pattern, Indexed, Separation, and

DeviceN.

Pattern Color Spaces

A Pattern color space (PDF 1.2) enables a PDF content stream to paint an area

with a “color” defined as a pattern, which may be either a tiling pattern

(PatternType 1) or a shading pattern (PatternType 2). Section 4.6, “Patterns,”

discusses patterns in detail.

Indexed Color Spaces

An Indexed color space allows a PDF content stream to select from a color map or

color table of arbitrary colors in some other space, using small integers as indices.

A PDF viewer application treats each sample value as an index into the color table

and uses the color value it ﬁnds there. This technique can considerably reduce the

amount of data required to represent a sampled image—for example, by using

8-bit index values as samples instead of 24-bit RGB color values.

An Indexed color space is defined by a four-element array, as follows:

[/Indexed base hival lookup]

The ﬁrst element is the color space family name Indexed. The remaining ele-

ments are parameters that an

Indexed color space requires; their meanings are

discussed below. When the color space is set to an

Indexed color space, the cur-

rent color is set to 0.

The

base parameter is an array or name that identiﬁes the base color space in

which the values in the color table are to be interpreted. It can be any device or

CIE-based color space or (in PDF 1.3) a

Separation or DeviceN space, but not a

Pattern space or another Indexed space. For example, if the base color space is

DeviceRGB, the values in the color table are to be interpreted as red, green, and

blue components; if the base color space is a CIE-based ABC space such as a

CalRGB or Lab space, the values are to be interpreted as A, B, and C components.

Note: Attempting to use a Separation or DeviceN color space as the base for an

Indexed color space will generate an error in PDF 1.2.

The

hival parameter is an integer that speciﬁes the maximum valid index value. In

other words, the color table is to be indexed by integers in the range 0 to

hival.

hival can be no greater than 255, which is what would be required to index a table

with 8-bit index values.

The color table is deﬁned by the

lookup parameter, which can be either a stream

or (in PDF 1.2) a string. It provides the mapping between index values and the

corresponding colors in the base color space.

The color table data must be m × (

hival + 1) bytes long, where m is the number of

color components in the base color space. Each byte is an unsigned integer in the

range 0 to 255 that is scaled to the range of the corresponding color component

in the base color space; that is, 0 corresponds to the minimum value in the range

for that component, and 255 corresponds to the maximum value in the range.
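The length rule and the byte scaling described above can be written out as a short Python sketch. The function names are hypothetical, and the range of -100 to 100 in the usage note is an assumed example of a component range, not a value taken from this section.

```python
def check_table_length(table, hival, m):
    """Return True if the table holds exactly m * (hival + 1) bytes.

    table -- the decoded lookup data (bytes)
    hival -- maximum valid index value (0..255)
    m     -- number of color components in the base color space
    """
    return len(table) == m * (hival + 1)


def scale_byte(b, lo, hi):
    """Map a table byte (0..255) linearly onto the component range [lo, hi].

    0 yields the minimum of the range and 255 yields the maximum.
    """
    return lo + (b / 255.0) * (hi - lo)
```

For a three-component base space with `hival` equal to 255, the table must be 768 bytes long. For a component whose range is assumed to be -100 to 100, `scale_byte(0, -100.0, 100.0)` gives the minimum and `scale_byte(255, -100.0, 100.0)` gives the maximum.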

Note: This is different from the interpretation of an

Indexed color space’s color table

in PostScript. In PostScript, the component value is always scaled to the range 0.0 to

1.0, regardless of the range of color values in the base color space.

The color components for each entry in the table appear consecutively in the

string or stream. For example, if the base color space is

DeviceRGB and the

indexed color space contains two colors, the order of bytes in the string or stream

is R₀ G₀ B₀ R₁ G₁ B₁, where letters denote the color component and numeric

subscripts denote the table entry.

Example 4.10 illustrates the speciﬁcation of an

Indexed color space that maps

8-bit index values to three-component color values in the

DeviceRGB color space.

Example 4.10

[ /Indexed

/DeviceRGB

255

<000000 FF0000 00FF00 0000FF B57342 …>

]

The example shows only the ﬁrst ﬁve color values in the lookup string; in all, there

should be 256 color values and the string should be 768 bytes long. Having

established this color space, the program can now specify colors using single-

component values in the range 0 to 255. For example, a color value of 4 selects an

RGB color whose components are coded as the hexadecimal integers

B5, 73, and

42. Dividing these by 255 and scaling the results to the range 0.0 to 1.0 yields a

color with red, green, and blue components of 0.710, 0.451, and 0.259, respec-

tively.
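The arithmetic in this paragraph can be checked with a few lines of Python. The sketch below is illustrative only: `indexed_lookup` is a hypothetical helper, and the table contains just the five entries that Example 4.10 actually shows (the remaining entries are elided there and are not invented here).

```python
def indexed_lookup(table, index, m=3):
    """Return the m components stored for `index`, scaled to 0.0..1.0.

    Assumes a base space such as DeviceRGB whose components all range
    from 0.0 to 1.0, so each byte is simply divided by 255.
    """
    entry = table[index * m:(index + 1) * m]
    if len(entry) != m:
        raise IndexError("index is beyond the end of the lookup table")
    return tuple(byte / 255.0 for byte in entry)


# The first five color values from the lookup string in Example 4.10.
table = bytes.fromhex("000000 FF0000 00FF00 0000FF B57342")

red, green, blue = indexed_lookup(table, 4)
print(round(red, 3), round(green, 3), round(blue, 3))
```

Index 4 selects the bytes B5, 73, and 42, which scale to approximately 0.710, 0.451, and 0.259.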

Although an

Indexed color space is useful mainly for images, index values can

also be used with the color selection operators

sc, scn, SC, and SCN. For example,

123 sc

selects the same color as does an image sample value of 123. The index value

should be an integer in the range 0 to

hival. If it is a real number, it is rounded to

the nearest integer; if it is outside the range 0 to

hival, it is adjusted to the nearest

value within that range.
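A minimal sketch of this adjustment follows. The helper name `adjust_index` is hypothetical, and the treatment of exact halves (rounded upward here) is an assumption, since the text says only "rounded to the nearest integer".

```python
import math


def adjust_index(value, hival):
    """Round a numeric operand to the nearest integer, then clamp to 0..hival."""
    nearest = math.floor(value + 0.5)  # assumption: exact halves round upward
    return max(0, min(hival, nearest))
```

With `hival` equal to 255, an operand of 3.6 selects entry 4, an operand of -2 selects entry 0, and an operand of 300.2 selects entry 255.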

Separation Color Spaces

Color output devices produce full color by combining primary or process

colorants in varying amounts. On a display, the primary colorants consist of red,

green, and blue phosphors; on a printer, they typically consist of cyan, magenta,

yellow, and sometimes black inks. In addition, some devices can apply special

colorants, often called spot colorants, to produce effects that cannot be achieved

with the standard process colorants alone. Examples include metallic and ﬂuores-

cent colors and special textures.

When printing a page, most devices produce a single composite page on which all

process colorants (and spot colorants, if any) are combined. However, some

devices, such as imagesetters, produce a separate, monochromatic rendition of

the page, called a separation, for each individual colorant. When the separations

are later combined—on a printing press, for example—and the proper inks or

other colorants are applied to them, a full-color page results.

A Separation color space (PDF 1.2) provides a means for specifying the use of

additional colorants or for isolating the control of individual color components

of a device color space. When such a space is the current color space, the current

color is a single-component value, called a tint, that controls the application of

the given colorant or color component only.

Note: The term separation is often misused as a synonym for an individual device

colorant. In the context of this discussion, a printing system that produces separa-

tions generates a separate piece of physical medium (generally ﬁlm) for each color-

ant. It is these pieces of physical medium that are correctly referred to as separations.

A particular colorant properly constitutes a separation only if the device is generating

physical separations, one of which corresponds to the given colorant. The

Separation

color space is so named for historical reasons, but it has evolved to the broader pur-

pose of controlling the application of individual colorants in general, whether or not

they are actually realized as physical separations.

Note also that the operation of a

Separation color space itself is independent of the

characteristics of any particular output device. Depending on the device, the space

may or may not correspond to a true, physical separation or to an actual colorant.

For example, a

Separation color space could be used to control the application of a

single process colorant (such as cyan) on a composite device that does not produce

physical separations, or could represent a color (such as orange) for which no speciﬁc

colorant exists on the device. A

Separation color space provides consistent, predict-

able behavior, even on devices that cannot directly generate the requested color.

A Separation color space is defined as follows:

[/Separation name alternateSpace tintTransform]

In other words, it is a four-element array whose ﬁrst element is the color space

family name

Separation. The remaining elements are parameters that a

Separation color space requires; their meanings are discussed below.

A color value in a

Separation color space consists of a single tint component in

the range 0.0 to 1.0. The value 0.0 represents the minimum amount of colorant

that can be applied; 1.0 represents the maximum. Tints are always treated as

subtractive colors, even if the device produces output for the designated compo-

nent by an additive method. Thus a tint value of 0.0 denotes the lightest color

that can be achieved with the given colorant, and 1.0 the darkest. (Note that this

is the same as the convention for

DeviceCMYK color components, but opposite to

the one for

DeviceGray and DeviceRGB.) The scn and SCN operators respectively

set the current ﬁll and stroke color in the graphics state to a tint value; the initial

value in either case is 1.0. A sampled image with single-component samples can

also be used as a source of tint values.

The name parameter in the color space array is a name object specifying the name

of the colorant that this

Separation color space is intended to represent (or one of

the special names

All or None; see below). Such colorant names are arbitrary, and

there can be any number of them, subject to implementation limits.

The special colorant name

All refers collectively to all colorants available on an

output device, including those for the standard process colorants. When a

Separation space with this colorant name is the current color space, painting

operators apply tint values to all available colorants at once. This is useful for

purposes such as painting registration marks in the same place on every separa-

tion. Such marks would typically be painted as the last step in composing a page,

to ensure that they are not overwritten by subsequent painting operations.

The special colorant name

None will never produce any visible output. Painting

operations in a

Separation space with this colorant name have no effect on the

current page.

All devices support

Separation color spaces with the colorant names All and

None, even if they do not support any others. Separation spaces with either of

these colorant names ignore the

alternateSpace and tintTransform parameters (dis-

cussed below), although valid values must still be provided.

At the moment the color space is set to a

Separation space, the viewer application

determines whether the device has an available colorant corresponding to the

name of the requested space. If so, the application ignores the

alternateSpace and

tintTransform parameters; subsequent painting operations within the space will

apply the designated colorant directly, according to the tint values supplied.

If the colorant name associated with a

Separation color space does not cor-

respond to a colorant available on the device, the viewer application arranges

instead for subsequent painting operations to be performed in an alternate color

space. This enables the intended colors to be approximated by colors in some

device or CIE-based color space, which are then rendered using the usual prima-

ry or process colorants. This works as follows:

• The alternateSpace parameter must be an array or name object that identiﬁes

the alternate color space. This can be any device or CIE-based color space, but

not another special color space (

Pattern, Indexed, Separation, or DeviceN).

• The tintTransform parameter must be a function (see Section 3.9, “Functions”).

During subsequent painting operations, a viewer application will call this

function to transform a tint value into color component values in the alternate

color space. The function is called with the tint value and must return the cor-

responding color component values. That is, the number of components and

the interpretation of their values depend on the alternate color space.
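The decision described above (apply the colorant directly if the device has it, otherwise go through the alternate color space) can be sketched as follows. This is a simplified model, not an implementation of any viewer: `paint_separation`, the tuple it returns, and the way the device's colorants are passed in are all assumptions made for illustration.

```python
def paint_separation(name, tint, device_colorants, tint_transform):
    """Describe how a single tint would be applied (hypothetical model).

    Returns a (target, values) pair:
      ("nothing", ())          -- colorant None: no visible output
      ("all colorants", (t,))  -- colorant All: tint applied to every colorant
      (name, (t,))             -- the device has the named colorant
      ("alternate", values)    -- components in the alternate color space
    """
    if name == "None":
        return ("nothing", ())
    if name == "All":
        return ("all colorants", (tint,))
    if name in device_colorants:
        # alternateSpace and tintTransform are ignored in this case.
        return (name, (tint,))
    # No such colorant: transform the tint into the alternate color space.
    return ("alternate", tuple(tint_transform(tint)))
```

The number of values returned in the last case depends on the alternate color space, for example four for DeviceCMYK.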

Example 4.11 illustrates the speciﬁcation of a

Separation color space (object 5)

that is intended to produce a color named

LogoGreen. If the output device has no

colorant corresponding to this color,

DeviceCMYK will be used as the alternate

color space; the tint transformation function provided (object 12) maps tint

values linearly into shades of a CMYK color value approximating the “logo green”

color.

Example 4.11

5 0 obj % Color space

[ /Separation

/LogoGreen

/DeviceCMYK

12 0 R

]

endobj

12 0 obj % Tint transformation function

<< /FunctionType 4

/Domain [0.0 1.0]

/Range [0.0 1.0 0.0 1.0 0.0 1.0 0.0 1.0]

/Length 62

>>

stream

{ dup 0.84 mul

exch 0.00 exch dup 0.44 mul

exch 0.21 mul

}

endstream

endobj
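To see what the tint transformation function in Example 4.11 computes, the following Python sketch evaluates it with a very small stack machine. It is an assumption-laden illustration: `run_calculator` is a hypothetical helper that understands only numbers and the three operators used in this example (dup, exch, mul), whereas real type 4 functions support many more operators.

```python
def run_calculator(program, *inputs):
    """Evaluate a tiny subset of a type 4 function on the given inputs."""
    stack = list(inputs)
    for token in program.replace("{", " ").replace("}", " ").split():
        if token == "dup":
            stack.append(stack[-1])          # duplicate the top element
        elif token == "exch":
            stack[-1], stack[-2] = stack[-2], stack[-1]  # swap top two
        elif token == "mul":
            b = stack.pop()
            a = stack.pop()
            stack.append(a * b)              # multiply top two
        else:
            stack.append(float(token))       # numeric literal
    return tuple(stack)


# The function body from Example 4.11.
LOGO_GREEN = "{ dup 0.84 mul exch 0.00 exch dup 0.44 mul exch 0.21 mul }"

cmyk_full = run_calculator(LOGO_GREEN, 1.0)  # full tint
cmyk_none = run_calculator(LOGO_GREEN, 0.0)  # no tint
```

A tint of 1.0 yields cyan 0.84, magenta 0.0, yellow 0.44, and black 0.21; a tint of 0.0 yields zero for all four components, so intermediate tints scale linearly between the two.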

DeviceN Color Spaces

DeviceN color spaces (PDF 1.3) support the use of high-ﬁdelity and multitone

color. High-ﬁdelity color is the use of more than the standard CMYK process

colorants to produce an extended gamut, or range of colors. A popular example

of such a system is the PANTONE

Hexachrome™ system, which uses six color-

ants: the usual cyan, magenta, yellow, and black, plus orange and green.

Multitone color systems use a single-component image to specify multiple color

components. In a duotone, for example, a single-component image can be used to

specify both the black component and a spot color component. The tone

reproduction is generally different for the different components; for example, the

black component might be painted with the exact sample data from the single-

component image, while the spot color component might be generated as a non-

linear function of the image data in a manner that emphasizes the shadows.

DeviceN color spaces allow any subset of the available device colorants to be

treated as a device color space with multiple components. This provides greater

ﬂexibility than is possible with standard device color spaces such as

DeviceCMYK

or with individual Separation color spaces. For example, it is possible to create a

DeviceN color space consisting of only the cyan, magenta, and yellow color com-

ponents, while excluding the black component. If overprinting is enabled (see

Section 4.5.6, “Overprint Control”), painting in this color space will leave the

black component unchanged.

A DeviceN color space is specified as follows:

[/DeviceN names alternateSpace tintTransform]

[/DeviceN names alternateSpace tintTransform attributes]

It is a four- or ﬁve-element array whose ﬁrst element is the color space family

name

DeviceN. The remaining elements are parameters that a DeviceN color

space requires; their meanings are discussed below.

Color values in the

DeviceN color space are tint components in the range 0.0 to

1.0. The value 0.0 represents the minimum amount of colorant; 1.0 represents the

maximum. The

scn and SCN operators set the current color in the graphics state

to a set of tint values; the initial value is 1.0 for each tint. A sampled image can

also be treated as a source of tint values.

A DeviceN color space works almost the same as a Separation color space—in

fact, a

DeviceN color space with only one component is exactly equivalent to a

Separation color space. The following are the only differences between DeviceN

and Separation:

• Color values in a DeviceN color space consist of multiple tint components,

rather than only one. The number of components is subject to an implementa-

tion limit; see Appendix C.

• The names parameter in the color space array is an array of name objects speci-

fying the individual colorants. (The special colorant name

All is not allowed.)

The length of the array determines the number of components, and hence the

number of operands required by the

scn and SCN operators when this space is

the current color space. Operand values supplied to

scn or SCN are interpreted

as color component values in the order in which the colors are given in the

names array.

• At the moment the color space is set to a DeviceN space, the viewer application

will select the requested set of colorants only if all of them are available on the

device; otherwise, it will select the alternate color space designated by the

alternateSpace parameter.

• The tint transformation function is called with n tint values and must return

the corresponding m color component values, where n is the number of com-

ponents needed to specify a color in the

DeviceN color space and m is the num-

ber required by the alternate color space.

In a

DeviceN color space, one or more of the colorant names in the names array

may be the name

None. This indicates that the corresponding color component is

never painted on the page, as in a

Separation color space for the None colorant.

When a

DeviceN color space is painting the named device colorants directly,

color components corresponding to

None colorants are discarded. However,

when the

DeviceN color space reverts to its alternate color space, those com-

ponents are passed to the tint transformation function, which may use them in

any desired manner.
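The selection rule and the handling of None components can be sketched in Python. This is a simplified model under stated assumptions: `paint_devicen` is a hypothetical helper, and it treats None as always available, which the text implies but does not state for DeviceN explicitly.

```python
def paint_devicen(names, tints, device_colorants, tint_transform):
    """Model how a DeviceN color would be applied (hypothetical helper).

    Returns a dict of colorant name -> tint when painting directly, or a
    tuple of alternate-space components when the alternate space is used.
    """
    if len(names) != len(tints):
        raise ValueError("one tint value is required per colorant name")
    real_names = [n for n in names if n != "None"]
    if all(n in device_colorants for n in real_names):
        # Direct painting: components named None are discarded.
        return {n: t for n, t in zip(names, tints) if n != "None"}
    # Alternate space: every component, including those named None,
    # is passed to the tint transformation function.
    return tuple(tint_transform(*tints))
```

If even one named colorant is missing from the device, all of the tints are routed through the tint transformation function rather than only the missing one.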

The optional

attributes parameter is a dictionary containing additional informa-

tion about the color space. At the time of publication, only one entry is deﬁned in

this dictionary, as shown in Table 4.20.

TABLE 4.20 Entry in a DeviceN color space attributes dictionary

KEY TYPE VALUE

Colorants dictionary (Optional) A dictionary describing the individual colorants used in the DeviceN

color space. For each entry in this dictionary, the key is a colorant name and the

value is an array deﬁning a

Separation color space for that colorant (see “Separa-

tion Color Spaces” on page 183). The key must match the colorant name given in

that color space. The dictionary need not list all colorants used in the

DeviceN

color space and may list additional colorants.

This dictionary has no effect on the operation of the

DeviceN color space itself or

the appearance that it produces. However, it provides information about the indi-

vidual colorants that may be useful to some applications. In particular, the alter-

nate color space and tint transformation function of a

Separation color space

describe the appearance of that colorant alone, whereas those of a

DeviceN color

space describe only the appearance of its colorants in combination.

Example 4.12 shows a DeviceN color space consisting of three color components

named

Orange, Green, and None. In this example, the DeviceN color space,

object 30, has an attributes dictionary whose

Colorants entry is an indirect refer-

ence to object 45 (which might also be referenced by attributes dictionaries of

other

DeviceN color spaces). tintTransform1, whose deﬁnition is not shown, maps

three color components (tints of the colorants

Orange, Green, and None) to four

color components in the alternate color space,

DeviceCMYK. tintTransform2 maps

a single color component (an orange tint) to four components in

DeviceCMYK.

Likewise,

tintTransform3 maps a green tint to DeviceCMYK, and tintTransform4

maps a tint of PANTONE 131 to DeviceCMYK.

Example 4.12

30 0 obj % Color space

[ /DeviceN

[/Orange /Green /None]

/DeviceCMYK

tintTransform1

<< /Colorants 45 0 R >>

]

endobj

45 0 obj % Colorants dictionary

<< /Orange [ /Separation

/Orange

/DeviceCMYK

tintTransform2

]

/Green [ /Separation

/Green

/DeviceCMYK

tintTransform3

]

/PANTONE#20131 [ /Separation

/PANTONE#20131

/DeviceCMYK

tintTransform4

]

>>

endobj

Multitone Examples

The following examples illustrate various interesting and useful special cases of

the use of

Indexed and DeviceN color spaces in combination to produce multi-

tone colors.

Examples 4.13 and 4.14 illustrate the use of

DeviceN to create duotone color

spaces. In Example 4.13, an

Indexed color space maps index values in the range 0

to 255 to a duotone

DeviceN space in cyan and black. In effect, the index values

are treated as if they were tints of the duotone space, which are then mapped into

tints of the two underlying colorants. Only the beginning of the lookup table

string for the

Indexed color space is shown; the full table would contain 256 two-

byte entries, each specifying a tint value for cyan and black, for a total of 512

bytes. If the alternate color space of the

DeviceN space is selected, the tint trans-

formation function (object 15 in the example) maps the two tint components for

cyan and black to the four components for a

DeviceCMYK color space by supply-

ing zero values for the other two components. Example 4.14 shows the deﬁnition

of another duotone color space, this time using black and gold colorants (where

gold is a spot colorant) and using a

CalRGB space as the alternate color space. This

could be deﬁned in the same way as in the preceding example, with a tint trans-

formation function that converts from the two tint components to colors in the

alternate

CalRGB color space.

Example 4.13

10 0 obj % Color space

[ /Indexed

[ /DeviceN

[/Cyan /Black]

/DeviceCMYK

15 0 R

]

255

<6605 6806 6907 6B09 6C0A …>

]

endobj

15 0 obj % Tint transformation function

<< /FunctionType 4

/Domain [0.0 1.0 0.0 1.0]

/Range [0.0 1.0 0.0 1.0 0.0 1.0 0.0 1.0]

/Length 16

>>

stream

{0 0 3 -1 roll}

endstream

endobj
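The tint transformation for Example 4.13 has to turn the pair (cyan, black) into the CMYK quadruple (cyan, 0, 0, black). One way to do that on a stack is to push two zeros and roll black back to the top, which the following Python sketch models. The helper names are hypothetical, and `roll` follows the usual PostScript convention for the operator of that name.

```python
def roll(stack, n, j):
    """Rotate the top n stack items by j positions (PostScript-style).

    With j = -1 the bottom item of the group moves to the top:
    [a, b, c] rolled with n=3, j=-1 becomes [b, c, a].
    """
    if n <= 0:
        return list(stack)
    j %= n
    top = stack[len(stack) - n:]
    rotated = top[-j:] + top[:-j] if j else top
    return stack[:len(stack) - n] + rotated


def cyan_black_to_cmyk(cyan, black):
    """Map two tints to DeviceCMYK by supplying zero magenta and yellow."""
    stack = [cyan, black, 0.0, 0.0]   # push two zeros after the inputs
    return tuple(roll(stack, 3, -1))  # move black above the two zeros


# First two-byte entry of the lookup string in Example 4.13: 66 05.
first_entry = cyan_black_to_cmyk(0x66 / 255, 0x05 / 255)
```

The first table entry therefore corresponds to a CMYK color with cyan 0x66/255 (0.4), no magenta or yellow, and black 0x05/255.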

Example 4.14

30 0 obj % Color space

[ /Indexed

[ /DeviceN

[/Black /Gold]

[ /CalRGB

<< /WhitePoint [1.0 1.0 1.0]

/Gamma [2.2 2.2 2.2]

>>

]

35 0 R % Tint transformation function

]

255

… Lookup table …

]

endobj

Given a formula for converting any combination of black and gold tints to cali-

brated RGB, a 2-in, 3-out type 4 function could be used for the tint transforma-

tion. Alternatively, a type 0 function could be used, but this would require a large

number of sample points to represent the function accurately; for example, sam-

pling each input variable for 256 tint values between 0.0 and 1.0 would require

256² = 65,536 samples. But since the DeviceN color space is being used as the

base of an

Indexed color space, there are actually only 256 possible combinations

of black and gold tint values. A more compact way to represent this information

is to put the alternate color values directly into the lookup table alongside the

DeviceN color values, as in Example 4.15.

Example 4.15

10 0 obj % Color space

[ /Indexed

[ /DeviceN

[/Black /Gold /None /None /None]

[ /CalRGB

<< /WhitePoint [1.0 1.0 1.0]

/Gamma [2.2 2.2 2.2]

>>

]

20 0 R % Tint transformation function

]

255

… Lookup table …

]

endobj

In this example, each entry in the lookup table has ﬁve components: two for the

black and gold colorants and three more (speciﬁed as

None) for the equivalent

CalRGB color components. If the black and gold colorants are available on the

output device, the

None components will be ignored; if black and gold are not

available, the tint transformation function will be used to convert a ﬁve-compo-

nent color into a three-component equivalent in the alternate

CalRGB color

space. But since, by construction, the third, fourth, and ﬁfth components are the

CalRGB components, the tint transformation function can merely discard the ﬁrst

two components and return the last three. This can be easily expressed with a

type 4 function, as shown in Example 4.16.

Example 4.16

20 0 obj % Tint transformation function

<< /FunctionType 4

/Domain [0.0 1.0 0.0 1.0 0.0 1.0 0.0 1.0 0.0 1.0]

/Range [0.0 1.0 0.0 1.0 0.0 1.0]

/Length 27

>>

stream

{5 3 roll pop pop}

endstream

endobj
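The space saving behind Examples 4.15 and 4.16 can be made concrete with a short Python sketch. Everything about the table contents below is invented for illustration (the document elides the real lookup table); only the layout of five bytes per entry and the behavior of the tint transformation come from the text.

```python
def build_entry(black, gold, rgb):
    """Pack one five-byte lookup entry: Black, Gold, then three CalRGB bytes."""
    return bytes([black, gold, *rgb])


def tint_transform(black, gold, a, b, c):
    """Discard the two colorant tints and return the alternate-space values.

    Equivalent in effect to rolling the last three values below the first
    two and popping twice.
    """
    return (a, b, c)


# Placeholder data: 256 entries with made-up values, five bytes each.
table = b"".join(build_entry(i, 255 - i, (i, i, i)) for i in range(256))

table_bytes = len(table)      # 5 bytes per entry, 256 entries
type0_samples = 256 ** 2      # samples a two-input type 0 function would need
```

The lookup table needs 1280 bytes in total, compared with the 65,536 sample points that sampling both inputs at 256 levels would require.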

For a ﬁnal example, consider Figure 4.16, which shows a quadtone (four-

component) image produced using

Indexed and DeviceN color spaces by an

extension of the techniques described above. (See implementation note 30 in

Appendix H.)

FIGURE 4.16 Quadtone image using Indexed DeviceN (left: single-component (grayscale) image; right: quadtone image)

This example starts with the grayscale image shown on the left and paints it with

four colorants: black and three PANTONE spot colors. The alternate color space

is a simple calibrated RGB. Thus the DeviceN color space has seven components:

the four desired colorants plus the three components of the alternate space.

Example 4.17 shows the image XObject (see Section 4.8.4, “Image Dictionaries”)

representing the quadtone image, followed by the color space used to interpret

the image data.

Example 4.17

5 0 obj % Image XObject

<< /Type /XObject

/Subtype /Image

/Width 288

/Height 288

/ColorSpace 10 0 R

/BitsPerComponent 8

/Length 105278

/Filter /ASCII85Decode

>>

stream

… Data for grayscale image …

endstream

endobj

10 0 obj % Indexed color space for image

[ /Indexed

15 0 R % Base color space

255 % Table has 256 entries

30 0 R % Lookup table

]

endobj

15 0 obj % Base color space (DeviceN) for Indexed space

[ /DeviceN

[ /Black % Four colorants (black plus three spot colors)

/PANTONE#20216#20CVC

/PANTONE#20409#20CVC

/PANTONE#202985#20CVC

/None % Three components for alternate space

/None

/None

]

16 0 R % Alternate color space

20 0 R % Tint transformation function

]

endobj

16 0 obj % Alternate color space for DeviceN space

[ /CalRGB

<< /WhitePoint [1.0 1.0 1.0] >>

]

endobj

20 0 obj % Tint transformation function for DeviceN space

<< /FunctionType 4

/Domain [0.0 1.0 0.0 1.0 0.0 1.0 0.0 1.0 0.0 1.0 0.0 1.0 0.0 1.0]

/Range [0.0 1.0 0.0 1.0 0.0 1.0]

/Length 44

>>

stream

{ 7 3 roll % Just discard ﬁrst four values

pop pop pop pop

}

endstream

endobj

30 0 obj % Lookup table for Indexed color space

<< /Length 1975

/Filter [/ASCII85Decode /FlateDecode]

>>

stream

8;T1BB2"M7*!"psYBt1k\gY1T<D&tO]r*F7Hga*

… Additional data (seven components for each table entry) …

endstream

endobj

4.5.6 Overprint Control

The graphics state contains an overprint parameter, controlled by the op and OP

entries in a graphics state parameter dictionary. Overprint control is useful main-

ly on devices that produce true physical separations, but it is available on some

composite devices as well. Although the operation of this parameter is device-

dependent, it is described here, rather than in the chapter on color rendering,

because it pertains to an aspect of painting in device color spaces that is impor-

tant to many applications.

Any painting operation marks some speciﬁc set of device colorants, depending

on the color space in which the painting takes place. In a

Separation or DeviceN

color space, the colorants to be marked are speciﬁed explicitly; in a device or CIE-

based color space, they are implied by the process color model of the output

device (see Chapter 6). The overprint parameter is a boolean ﬂag that determines

how painting operations affect colorants other than those explicitly or implicitly

speciﬁed by the current color space.

If the overprint parameter is false (the default value), painting a color in any color

space causes the corresponding areas of unspeciﬁed colorants to be erased (paint-

ed with a tint value of 0.0). The effect is that the color at any position on the page

is whatever was painted there last; this is consistent with the normal opaque

painting behavior of the Adobe imaging model.

If the overprint parameter is true and the output device supports overprinting,

no such erasing actions are performed; anything previously painted in other

colorants is left undisturbed. Consequently, the color at a given position on the

page may be a combined result of several painting operations in different color-

ants. The effect produced by such overprinting is device-dependent and is not

deﬁned by the PDF language.
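The difference between the two settings of the overprint parameter can be modeled per colorant, as in the Python sketch below. This is a deliberately simplified model: `paint` is a hypothetical function that tracks only the tint stored for each colorant at one position on the page, and it says nothing about the visual result of combining colorants, which the text notes is device-dependent.

```python
def paint(page, colorants, overprint):
    """Return the colorant tints at one position after a painting operation.

    page      -- dict of colorant name -> tint currently at that position
    colorants -- dict of colorant name -> tint being painted
    overprint -- value of the overprint parameter (device assumed to support it)
    """
    if overprint:
        result = dict(page)                  # other colorants left undisturbed
    else:
        result = {name: 0.0 for name in page}  # unspecified colorants erased
    result.update(colorants)
    return result


before = {"Cyan": 0.5, "Magenta": 0.0, "Yellow": 0.0, "Black": 0.0, "Gold": 0.0}
opaque = paint(before, {"Gold": 0.8}, overprint=False)
combined = paint(before, {"Gold": 0.8}, overprint=True)
```

With overprint false, painting in Gold erases the cyan that was there; with overprint true, the cyan tint of 0.5 survives alongside the new gold tint.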

Note: Not all devices support overprinting. Furthermore, many PostScript printers

support it only when separations are being produced, and not for composite output.

If overprinting is not supported, the value of the overprint parameter is ignored.

An additional graphics state parameter, the overprint mode (PDF 1.3), affects the

interpretation of a tint value of 0.0 for a color component in a

DeviceCMYK color

space when overprinting is enabled. This parameter is controlled by the

OPM

entry in a graphics state parameter dictionary; it has an effect only when the over-

print parameter is true, as described above.

When colors are speciﬁed in a

DeviceCMYK color space and the output device has

a native color space that is also

DeviceCMYK, each of the source color com-

ponents controls the corresponding device colorant directly. Ordinarily, each

source color component value replaces the value previously painted for the cor-

responding device colorant, no matter what the new value is; this is the default

behavior, speciﬁed by overprint mode 0.

When the overprint mode is 1 (also called nonzero overprint mode), a tint value of

0.0 for a source color component leaves the corresponding component of the

previously painted color unchanged. The effect is equivalent to painting in a

DeviceN color space that includes only those components whose values are non-

zero. For example, if the overprint parameter is true and the overprint mode is 1,

the operation

0.2 0.3 0.0 1.0 k

is equivalent to

0.2 0.3 1.0 scn

in the color space shown in Example 4.18.

Example 4.18

10 0 obj % Color space

[ /DeviceN

[/Cyan /Magenta /Black]

/DeviceCMYK

15 0 R

]

endobj

15 0 obj % Tint transformation function

<< /FunctionType 4

/Domain [0.0 1.0 0.0 1.0 0.0 1.0]

/Range [0.0 1.0 0.0 1.0 0.0 1.0 0.0 1.0]

/Length 13

>>

stream

{0 exch}

endstream

endobj

Nonzero overprint mode applies only to painting operations that use the current

color in the graphics state when the current color space is

DeviceCMYK. It does

not apply to the painting of images or to any colors that are the result of a compu-

tation, such as those in a shading pattern or conversions from some other color

space. It also does not apply if the device’s native color space is not

DeviceCMYK;

in that case, source colors must be converted to the device’s native color space,

and all components participate in the conversion, whatever their values. (This is

shown explicitly in the alternate color space and tint transformation function of

the

DeviceN color space in Example 4.18.)
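The replacement rule for the two overprint modes can be sketched in Python. This is an illustration only and not part of PDF: the function name paint_cmyk and the tuple representation of colors are assumptions. The sketch covers only the case described above, in which the source color is set in DeviceCMYK, the device's native color space is DeviceCMYK, and the device supports overprinting.

```python
def paint_cmyk(previous, source, overprint=False, opm=0):
    """Return the CMYK value at a point after painting `source` over `previous`.

    Hypothetical helper: both colors are (c, m, y, k) tuples of tint values.
    """
    if overprint and opm == 1:
        # Nonzero overprint mode: a tint of 0.0 leaves that colorant as it was.
        return tuple(p if s == 0.0 else s for p, s in zip(previous, source))
    # Overprint mode 0 (the default): every component replaces the old value.
    return tuple(source)


before = (0.5, 0.5, 0.5, 0.5)
print(paint_cmyk(before, (0.2, 0.3, 0.0, 1.0), overprint=True, opm=1))
print(paint_cmyk(before, (0.2, 0.3, 0.0, 1.0), overprint=True, opm=0))
```

In the first call the previously painted yellow value survives because the source yellow tint is 0.0; in the second call it is replaced by 0.0.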

4.5.7 Color Operators

Table 4.21 lists the PDF operators that control color spaces and color values. (Also

color-related is the graphics state operator

ri, listed in Table 4.7 on page 142 and

discussed under “Rendering Intents” on page 179.) Color operators may appear

at the page description level or inside text objects (see Figure 4.1 on page 122).

TABLE 4.21 Color operators

OPERANDS OPERATOR DESCRIPTION

name cs (PDF 1.1) Set the color space to use for nonstroking operations. The operand

name must be a name object. If the color space is one that can be speciﬁed by a

name and no additional parameters (

DeviceGray, DeviceRGB, DeviceCMYK, and

certain cases of

Pattern), the name may be speciﬁed directly. Otherwise, it must

be a name deﬁned in the

ColorSpace subdictionary of the current resource dic-

tionary (see Section 3.7.2, “Resource Dictionaries”); the associated value is an

array describing the color space (see Section 4.5.2, “Types of Color Space”).

The

cs operator also sets the current nonstroking color to its initial value, which

depends on the color space:

• In a device, CalGray, or CalRGB color space, the initial color is black.

• In a Lab or ICCBased color space, the initial color is black unless that falls

outside the intervals speciﬁed by the space’s

Range entry, in which case the

nearest valid value is substituted.

• In an Indexed color space, the initial color value is 0.

• In a Separation or DeviceN color space, the initial tint value is 1.0 for all color-

ants.

• In a Pattern color space, the initial color is a pattern object that causes nothing

to be painted.

name CS (PDF 1.1) Same as cs, but for stroking operations.

c1 … cn sc (PDF 1.1) Set the color to use for nonstroking operations in a device, CIE-based

(other than

ICCBased), or Indexed color space. The number of operands re-

quired and their interpretation depends on the current nonstroking color space:

• For DeviceGray, CalGray, and Indexed color spaces, one operand is required

(n = 1).

• For DeviceRGB, CalRGB, and Lab color spaces, three operands are required

(n = 3).

• For DeviceCMYK, four operands are required (n = 4).

c1 … cn scn

c1 … cn name scn (PDF 1.2) Same as sc, but also supports Pattern, Separation, DeviceN, and

ICCBased color spaces.

If the current nonstroking color space is a

Separation, DeviceN (PDF 1.3), or

ICCBased (PDF 1.3) color space, the operands c1 … cn are numbers. The number

of operands and their interpretation depends on the color space.

If the current nonstroking color space is a

Pattern color space, name is the name

of an entry in the

Pattern subdictionary of the current resource dictionary (see

Section 3.7.2, “Resource Dictionaries”). For an uncolored tiling pattern

(PatternType = 1 and PaintType = 2), c1 … cn are component values specifying a

color in the pattern’s underlying color space. For other types of pattern, these

operands must not be speciﬁed.

c1 … cn SC (PDF 1.1) Same as sc, but for stroking operations.

c1 … cn SCN

c1 … cn name SCN (PDF 1.2) Same as scn, but for stroking operations.

gray g

Set the color space to DeviceGray (or the DefaultGray color space; see “Default

Color Spaces” on page 177) and set the gray level to use for nonstroking opera-

tions.

gray is a number between 0.0 (black) and 1.0 (white).

gray G Same as g, but for stroking operations.

rgb rg Set the color space to DeviceRGB (or the DefaultRGB color space; see “Default

Color Spaces” on page 177) and set the color to use for nonstroking operations.

Each operand must be a number between 0.0 (minimum intensity) and 1.0

(maximum intensity).

rgb RG Same as rg, but for stroking operations.

cmyk k Set the color space to DeviceCMYK (or the DefaultCMYK color space; see “Default

Color Spaces” on page 177) and set the color to use for nonstroking operations.

Each operand must be a number between 0.0 (zero concentration) and 1.0

(maximum concentration). The behavior of this operator is affected by the over-

print mode (see Section 4.5.6, “Overprint Control”).

cmyk K Same as k, but for stroking operations.
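The operand counts that Table 4.21 gives for sc and SC can be captured in a small lookup. The following Python sketch is illustrative only: the function name check_sc_operands and the use of plain strings for color space families are assumptions, and the sketch does not check that component values lie within the valid range for the color space.

```python
# Operand counts for sc/SC by color space family, as listed in Table 4.21.
SC_OPERAND_COUNT = {
    "DeviceGray": 1, "CalGray": 1, "Indexed": 1,
    "DeviceRGB": 3, "CalRGB": 3, "Lab": 3,
    "DeviceCMYK": 4,
}


def check_sc_operands(color_space, operands):
    """Return the operands as floats, or raise ValueError if they do not fit."""
    if color_space not in SC_OPERAND_COUNT:
        # Pattern, Separation, DeviceN, and ICCBased need scn/SCN instead.
        raise ValueError(f"{color_space} requires scn/SCN, not sc/SC")
    expected = SC_OPERAND_COUNT[color_space]
    if len(operands) != expected:
        raise ValueError(
            f"{color_space} expects {expected} operand(s), got {len(operands)}"
        )
    return tuple(float(x) for x in operands)


print(check_sc_operands("DeviceRGB", [1, 0, 0]))
```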

In certain circumstances, invoking operators that specify colors or other color-

related parameters in the graphics state is not allowed. This restriction occurs

when deﬁning graphical ﬁgures whose colors are to be speciﬁed separately each

time they are used. Speciﬁcally, the restriction applies:

• In any glyph deﬁnition that uses the d1 operator (see Section 5.5.4, “Type 3

Fonts”)

• In the content stream of an uncolored tiling pattern (see “Uncolored Tiling

Patterns” on page 209)

In these circumstances, the following will cause an error:

• Invoking any of the following operators:

cs CS sc SC scn SCN g G rg RG k K ri sh

• Invoking the gs operator with any of the following entries in the graphics state

parameter dictionary:

HT TR TR2 BG BG2 UCR UCR2

• Painting an image. However, painting an image mask (see “Stencil Masking” on

page 257) is permitted, because it does not specify colors, but rather designates

places where the current color is to be painted.
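A checker for these restrictions might look like the following Python sketch. It is a rough illustration under several assumptions: the content stream has already been tokenized into (operator, operands) pairs, the ExtGState and XObject resources are available as plain dictionaries, and the function name find_color_violations is hypothetical. Inline images are not handled.

```python
FORBIDDEN_OPERATORS = {"cs", "CS", "sc", "SC", "scn", "SCN",
                       "g", "G", "rg", "RG", "k", "K", "ri", "sh"}
FORBIDDEN_GS_KEYS = {"HT", "TR", "TR2", "BG", "BG2", "UCR", "UCR2"}


def find_color_violations(ops, ext_gstates=None, xobjects=None):
    """Return (index, reason) pairs for operations that are not allowed.

    ops         -- list of (operator, operands) tuples in stream order
    ext_gstates -- maps a gs resource name to its parameter dictionary
    xobjects    -- maps a Do resource name to the XObject's dictionary
    """
    ext_gstates = ext_gstates or {}
    xobjects = xobjects or {}
    violations = []
    for index, (operator, operands) in enumerate(ops):
        if operator in FORBIDDEN_OPERATORS:
            violations.append((index, f"operator {operator}"))
        elif operator == "gs":
            bad = FORBIDDEN_GS_KEYS & set(ext_gstates.get(operands[0], {}))
            if bad:
                violations.append((index, "gs sets " + ", ".join(sorted(bad))))
        elif operator == "Do":
            xobj = xobjects.get(operands[0], {})
            # Image masks are permitted; other images specify their own colors.
            if xobj.get("Subtype") == "Image" and not xobj.get("ImageMask", False):
                violations.append((index, "paints an image"))
    return violations


stream = [("m", (0, 12)), ("g", (0.5,)), ("f", ())]
print(find_color_violations(stream))
```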

4.6 Patterns

When operators such as S (stroke), f (ﬁll), and Tj (show text) paint an area of the

page with the current color, they ordinarily apply a single color that covers the

area uniformly. However, it is also possible to apply “paint” that consists of a

repeating graphical ﬁgure or a smoothly varying color gradient instead of a sim-

ple color. Such a repeating ﬁgure or smooth gradient is called a pattern. Patterns

are quite general, and have many uses; for example, they can be used to create

various graphical textures, such as weaves, brick walls, sunbursts, and similar

geometrical and chromatic effects. (See implementation note 31 in Appendix H.)

Patterns come in two varieties:

• Tiling patterns consist of a small graphical ﬁgure (called a pattern cell) that is

replicated at ﬁxed horizontal and vertical intervals to ﬁll the area to be painted.

The graphics objects to use for tiling are described by a content stream.

• Shading patterns deﬁne a gradient ﬁll that produces a smooth transition

between colors across the area. The color to use is speciﬁed as a function of

position using any of a variety of methods.

Note: The ability to paint with patterns is a feature of PDF 1.2 (tiling patterns) and

PDF 1.3 (shading patterns). With some effort, it is possible to achieve a limited form

of tiling patterns in PDF 1.1 by deﬁning them as character glyphs in a special font

and painting them repeatedly with the

Tj operator. Another technique, deﬁning

patterns as halftone screens, is not recommended, because the effects produced are

device-dependent.

Patterns are speciﬁed in a special family of color spaces named

Pattern, whose

“color values” are pattern objects instead of the numeric component values used

with other color spaces. A pattern object may be a dictionary or a stream, de-

pending on the type of pattern; the term pattern dictionary will be used generical-

ly in this section to refer to either a dictionary object or the dictionary portion of

a stream object. This section describes

Pattern color spaces and the speciﬁcation

of color values within them; see Section 4.5, “Color Spaces,” for information

about color spaces and color values in general.

4.6.1 General Properties of Patterns

A pattern dictionary contains descriptive information deﬁning the appearance

and properties of a pattern. All pattern dictionaries contain an entry named

PatternType, whose value identiﬁes the kind of pattern the dictionary describes:

type 1 for a tiling pattern or type 2 for a shading pattern. The remaining contents

of the dictionary depend on the pattern type, and are detailed below in the sec-

tions on individual pattern types.

All patterns are treated as colors; a Pattern color space is established with the cs

or CS operator just like other color spaces, and a particular pattern is installed as

the current color with the

scn or SCN operator (see Table 4.21 on page 198).

A pattern’s appearance is described with respect to its own internal coordinate

system. Every pattern has a pattern matrix, a transformation matrix that maps the

pattern’s internal coordinate system to the default coordinate system of the

pattern’s parent content stream (the content stream in which the pattern is deﬁned

as a resource). The concatenation of the pattern matrix with that of the parent

content stream establishes the pattern coordinate space, within which all graphics

objects in the pattern are interpreted.

For example, if a pattern is used on a page, the pattern will appear in the

Pattern

subdictionary of that page’s resource dictionary, and the pattern matrix maps

pattern space to the default (initial) coordinate space of the page. Changes to the

page’s transformation matrix that occur within the page’s content stream, such as

rotation and scaling, have no effect on the pattern; it maintains its original rela-

tionship to the page no matter where on the page it is used. Similarly, if a pattern

is used within a form XObject (see Section 4.9, “Form XObjects”), the pattern

matrix maps pattern space to the form’s default user space (that is, the form co-

ordinate space at the time the form is painted with the

Do operator). Finally, a

pattern may be used within another pattern; the inner pattern’s matrix defines its

relationship to the pattern space of the outer pattern.
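The concatenation of a pattern matrix with the matrix of its parent content stream can be worked through numerically. The Python sketch below assumes the usual PDF convention that a six-element matrix [a b c d e f] maps a point (x, y) to (ax + cy + e, bx + dy + f); the helper names concat and transform and the sample matrices are hypothetical.

```python
def concat(m, n):
    """Return the matrix that applies m first and then n."""
    a1, b1, c1, d1, e1, f1 = m
    a2, b2, c2, d2, e2, f2 = n
    return (a1 * a2 + b1 * c2, a1 * b2 + b1 * d2,
            c1 * a2 + d1 * c2, c1 * b2 + d1 * d2,
            e1 * a2 + f1 * c2 + e2, e1 * b2 + f1 * d2 + f2)


def transform(m, x, y):
    """Map the point (x, y) through the matrix m."""
    a, b, c, d, e, f = m
    return (a * x + c * y + e, b * x + d * y + f)


pattern_matrix = (2, 0, 0, 2, 10, 20)        # scale by 2, then translate
parent_default = (0.5, 0, 0, 0.5, 0, 0)      # assumed default matrix of the parent
pattern_space = concat(pattern_matrix, parent_default)

# The combined matrix gives the same result as applying the two in turn.
step_by_step = transform(parent_default, *transform(pattern_matrix, 3, 4))
print(pattern_space, transform(pattern_space, 3, 4), step_by_step)
```

Because the parent's default matrix is used rather than the current transformation matrix, later changes to the current transformation matrix in the parent content stream do not enter this calculation.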

Note: PostScript allows a pattern to be deﬁned in one context but used in another.

For example, a pattern might be deﬁned on a page (that is, its pattern matrix maps

the pattern coordinate space to the user space of the page) but be used in a form on

that page, so that its relationship to the page is independent of each individual place-

ment of the form. PDF does not support this feature; in PDF, all patterns are local to

the context in which they are deﬁned.

4.6.2 Tiling Patterns

A tiling pattern consists of a small graphical ﬁgure called a pattern cell. Painting

with the pattern replicates the cell at ﬁxed horizontal and vertical intervals to ﬁll

an area. The effect is as if the ﬁgure were painted on the surface of a clear glass

tile, identical copies of which were then laid down in an array covering the area

and trimmed to its boundaries. This is called tiling the area.

The pattern cell can include graphical elements such as ﬁlled areas, text, and sam-

pled images. Its shape need not be rectangular, and the spacing of tiles can differ

from the dimensions of the cell itself. When performing painting operations such

as S (stroke) or f (fill), the viewer application paints the cell on the current page

as many times as necessary to ﬁll an area. The order in which individual tiles

(instances of the cell) are painted is unspeciﬁed and unpredictable; it is inad-

visable for the ﬁgures on adjacent tiles to overlap.

The appearance of the pattern cell is deﬁned by a content stream containing the

painting operators needed to paint one instance of the cell. Table 4.22 lists the

entries in this stream’s dictionary.

TABLE 4.22 Entries in a type 1 pattern dictionary

KEY TYPE VALUE

Type name (Optional) The type of PDF object that this dictionary describes; if present,

must be

Pattern for a pattern dictionary.

PatternType integer (Required) A code identifying the type of pattern that this dictionary

describes; must be 1 for a tiling pattern.

PaintType integer (Required) A code that determines how the color of the pattern cell is to be

speciﬁed:

1 Colored tiling pattern. The pattern’s content stream itself speciﬁes the

colors used to paint the pattern cell. When the content stream begins

execution, the current color is the one that was initially in effect in

the pattern’s parent content stream. (This is similar to the deﬁnition

of the pattern matrix; see above.)

2 Uncolored tiling pattern. The pattern’s content stream does not speci-

fy any color information. Instead, the entire pattern cell is painted

with a separately speciﬁed color each time the pattern is used. Essen-

tially, the content stream describes a stencil through which the cur-

rent color is to be poured. The content stream must not invoke

operators that specify colors or other color-related parameters in the

graphics state; otherwise, an error will occur (see Section 4.5.7, “Col-

or Operators”). The content stream may paint an image mask, how-

ever, since it does not specify any color information (see “Stencil

Masking” on page 257).

TilingType integer (Required) A code that controls adjustments to the spacing of tiles relative to

the device pixel grid:

1 Constant spacing. Pattern cells are spaced consistently—that is, by a

multiple of a device pixel. To achieve this, the viewer application may

need to distort the pattern cell slightly by making small adjustments

to XStep, YStep, and the transformation matrix. The amount of distortion

does not exceed 1 device pixel.

2 No distortion. The pattern cell is not distorted, but the spacing

between pattern cells may vary by as much as 1 device pixel, both

horizontally and vertically, when the pattern is painted. This achieves

the spacing requested by

XStep and YStep on average, but not neces-

sarily for each individual pattern cell.

3 Constant spacing and faster tiling. Pattern cells are spaced consistently

as in tiling type 1, but with additional distortion permitted to enable

a more efﬁcient implementation.

BBox rectangle (Required) An array of four numbers in the pattern coordinate system giving

the coordinates of the left, bottom, right, and top edges, respectively, of the

pattern cell’s bounding box. These boundaries are used to clip the pattern

cell.

XStep number (Required) The desired horizontal spacing between pattern cells, measured in

the pattern coordinate system.

YStep number (Required) The desired vertical spacing between pattern cells, measured in

the pattern coordinate system. Note that

XStep and YStep may differ from

the dimensions of the pattern cell implied by the

BBox entry. This allows

tiling with irregularly shaped ﬁgures.

XStep and YStep may be either positive

or negative, but not zero.

Resources dictionary (Required) A resource dictionary containing all of the named resources

required by the pattern’s content stream (see Section 3.7.2, “Resource Dic-

tionaries”).

Matrix array (Optional) An array of six numbers deﬁning the pattern matrix (see

Section 4.6.1, “General Properties of Patterns”). Default value: the identity

matrix [1 0 0 1 0 0].
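The constraints in Table 4.22 lend themselves to a simple consistency check. The following Python sketch assumes the pattern dictionary has been parsed into an ordinary dict with string keys; the function name check_tiling_pattern and the wording of the messages are hypothetical, and the check is not exhaustive (for example, it does not verify value types).

```python
REQUIRED_KEYS = ("PatternType", "PaintType", "TilingType",
                 "BBox", "XStep", "YStep", "Resources")


def check_tiling_pattern(d):
    """Return a list of problems found in a type 1 pattern dictionary."""
    problems = [f"missing required key {k}" for k in REQUIRED_KEYS if k not in d]
    if d.get("Type", "Pattern") != "Pattern":
        problems.append("Type, if present, must be Pattern")
    if d.get("PatternType", 1) != 1:
        problems.append("PatternType must be 1 for a tiling pattern")
    if d.get("PaintType", 1) not in (1, 2):
        problems.append("PaintType must be 1 (colored) or 2 (uncolored)")
    if d.get("TilingType", 1) not in (1, 2, 3):
        problems.append("TilingType must be 1, 2, or 3")
    if "BBox" in d and len(d["BBox"]) != 4:
        problems.append("BBox must have four numbers")
    for key in ("XStep", "YStep"):
        if d.get(key, 1) == 0:
            problems.append(f"{key} must be nonzero")
    matrix = d.get("Matrix", [1, 0, 0, 1, 0, 0])  # default: identity matrix
    if len(matrix) != 6:
        problems.append("Matrix must have six numbers")
    return problems


star = {"Type": "Pattern", "PatternType": 1, "PaintType": 1, "TilingType": 1,
        "BBox": [0, 0, 60, 60], "XStep": 60, "YStep": 60, "Resources": {}}
print(check_tiling_pattern(star))
print(check_tiling_pattern({**star, "XStep": 0}))
```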

The pattern dictionary’s BBox, XStep, and YStep values are interpreted in the pat-

tern coordinate system, and the graphics objects in the pattern’s content stream

are deﬁned with respect to that coordinate system. The placement of pattern cells

in the tiling is based on the location of one key pattern cell, which is then dis-

placed by multiples of

XStep and YStep to replicate the pattern. The origin of the

key pattern cell coincides with the origin of the pattern coordinate system; the

phase of the tiling can be controlled by the translation components of the

Matrix

entry in the pattern dictionary.
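The displacement of the key pattern cell by multiples of XStep and YStep can be illustrated with a short calculation. The Python sketch below works entirely in pattern space and lists the origins of the cells whose bounding boxes overlap a given rectangle. It is a simplified model: the function name tile_origins is hypothetical, the pattern matrix is ignored, and the device-pixel adjustments governed by TilingType are left out.

```python
import math


def tile_origins(bbox, x_step, y_step, area):
    """List origins of the pattern cells whose BBox overlaps `area`.

    bbox and area are (left, bottom, right, top) in pattern space.
    """
    if x_step == 0 or y_step == 0:
        raise ValueError("XStep and YStep must be nonzero")
    left, bottom, right, top = bbox
    a_left, a_bottom, a_right, a_top = area
    # Multiples of a negative step form the same lattice as its magnitude.
    sx, sy = abs(x_step), abs(y_step)
    # Cell i spans [left + i*sx, right + i*sx]; keep those overlapping the area.
    i_min = math.floor((a_left - right) / sx) + 1
    i_max = math.ceil((a_right - left) / sx) - 1
    j_min = math.floor((a_bottom - top) / sy) + 1
    j_max = math.ceil((a_top - bottom) / sy) - 1
    return [(i * sx, j * sy)
            for j in range(j_min, j_max + 1)
            for i in range(i_min, i_max + 1)]


# A 24-unit cell centered on the origin, repeated every 30 units.
origins = tile_origins((-12, -12, 12, 12), 30, 30, (0, 0, 60, 30))
print(len(origins), origins)
```

With a step of 30 and a cell only 24 units wide, the cells do not touch, so gaps of 6 units remain between them.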

The ﬁrst step in painting with a tiling pattern is to establish the pattern as the cur-

rent color in the graphics state. Subsequent painting operations will tile the

painted areas with the pattern cell described by the pattern’s content stream.

Whenever it needs to obtain the pattern cell, the viewer application does the fol-

lowing:

1. Saves the current graphics state (as if by invoking the

q operator)

2. Installs the graphics state that was in effect at the beginning of the pattern’s

parent content stream, with the current transformation matrix altered by the

pattern matrix as described in Section 4.6.1, “General Properties of Patterns”

3. Paints the graphics objects speciﬁed in the pattern’s content stream

4. Restores the saved graphics state (as if by invoking the

Q operator)

Note: The pattern’s content stream should not set any of the device-dependent

parameters in the graphics state (see Table 4.3 on page 136). Doing so may result in

incorrect output.

Colored Tiling Patterns

A colored tiling pattern is one whose color is self-contained. In the course of

painting the pattern cell, the pattern’s content stream explicitly sets the color of

each graphical element it paints. A single pattern cell can contain elements that

are painted different colors; it can also contain sampled grayscale or color images.

This type of pattern is identiﬁed by a pattern type of 1 and a paint type of 1 in the

pattern dictionary.

When the current color space is a

Pattern space, a colored tiling pattern can be

selected as the current color by supplying its name as the single operand to the

scn or SCN operator. This name must be the key of an entry in the Pattern sub-

dictionary of the current resource dictionary (see Section 3.7.2, “Resource Dic-

tionaries”), whose value is the stream object representing the pattern. Since the

pattern deﬁnes its own color information, no additional operands representing

color components are speciﬁed to

scn or SCN. For example, if P1 is the name of a

pattern resource in the current resource dictionary, the following code establishes

it as the current nonstroking color:

/Pattern cs

/P1 scn

Subsequent executions of nonstroking painting operators, such as f (ﬁll), Tj

(paint text), or Do (paint external object) with an image mask, will use the desig-

nated pattern to tile the areas to be painted.

Example 4.19 deﬁnes a page (object 5) that paints a rectangle and a character

glyph using a colored tiling pattern (object 15). Figure 4.17 shows the results.

Example 4.19

5 0 obj % Page object

<< /Type /Page

/Parent 2 0 R

/Resources 10 0 R

/Contents 20 0 R

>>

endobj

10 0 obj % Resource dictionary for page

<< /ProcSet [/PDF /Text]

/Font << /F1 12 0 R >>

/Pattern << /P1 15 0 R >>

>>

endobj

12 0 obj

<< /Type /Font

/Subtype /Type1

/BaseFont /Times-Roman

>>

endobj

15 0 obj % Pattern deﬁnition

<< /Type /Pattern

/PatternType 1 % Tiling pattern

/PaintType 1 % Colored

/TilingType 1

/BBox [0 0 60 60]

/XStep 60

/YStep 60

/Resources 16 0 R

/Length 404

>>

stream

0.3 g % Set color for dark gray stars

15.000 27.000 m % Construct star-shaped path

7.947 5.292 l

26.413 18.708 l

3.587 18.708 l

22.053 5.292 l

f % Fill with dark gray

45.000 57.000 m % Construct star-shaped path

37.947 35.292 l

56.413 48.708 l

33.587 48.708 l

52.053 35.292 l

f % Fill with dark gray

0.7 g % Set color for light gray stars

15.000 57.000 m % Construct star-shaped path

7.947 35.292 l

26.413 48.708 l

3.587 48.708 l

22.053 35.292 l

f % Fill with light gray

45.000 27.000 m % Construct star-shaped path

37.947 5.292 l

56.413 18.708 l

33.587 18.708 l

52.053 5.292 l

f % Fill with light gray

endstream

endobj

16 0 obj % Resource dictionary for pattern

<< /ProcSet [/PDF] >>

endobj

20 0 obj % Contents of page

<< /Length 246 >>

stream

/Pattern cs % Set pattern color space

/P1 scn % Set star pattern as nonstroking color

0.0 G % Set stroking color to black

120 120 184 120 re % Construct rectangular path

B % Fill and stroke path

BT % Begin text object

/F1 1 Tf % Set font and size

270 0 0 270 160 100 Tm % Set text matrix

0.9 g % Set nonstroking color to light gray

(A) Tj % Fill glyph with gray

/Pattern cs % Set pattern color space

/P1 scn % Set star pattern as nonstroking color

0 0 TD % Return to start of line

(A) Tj % Fill glyph with stars

ET % End text object

endstream

endobj

FIGURE 4.17 Output from Example 4.19

The pattern consists of four stars in two different colors. The pattern’s content

stream speciﬁes the colors of the stars. Several features of Example 4.19 are note-

worthy:

• The rectangle and the glyph representing the letter A are painted with the same

pattern. The pattern cells align, even though the current transformation matrix

is altered between the two uses of the pattern.

• The pattern cell does not completely cover the tile: it leaves the spaces between

the stars unpainted. When the tiling pattern is used as a color, the existing

background shows through these unpainted areas, as the appearance of the

glyph in Figure 4.17 demonstrates. The letter is ﬁrst painted solid gray; when it

is painted again with the star pattern, the gray continues to show between the

stars.

Uncolored Tiling Patterns

An uncolored tiling pattern is one that has no inherent color: the color must be

speciﬁed separately whenever the pattern is used. This type of pattern is iden-

tiﬁed by a pattern type of 1 and a paint type of 2 in the pattern dictionary. The

pattern’s content stream does not explicitly specify any colors; it can paint an im-

age mask (see “Stencil Masking” on page 257), but no other kind of image. This

provides a way to tile different regions of the page with pattern cells having the

same shape but different colors.

A Pattern color space representing an uncolored tiling pattern requires a parameter:

an object identifying the underlying color space in which the actual color of

the pattern is to be speciﬁed. The underlying color space is given as the second

element of the array that deﬁnes the

Pattern color space. For example, the array

[/Pattern /DeviceRGB]

deﬁnes a Pattern color space with DeviceRGB as its underlying color space.

Note: The underlying color space cannot be another

Pattern color space.

Operands supplied to the

scn or SCN operator in such a color space must include

a color value in the underlying color space, speciﬁed by one or more numeric

color components, as well as the name of a pattern object representing an un-

colored tiling pattern. For example, if the current resource dictionary (see

Section 3.7.2, “Resource Dictionaries”) deﬁnes Cs3 as the name of a ColorSpace

resource whose value is the Pattern color space shown above, and P2 as a Pattern

resource denoting an uncolored tiling pattern, then the code

/Cs3 cs

0.30 0.75 0.21 /P2 scn

establishes Cs3 as the current nonstroking color space and P2 as the current non-

stroking color, to be painted in the color represented by the speciﬁed components

in the

DeviceRGB color space. Subsequent executions of nonstroking painting

operators, such as

f (ﬁll), Tj (show text), and Do (paint external object) with an

image mask, will use the designated pattern and color to tile the areas to be paint-

ed. The same pattern can be used repeatedly with a different color each time.
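The two operand forms of scn in a Pattern color space (a pattern name alone for a colored pattern, or color components followed by a pattern name for an uncolored one) can be told apart as in the Python sketch below. The function name split_scn_operands is hypothetical, operands are assumed to arrive as a list with the pattern name as a string, and only the three device color spaces are covered as underlying spaces.

```python
# Number of components in the underlying color spaces this sketch knows about.
COMPONENTS = {"DeviceGray": 1, "DeviceRGB": 3, "DeviceCMYK": 4}


def split_scn_operands(operands, underlying=None):
    """Split scn operands into (components, pattern_name).

    underlying -- name of the underlying color space for an uncolored
                  tiling pattern, or None for a colored pattern
    """
    *components, name = operands
    if not isinstance(name, str):
        raise ValueError("the last operand must be a pattern name")
    expected = 0 if underlying is None else COMPONENTS[underlying]
    if len(components) != expected:
        raise ValueError(
            f"expected {expected} color component(s), got {len(components)}"
        )
    return tuple(components), name


print(split_scn_operands([0.30, 0.75, 0.21, "P2"], underlying="DeviceRGB"))
print(split_scn_operands(["P1"]))
```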

Example 4.20 deﬁnes an uncolored tiling pattern and then uses it to paint a rec-

tangle and a circle in different colors; Figure 4.18 shows the results.

Example 4.20

5 0 obj % Page object

<< /Type /Page

/Parent 2 0 R

/Resources 10 0 R

/Contents 20 0 R

>>

endobj

10 0 obj % Resource dictionary for page

<< /ProcSet [/PDF]

/ColorSpace << /Cs9 12 0 R >>

/Pattern << /P1 15 0 R >>

>>

endobj

12 0 obj % Color space

[/Pattern /DeviceGray]

endobj

15 0 obj % Pattern deﬁnition

<< /Type /Pattern

/PatternType 1 % Tiling pattern

/PaintType 2 % Uncolored

/TilingType 1

/BBox [-12 -12 12 12]

/XStep 30

/YStep 30

/Resources 16 0 R

/Length 95

>>

stream

0.000 12.000 m % Construct star-shaped path

-7.053 -9.708 l

11.413 3.708 l

-11.413 3.708 l

7.053 -9.708 l

f % Fill with current color

endstream

endobj

16 0 obj % Resource dictionary for pattern

<< /ProcSet [/PDF] >>

endobj

20 0 obj % Contents of page

<< /Length 243 >>

stream

0.9 g % Set nonstroking color to light gray

140 210 170 -100 re % Construct rectangular path

f % Fill with light gray

/Cs9 cs % Set pattern color space

1.0 /P1 scn % Set ﬁll pattern and underlying color (white)

140 110 170 100 re % Construct rectangular path

f % Fill with white stars

0.0 /P1 scn % Set ﬁll pattern and underlying color (black)

0.0 G % Set stroking color to black

285.00 185.04 m % Construct circular path

285.00 218.16 258.12 245.04 225.00 245.04 c

191.88 245.04 165.00 218.16 165.00 185.04 c

165.00 151.92 191.88 125.04 225.00 125.04 c

258.12 125.04 285.00 151.92 285.00 185.04 c

B % Fill and stroke path

endstream

endobj

FIGURE 4.18 Output from Example 4.20

The pattern consists of a single star, which the pattern paints without specifying a

color. Most of the remarks following Example 4.19 on page 206 also apply to

Example 4.20. Additionally:

• The program paints the rectangle twice, ﬁrst with light gray and then with the

tiling pattern. To paint with the pattern, it supplies two operands to the

scn

operator: the number 1.0, denoting white in the underlying DeviceGray color

space, and the name of the pattern.

• The program paints the interior of the circle with the same pattern, but with

the underlying color set to 0.0 (black).

4.6.3 Shading Patterns

Shading patterns (PDF 1.3) provide a smooth transition between colors across an

area to be painted, independent of the resolution of any particular output device

and without specifying the number of steps in the color transition. Patterns of

this type are described by pattern dictionaries with a pattern type of 2. Table 4.23

shows the contents of this type of dictionary.

TABLE 4.23 Entries in a type 2 pattern dictionary

KEY TYPE VALUE

Type name (Optional) The type of PDF object that this dictionary describes; if present,

must be

Pattern for a pattern dictionary.

PatternType integer (Required) A code identifying the type of pattern that this dictionary de-

scribes; must be 2 for a shading pattern.

Shading dictionary or stream (Required) A shading object (see below) defining the shading pattern’s gradient

fill. The contents of the dictionary consist of the entries in Table 4.25 on

page 216, plus those in one of Tables 4.26 to 4.31 on pages 219 to 235.

Matrix array (Optional) An array of six numbers deﬁning the pattern matrix (see

Section 4.6.1, “General Properties of Patterns”). Default value: the identity

matrix [1 0 0 1 0 0].

ExtGState dictionary (Optional) A graphics state parameter dictionary (see Section 4.3.4, “Graph-

ics State Parameter Dictionaries”) containing graphics state parameters to be

put into effect temporarily while the shading pattern is painted. Any parame-

ters that are not so speciﬁed are inherited from the graphics state that was in

effect at the beginning of the content stream in which the pattern is deﬁned

as a resource.

The most signiﬁcant additional entry is Shading, whose value is a shading object

deﬁning the properties of the shading pattern’s gradient ﬁll. This is a complex

“paint” that determines the type of color transition the shading pattern produces

when painted across an area. A shading object may be a dictionary or a stream,

depending on the type of shading; the term shading dictionary will be used gener-

ically in this section to refer to either a dictionary object or the dictionary portion

of a stream object.

By setting a shading pattern as the current color in the graphics state, a PDF con-

tent stream can use it with painting operators such as

f (ﬁll), S (stroke), Tj (show

text), or

Do (paint external object) with an image mask to paint a path, character

glyph, or mask with a smooth color transition. When a shading is used in this

way, the geometry of the gradient ﬁll is independent of that of the object being

painted.

Shading Operator

When the area to be painted is a relatively simple shape whose geometry is the

same as that of the gradient ﬁll itself, the

sh operator can be used instead of the

usual painting operators.

sh accepts a shading dictionary as an operand and

applies the corresponding gradient ﬁll directly to current user space. This opera-

tor does not require the creation of a pattern dictionary or a path and works

without reference to the current color in the graphics state. Table 4.24 describes

the

sh operator.

Note: Patterns deﬁned by type 2 pattern dictionaries do not tile. To create a tiling

pattern containing a gradient ﬁll, invoke the

sh operator from within the content

stream of a type 1 (tiling) pattern.

TABLE 4.24 Shading operator

OPERANDS OPERATOR DESCRIPTION

name sh (PDF 1.3) Paint the shape and color shading described by a shading dictionary, sub-

ject to the current clipping path. The current color in the graphics state is neither

used nor altered. The effect is different from that of painting a path using a shading

pattern as the current color.

name is the name of a shading dictionary resource in the Shading subdictionary of

the current resource dictionary (see Section 3.7.2, “Resource Dictionaries”). All co-

ordinates in the shading dictionary are interpreted relative to the current user

space. (By contrast, when a shading dictionary is used in a type 2 pattern, the

coordinates are expressed in pattern space.) All colors are interpreted in the color

space identiﬁed by the shading dictionary’s

ColorSpace entry (see Table 4.25 on

page 216). The

Background entry, if present, is ignored.

This operator should be applied only to bounded or geometrically deﬁned shad-

ings. If applied to an unbounded shading, it will paint the shading’s gradient ﬁll

across the entire clipping region, which may be time-consuming.

Shading Dictionaries

A shading dictionary speciﬁes details of a particular gradient ﬁll, including the

type of shading to be used, the geometry of the area to be shaded, and the geom-

etry of the gradient ﬁll itself. Various shading types are available, depending on

the value of the dictionary’s

ShadingType entry:

• Function-based shadings (type 1) deﬁne the color of every point in the domain

using a mathematical function (not necessarily smooth or continuous).

• Axial shadings (type 2) deﬁne a color blend along a line between two points,

optionally extended beyond the boundary points by continuing the boundary

colors.

• Radial shadings (type 3) deﬁne a blend between two circles, optionally ex-

tended beyond the boundary circles by continuing the boundary colors. This

type of shading is commonly used to represent three-dimensional spheres and

cones.

• Free-form Gouraud-shaded triangle meshes (type 4) deﬁne a common construct

used by many three-dimensional applications to represent complex colored

and shaded shapes. Vertices are speciﬁed in free-form geometry.

• Lattice-form Gouraud-shaded triangle meshes (type 5) are based on the same

geometrical construct as type 4, but with vertices speciﬁed as a pseudo-

rectangular lattice.

• Coons patch meshes (type 6) construct a shading from one or more color

patches, each bounded by four cubic Bézier curves.

• Tensor-product patch meshes (type 7) are similar to type 6, but with additional

control points in each patch, affording greater control over color mapping.

Table 4.25 shows the entries that all shading dictionaries share in common;

entries speciﬁc to particular shading types are described in the relevant sections

below.

Note: The term target coordinate space, used in many of the following descriptions,

refers to the coordinate space into which a shading is painted. For shadings used with

a type 2 pattern dictionary, this is the pattern coordinate space, discussed in

Section 4.6.1, “General Properties of Patterns.” For shadings used directly with the sh operator, it is the current user space.

TABLE 4.25 Entries common to all shading dictionaries

KEY TYPE VALUE

ShadingType integer (Required) The shading type:

    1 Function-based shading
    2 Axial shading
    3 Radial shading
    4 Free-form Gouraud-shaded triangle mesh
    5 Lattice-form Gouraud-shaded triangle mesh
    6 Coons patch mesh
    7 Tensor-product patch mesh

ColorSpace name or array (Required) The color space in which color values are expressed. This may be any device, CIE-based, or special color space except a Pattern space. See “Color Space: Special Considerations,” below, for further information.

Background array (Optional) An array of color components appropriate to the color space, specifying a single background color value. If present, this color is used before any painting operation involving the shading, to fill the entire area to be painted. The effect is as if the painting operation were performed twice: first with the background color and then again with the shading.

BBox rectangle (Optional) An array of four numbers giving the left, bottom, right, and top coordinates, respectively, of the shading's bounding box. The coordinates are interpreted in the shading's target coordinate space. If present, this bounding box is applied as a temporary clipping boundary when the shading is painted, in addition to the current clipping path and any other clipping boundaries in effect at that time.

AntiAlias boolean (Optional) A flag indicating whether to filter the shading function to prevent aliasing artifacts. The shading operators sample shading functions at a rate determined by the resolution of the output device. Aliasing can occur if the function is not smooth (that is, if it has a high spatial frequency relative to the sampling rate). Anti-aliasing can be computationally expensive and is usually unnecessary, since most shading functions are smooth enough, or are sampled at a high enough frequency, to avoid aliasing effects. Anti-aliasing may not be implemented on some output devices, in which case this flag is ignored. Default value: false.

Shading types 4 to 7 are deﬁned by a stream containing descriptive data charac-

terizing the shading’s gradient ﬁll. In these cases, the shading dictionary is also a

stream dictionary and can contain any of the standard entries common to all

streams (see Table 3.4 on page 35). In particular, it will always include a

Length

entry, which is required for all streams.

In addition, some shading dictionaries also include a

Function entry whose value

is a function (dictionary or stream) deﬁning how colors vary across the area to be

shaded. In such cases, the shading dictionary usually deﬁnes the geometry of the

shading, while the function deﬁnes the color transitions across that geometry.

The function is required for some types of shading and optional for others. Func-

tions are described in detail in Section 3.9, “Functions.”

Note: Discontinuous color transitions, or those with high spatial frequency, may ex-

hibit aliasing effects when painted at low effective resolutions.

Color Space: Special Considerations

Conceptually, a shading determines a color value for each individual point within

the area to be painted. In practice, however, the shading may actually be used to

compute color values only for some subset of the points in the target area, with

the colors of the intervening points determined by interpolation between the

ones computed. Viewer applications are free to use this strategy as long as the

interpolated color values approximate those deﬁned by the shading to within the

smoothness tolerance speciﬁed in the graphics state (see Section 6.5.2, “Smooth-

ness Tolerance”). The

ColorSpace entry common to all shading dictionaries not

only deﬁnes the color space in which the shading speciﬁes its color values, but

also determines the color space in which color interpolation is performed.

Note: Some shading types (4 to 7) perform interpolation on a parametric value sup-

plied as input to the shading’s color mapping function, as described in the relevant

sections below. This form of interpolation is conceptually distinct from the interpola-

tion described here, which operates on the output color values produced by the color

mapping function and takes place within the shading’s target color space.

Gradient ﬁlls between colors deﬁned by most shadings are implemented using a

variety of interpolation algorithms, and these algorithms are sensitive to the char-

acteristics of the color space. Linear interpolation, for example, may have observ-

ably different results when applied in a

DeviceCMYK color space than in a Lab

color space, even if the starting and ending colors are perceptually identical. The difference arises because the two color spaces are not linear relative to each other.

Shadings are rendered according to the following rules:

• If ColorSpace is a device color space different from the native color space of the

output device, color values in the shading will be converted to the native color

space using the standard conversion formulas described in Section 6.2, “Con-

versions among Device Color Spaces.” To optimize performance, these conver-

sions may take place at any time (either before or after any interpolation on the

color values in the shading). Thus, shadings deﬁned with device color spaces

may have color gradient ﬁlls that are less accurate and somewhat device-

dependent. (This does not apply to axial and radial shadings—shading types 2

and 3—because those shading types perform gradient ﬁll calculations on a

single variable and then convert to parametric colors.)

• If ColorSpace is a CIE-based color space, all gradient ﬁll calculations will be

performed in that space. Conversion to device colors will occur only after all

interpolation calculations have been performed. Thus, the color gradients will

be device-independent for the colors generated at each point.

• If ColorSpace is a Separation or DeviceN color space and the speciﬁed color-

ants are supported, no color conversion calculations are needed. If the speciﬁed

colorants are not supported (so that the space’s alternate color space must be

used), gradient ﬁll calculations will be performed in the designated

Separation

or DeviceN color space before conversion to the alternate space. Thus, non-

linear tint transformation functions will be accommodated for the best possi-

ble representation of the shading.

• If ColorSpace is an Indexed color space, all color values speciﬁed in the shading

will be immediately converted to the base color space. Depending on whether

the base color space is a device or CIE-based space, gradient ﬁll calculations

will be performed as stated above. Interpolation never occurs in an

Indexed

color space, which is quantized and therefore inappropriate for calculations

that assume a continuous range of colors. For similar reasons, an

Indexed color

space is not allowed in any shading whose color values are generated by a func-

tion; this applies to any shading dictionary that contains a

Function entry.
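The first rule above allows device-space conversion to happen either before or after interpolation, and the two orders can give different colors. The following Python sketch illustrates this using the DeviceCMYK-to-DeviceRGB conversion of Section 6.2 (each RGB component is 1 minus the clamped sum of the corresponding CMY component and black). The function names and the sample colors are illustrative assumptions, not part of the specification.

```python
def cmyk_to_rgb(c, m, y, k):
    """Standard DeviceCMYK -> DeviceRGB conversion (Section 6.2)."""
    return tuple(1.0 - min(1.0, comp + k) for comp in (c, m, y))


def lerp(a, b, u):
    """Component-wise linear interpolation between color tuples a and b."""
    return tuple(p + (q - p) * u for p, q in zip(a, b))


# Hypothetical endpoint colors of a gradient, in DeviceCMYK.
start = (0.8, 0.0, 0.0, 0.6)
end = (0.0, 0.0, 0.0, 0.0)

# Midpoint computed in CMYK, then converted to RGB.
interp_then_convert = cmyk_to_rgb(*lerp(start, end, 0.5))

# Endpoints converted to RGB first, then interpolated.
convert_then_interp = lerp(cmyk_to_rgb(*start), cmyk_to_rgb(*end), 0.5)

print(interp_then_convert, convert_then_interp)
```

The red components of the two midpoints differ because the clamp in the conversion is not linear; this is the kind of device dependence the rule warns about.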

Shading Types

In addition to the entries listed in Table 4.25 on page 216, all shading dictionaries

have entries specific to the type of shading they represent, as indicated by the value of their ShadingType key. The following sections describe the available

shading types and the dictionary entries speciﬁc to each.

Type 1 (Function-Based) Shadings

In type 1 (function-based) shadings, the color at every point in the domain is

deﬁned by a speciﬁed mathematical function. The function need not be smooth

or continuous. This is the most general of the available shading types, and is use-

ful for shadings that cannot be adequately described with any of the other types.

Table 4.26 shows the shading dictionary entries speciﬁc to this type of shading, in

addition to those common to all shading dictionaries (Table 4.25 on page 216).

Note: This type of shading may not be used with an Indexed color space.

TABLE 4.26 Additional entries specific to a type 1 shading dictionary

KEY TYPE VALUE

Domain array (Optional) An array of four numbers [x_min x_max y_min y_max] specifying the rectangular domain of coordinates over which the color function(s) are defined. Default value: [0.0 1.0 0.0 1.0].

Matrix array (Optional) A transformation matrix mapping the coordinate space specified by the Domain entry into the shading's target coordinate space. For example, to map the domain rectangle [0.0 1.0 0.0 1.0] to a 1-inch square with lower-left corner at coordinates (100, 100) in default user space, the Matrix value would be [72 0 0 72 100 100]. Default value: the identity matrix [1 0 0 1 0 0].

Function function (Required) A 2-in, n-out function or an array of n 2-in, 1-out functions (where n is the number of color components in the shading dictionary's color space). Each function's domain must be a superset of that of the shading dictionary. If the value returned by the function for a given color component is out of range, it will be adjusted to the nearest valid value.

The domain rectangle (Domain) establishes an internal coordinate space for the

shading that is independent of the target coordinate space in which it is to be

painted. The color function(s) (

Function) specify the color of the shading at each

point within this domain rectangle. The transformation matrix (

Matrix) then

maps the domain rectangle into a corresponding rectangle or parallelogram in

the target coordinate space. Points within the shading’s bounding box (

BBox)

that fall outside this transformed domain rectangle will be painted with the shading's background color (Background); if the shading dictionary has no Background entry, such points will be left unpainted. If the function is undefined

at any point within the declared domain rectangle, an error may occur, even if the

corresponding transformed point falls outside the shading’s bounding box.
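As a small illustration of how Matrix carries the domain rectangle into the target coordinate space, the Python sketch below applies a six-element PDF matrix [a b c d e f] to the corners of a domain rectangle, using the usual PDF convention x′ = a·x + c·y + e, y′ = b·x + d·y + f. The helper names are assumptions made for this example.

```python
def apply_matrix(matrix, x, y):
    """Transform (x, y) by a PDF matrix [a b c d e f]."""
    a, b, c, d, e, f = matrix
    return (a * x + c * y + e, b * x + d * y + f)


def domain_corners(domain):
    """Corners of the domain rectangle [x_min x_max y_min y_max]."""
    xmin, xmax, ymin, ymax = domain
    return [(xmin, ymin), (xmax, ymin), (xmax, ymax), (xmin, ymax)]


# Values taken from the Matrix example in Table 4.26.
matrix = [72, 0, 0, 72, 100, 100]
corners = [apply_matrix(matrix, x, y)
           for x, y in domain_corners([0.0, 1.0, 0.0, 1.0])]
print(corners)
```

With the matrix from Table 4.26 the unit domain lands on a 72-unit (1-inch) square whose lower-left corner is at (100, 100); a matrix with nonzero b or c would produce a parallelogram instead.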

Type 2 (Axial) Shadings

Type 2 (axial) shadings deﬁne a color blend that varies along a linear axis between

two endpoints and extends indeﬁnitely perpendicular to that axis. The shading

may optionally be extended beyond either or both endpoints by continuing the

boundary colors indeﬁnitely. Table 4.27 shows the shading dictionary entries spe-

ciﬁc to this type of shading, in addition to those common to all shading diction-

aries (Table 4.25 on page 216).

Note: This type of shading may not be used with an Indexed color space.

TABLE 4.27 Additional entries specific to a type 2 shading dictionary

KEY TYPE VALUE

Coords array (Required) An array of four numbers [x_0 y_0 x_1 y_1] specifying the starting and ending coordinates of the axis, expressed in the shading's target coordinate space.

Domain array (Optional) An array of two numbers [t_0 t_1] specifying the limiting values of a parametric variable t. The variable is considered to vary linearly between these two values as the color gradient varies between the starting and ending points of the axis. The variable t becomes the input argument to the color function(s). Default value: [0.0 1.0].

Function function (Required) A 1-in, n-out function or an array of n 1-in, 1-out functions (where n is the number of color components in the shading dictionary's color space). The function(s) are called with values of the parametric variable t in the domain defined by the Domain entry. Each function's domain must be a superset of that of the shading dictionary. If the value returned by the function for a given color component is out of range, it will be adjusted to the nearest valid value.

Extend array (Optional) An array of two boolean values specifying whether to extend the shading beyond the starting and ending points of the axis, respectively. Default value: [false false].

The color blend is accomplished by linearly mapping each point (x, y) along the axis between the endpoints (x_0, y_0) and (x_1, y_1) to a corresponding point in the domain specified by the shading dictionary's Domain entry. The points (0, 0) and (1, 0) in the domain correspond respectively to (x_0, y_0) and (x_1, y_1) on the axis. Since all points along a line in domain space perpendicular to the line from (0, 0) to (1, 0) will have the same color, only the new value of x needs to be computed:

$$x' = \frac{(x_1 - x_0)(x - x_0) + (y_1 - y_0)(y - y_0)}{(x_1 - x_0)^2 + (y_1 - y_0)^2}$$

The value of the parametric variable t is then determined from x′ as follows:

• For 0 ≤ x′ ≤ 1, t = t_0 + (t_1 − t_0) × x′.

• For x′ < 0, if the first element of the Extend array is true, then t = t_0; otherwise, t is undefined and the point is left unpainted.

• For x′ > 1, if the second element of the Extend array is true, then t = t_1; otherwise, t is undefined and the point is left unpainted.

The resulting value of t is then passed as input to the function(s) defined by the shading dictionary's Function entry, yielding the component values of the color with which to paint the point (x, y).
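The mapping from a point to the parametric variable t can be sketched in Python as follows. The function projects the point onto the axis given by Coords, then applies the Domain and Extend rules described above. The function name, the use of None for "left unpainted", and the handling of a zero-length axis are assumptions of this sketch rather than anything the specification prescribes.

```python
def axial_t(x, y, coords, domain=(0.0, 1.0), extend=(False, False)):
    """Return t for point (x, y) in a type 2 shading, or None if unpainted."""
    x0, y0, x1, y1 = coords
    t0, t1 = domain
    dx, dy = x1 - x0, y1 - y0
    denom = dx * dx + dy * dy
    if denom == 0:
        return None  # zero-length axis: treated as unpainted in this sketch
    # Projection of (x, y) onto the axis, normalized so the endpoints map to 0 and 1.
    xp = (dx * (x - x0) + dy * (y - y0)) / denom
    if xp < 0:
        return t0 if extend[0] else None
    if xp > 1:
        return t1 if extend[1] else None
    return t0 + (t1 - t0) * xp


print(axial_t(5, 3, (0, 0, 10, 0)))
print(axial_t(-1, 0, (0, 0, 10, 0), extend=(True, False)))
```

Points that share the same projection onto the axis receive the same t, which is why the blend extends unchanged in the direction perpendicular to the axis.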

Type 3 (Radial) Shadings

Type 3 (radial) shadings deﬁne a color blend that varies between two circles.

Shadings of this type are commonly used to depict three-dimensional spheres

and cones. Table 4.28 shows the shading dictionary entries speciﬁc to this type of

shading, in addition to those common to all shading dictionaries (Table 4.25 on

page 216).

Note: This type of shading may not be used with an Indexed color space.

TABLE 4.28 Additional entries specific to a type 3 shading dictionary

KEY TYPE VALUE

Coords array (Required) An array of six numbers [x_0 y_0 r_0 x_1 y_1 r_1] specifying the centers and radii of the starting and ending circles, expressed in the shading's target coordinate space. The radii r_0 and r_1 must both be greater than or equal to 0. If one radius is 0, the corresponding circle is treated as a point; if both are 0, nothing is painted.

Domain array (Optional) An array of two numbers [t_0 t_1] specifying the limiting values of a parametric variable t. The variable is considered to vary linearly between these two values as the color gradient varies between the starting and ending circles. The variable t becomes the input argument to the color function(s). Default value: [0.0 1.0].

Function function (Required) A 1-in, n-out function or an array of n 1-in, 1-out functions (where n is the number of color components in the shading dictionary's color space). The function(s) are called with values of the parametric variable t in the domain defined by the shading dictionary's Domain entry. Each function's domain must be a superset of that of the shading dictionary. If the value returned by the function for a given color component is out of range, it will be adjusted to the nearest valid value.

Extend array (Optional) An array of two boolean values specifying whether to extend the shading beyond the starting and ending circles, respectively. Default value: [false false].

The color blend is based on a family of blend circles interpolated between the starting and ending circles that are defined by the shading dictionary's Coords entry. The blend circles are defined in terms of a subsidiary parametric variable

$$s = \frac{t - t_0}{t_1 - t_0}$$

which varies linearly between 0.0 and 1.0 as t varies across the domain from t_0 to t_1, as specified by the dictionary's Domain entry. The center and radius of each blend circle are given by the parametric equations

$$x_c(s) = x_0 + s \times (x_1 - x_0)$$

$$y_c(s) = y_0 + s \times (y_1 - y_0)$$

$$r(s) = r_0 + s \times (r_1 - r_0)$$

Each value of s between 0.0 and 1.0 determines a corresponding value of t, which

is then passed as the input argument to the function(s) deﬁned by the shading

dictionary’s

Function entry. This yields the component values of the color with

which to ﬁll the corresponding blend circle. For values of s not lying between 0.0

and 1.0, the boolean elements of the shading dictionary’s

Extend array determine

whether and how the shading will be extended. If the ﬁrst of the two elements is

true, the shading is extended beyond the deﬁned starting circle to values of s less

than 0.0; if the second element is true, the shading is extended beyond the deﬁned

ending circle to s values greater than 1.0.

Note that either of the starting and ending circles may be larger than the other. If

the shading is extended at the smaller end, the family of blend circles continues as

far as that value of s for which the radius of the blend circle r(s) = 0; if the shading

is extended at the larger end, the blend circles continue as far as that s value for

which r(s) is large enough to encompass the shading's entire bounding box (BBox). Extending the shading can thus cause painting to extend beyond the

areas deﬁned by the two circles themselves.

Conceptually, all of the blend circles are painted in order of increasing values of s,

from smallest to largest. Blend circles extending beyond the starting circle are

painted in the same color defined by the shading dictionary's Function entry for the starting circle (t = t_0, s = 0.0); those extending beyond the ending circle are painted in the color defined for the ending circle (t = t_1, s = 1.0). The painting is

opaque, with the color of each circle completely overlaying those preceding it;

thus if a point lies within more than one blend circle, its ﬁnal color will be that of

the last of the enclosing circles to be painted, corresponding to the greatest value

of s. Note the following points:

• If one of the starting and ending circles entirely contains the other, the shading

will depict a sphere.

• If neither circle contains the other, the shading will depict a cone. If the starting

circle is larger, the cone will appear to point out of the page; if the ending circle

is larger, the cone will appear to point into the page.
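The blend-circle construction and the sphere/cone distinction above can be sketched in Python as follows. The sketch computes s from t, the center and radius of the blend circle for a given s, and a simple containment check on the starting and ending circles. All function names are assumptions of this example, and the containment test (center distance no greater than the difference of the radii) is one straightforward way to express "one circle entirely contains the other".

```python
import math


def s_from_t(t, domain=(0.0, 1.0)):
    """Subsidiary variable s, which runs from 0 to 1 as t runs from t_0 to t_1."""
    t0, t1 = domain
    return (t - t0) / (t1 - t0)


def blend_circle(s, coords):
    """Center and radius (x_c, y_c, r) of the blend circle for parameter s."""
    x0, y0, r0, x1, y1, r1 = coords
    return (x0 + s * (x1 - x0), y0 + s * (y1 - y0), r0 + s * (r1 - r0))


def depicts(coords):
    """'sphere' if one circle entirely contains the other, otherwise 'cone'."""
    x0, y0, r0, x1, y1, r1 = coords
    center_distance = math.hypot(x1 - x0, y1 - y0)
    return "sphere" if center_distance <= abs(r1 - r0) else "cone"


leaf_coords = (0.0, 0.0, 0.096, 0.0, 0.0, 1.0)  # Coords from Example 4.21
print(blend_circle(0.5, leaf_coords), depicts(leaf_coords))
```

A renderer would additionally have to honor Extend and the painting order (greatest s wins where circles overlap); those steps are left out of this sketch.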

Example 4.21 paints the leaf-covered branch shown in Figure 4.19. Each leaf is

ﬁlled with the same radial shading (object number 5). The color function

(object 10) is a stitching function (described in Section 3.9.3, “Type 3 (Stitching)

Functions”) whose two subfunctions (objects 11 and 12) are both exponential

interpolation functions (see Section 3.9.2, “Type 2 (Exponential Interpolation) Functions”). Each leaf is drawn as a path and then filled with the shading, using code such as that shown in Example 4.22 (where the name Sh1 is associated with object 5 by the Shading subdictionary of the current resource dictionary; see Section 3.7.2, “Resource Dictionaries”).

FIGURE 4.19 Radial shading

Example 4.21

5 0 obj                                        % Shading dictionary
  << /ShadingType 3
     /ColorSpace /DeviceCMYK
     /Coords [0.0 0.0 0.096 0.0 0.0 1.000]     % Concentric circles
     /Function 10 0 R
     /Extend [true true]
  >>
endobj

10 0 obj                                       % Color function
  << /FunctionType 3
     /Domain [0.0 1.0]
     /Functions [11 0 R 12 0 R]
     /Bounds [0.708]
     /Encode [1.0 0.0 0.0 1.0]
  >>
endobj

11 0 obj                                       % First subfunction
  << /FunctionType 2
     /Domain [0.0 1.0]
     /C0 [0.929 0.357 1.000 0.298]
     /C1 [0.631 0.278 1.000 0.027]
     /N 1.048
  >>
endobj

12 0 obj                                       % Second subfunction
  << /FunctionType 2
     /Domain [0.0 1.0]
     /C0 [0.929 0.357 1.000 0.298]
     /C1 [0.941 0.400 1.000 0.102]
     /N 1.374
  >>
endobj
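To make the color function of Example 4.21 more concrete, the Python sketch below evaluates it for a given t. It assumes the behavior described in Section 3.9: a stitching function selects a subfunction by comparing its input with Bounds and rescales the input through the matching pair in Encode, and an exponential interpolation function returns C0 + x^N × (C1 − C0) per component. The names used here are illustrative only.

```python
def exponential(x, c0, c1, n):
    """Type 2 function: C0 + x**N * (C1 - C0), computed per component."""
    return [a + (x ** n) * (b - a) for a, b in zip(c0, c1)]


# Values copied from objects 10, 11, and 12 of Example 4.21.
SUBFUNCTIONS = [
    dict(c0=[0.929, 0.357, 1.000, 0.298], c1=[0.631, 0.278, 1.000, 0.027], n=1.048),
    dict(c0=[0.929, 0.357, 1.000, 0.298], c1=[0.941, 0.400, 1.000, 0.102], n=1.374),
]
DOMAIN = (0.0, 1.0)
BOUNDS = [0.708]
ENCODE = [1.0, 0.0, 0.0, 1.0]


def leaf_color(t):
    """CMYK color produced by the stitching function for input t."""
    t = min(max(t, DOMAIN[0]), DOMAIN[1])      # clip to the function's domain
    edges = [DOMAIN[0]] + BOUNDS + [DOMAIN[1]]
    k = 0
    while k < len(BOUNDS) and t >= BOUNDS[k]:  # pick the subdomain containing t
        k += 1
    lo, hi = edges[k], edges[k + 1]
    e0, e1 = ENCODE[2 * k], ENCODE[2 * k + 1]
    x = e0 + (t - lo) * (e1 - e0) / (hi - lo)  # encode t for subfunction k
    return exponential(x, **SUBFUNCTIONS[k])


print(leaf_color(0.0), leaf_color(0.708), leaf_color(1.0))
```

Because the first Encode pair is reversed (1.0 to 0.0), the first subfunction runs from its C1 color at t = 0 to its C0 color at the bound, where the second subfunction takes over starting from the same C0 color; the blend is therefore continuous at t = 0.708.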

Example 4.22

316.789 140.311 m                                        % Move to start of leaf
303.222 146.388 282.966 136.518 279.122 121.983 c        % Curved segment
277.322 120.182 l                                        % Straight line
285.125 122.688 291.441 121.716 298.156 119.386 c        % Curved segment
336.448 119.386 l                                        % Straight line
331.072 128.643 323.346 137.376 316.789 140.311 c        % Curved segment
W n                                                      % Set clipping path
q                                                        % Save graphics state
27.7843 0.0000 0.0000 -27.7843 310.2461 121.1521 cm      % Set matrix
/Sh1 sh                                                  % Paint shading
Q                                                        % Restore graphics state

Type 4 Shadings (Free-Form Gouraud-Shaded Triangle Meshes)

Type 4 shadings (free-form Gouraud-shaded triangle meshes) are commonly

used to represent complex colored and shaded three-dimensional shapes. The

area to be shaded is deﬁned by a path composed entirely of triangles. The color at

each vertex of the triangles is speciﬁed, and a technique known as Gouraud

interpolation is used to color the interiors. The interpolation functions deﬁning

the shading may be linear or nonlinear. Table 4.29 shows the entries specific to this type of shading dictionary, in addition to those common to all shading

dictionaries (Table 4.25 on page 216) and stream dictionaries (Table 3.4 on

page 35).
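Gouraud interpolation colors each interior point of a triangle as a weighted blend of the three vertex colors. The Python sketch below uses barycentric weights, the usual way of expressing linear interpolation across a triangle; it illustrates the idea and is not a description of any particular viewer's implementation. The function name and the None result for points outside the triangle are assumptions of this sketch.

```python
def gouraud_color(p, triangle, colors):
    """Interpolate vertex colors at point p; return None outside the triangle."""
    x, y = p
    (xa, ya), (xb, yb), (xc, yc) = triangle
    det = (yb - yc) * (xa - xc) + (xc - xb) * (ya - yc)
    if det == 0:
        raise ValueError("degenerate triangle")
    # Barycentric weights of p with respect to vertices a, b, c.
    wa = ((yb - yc) * (x - xc) + (xc - xb) * (y - yc)) / det
    wb = ((yc - ya) * (x - xc) + (xa - xc) * (y - yc)) / det
    wc = 1.0 - wa - wb
    if min(wa, wb, wc) < 0:
        return None
    # zip(*colors) groups the same component of the three vertex colors.
    return tuple(wa * ca + wb * cb + wc * cc for ca, cb, cc in zip(*colors))


tri = ((0, 0), (1, 0), (0, 1))
rgb = ((1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0))
print(gouraud_color((0.25, 0.25), tri, rgb))
```

When the shading dictionary has a Function entry, the same weights would be applied to the single parametric value at each vertex, and the function would then map the interpolated value to a color.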

TABLE 4.29 Additional entries specific to a type 4 shading dictionary

KEY TYPE VALUE

BitsPerCoordinate integer (Required) The number of bits used to represent each vertex coordinate. Valid values are 1, 2, 4, 8, 12, 16, 24, and 32.

BitsPerComponent integer (Required) The number of bits used to represent each color component. Valid values are 1, 2, 4, 8, 12, and 16.

BitsPerFlag integer (Required) The number of bits used to represent the edge flag for each vertex (see below). Valid values of BitsPerFlag are 2, 4, and 8, but only the least significant 2 bits in each flag value are used. Valid values for the edge flag itself are 0, 1, and 2.

Decode array (Required) An array of numbers specifying how to map vertex coordinates and color components into the appropriate ranges of values. The decoding method is similar to that used in image dictionaries (see “Decode Arrays” on page 252). The ranges are specified as follows:

    [x_min x_max y_min y_max c_1,min c_1,max … c_n,min c_n,max]

Note that only one pair of c values should be specified if a Function entry is present.

Function function (Optional) A 1-in, n-out function or an array of n 1-in, 1-out functions (where n is the number of color components in the shading dictionary's color space). If this entry is present, the color data for each vertex must be specified by a single parametric variable rather than by n separate color components; the designated function(s) will be called with each interpolated value of the parametric variable to determine the actual color at each point. Each input value will be forced into the range interval specified for the corresponding color component in the shading dictionary's Decode array. Each function's domain must be a superset of that interval. If the value returned by the function for a given color component is out of range, it will be adjusted to the nearest valid value.

This entry may not be used with an Indexed color space.

Unlike shading types 1 to 3, types 4 to 7 are represented as streams. Each stream

contains a sequence of vertex coordinates and color data that deﬁnes the triangle

mesh. In a type 4 shading, each vertex is speciﬁed by the following values, in the

order shown:

f x y c_1 … c_n

where

f is the vertex's edge flag (discussed below)

x and y are its horizontal and vertical coordinates

c_1 … c_n are its color components

All vertex coordinates are expressed in the shading's target coordinate space. If the shading dictionary includes a Function entry, then only a single parametric value, t, is permitted for each vertex in place of the color components c_1 … c_n.

The edge flag associated with each vertex determines the way it connects to the other vertices of the triangle mesh. A vertex v_a with an edge flag value f_a = 0 begins a new triangle, unconnected to any other. At least two more vertices (v_b and v_c) must be provided, but their edge flags will be ignored. These three vertices define a triangle (v_a, v_b, v_c), as shown in Figure 4.20.

FIGURE 4.20 Starting a new triangle in a free-form Gouraud-shaded triangle mesh

Subsequent triangles are defined by a single new vertex combined with two vertices of the preceding triangle. Given triangle (v_a, v_b, v_c), where vertex v_a precedes vertex v_b in the data stream and v_b precedes v_c, a new vertex v_d can form a new triangle on side v_bc or side v_ac, as shown in Figure 4.21. (Side v_ab is assumed to be shared with a preceding triangle and so is not available for continuing the mesh.) If the edge flag is f_d = 1 (side v_bc), the next vertex forms the triangle (v_b, v_c, v_d); if the edge flag is f_d = 2 (side v_ac), the next vertex forms the triangle (v_a, v_c, v_d). An edge flag of f_d = 0 would start a new triangle, as described above.
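The edge-flag rules can be sketched as a small Python routine that turns a sequence of (flag, vertex) pairs into triangles: flag 0 starts a new triangle from three vertices, flag 1 continues on the side formed by the last two vertices of the preceding triangle, and flag 2 continues on the side formed by its first and last vertices. The function name and the choice of ValueError for malformed input are assumptions of this sketch.

```python
def build_triangles(vertices):
    """vertices: list of (edge_flag, vertex) pairs in stream order."""
    triangles = []
    i = 0
    while i < len(vertices):
        flag, v = vertices[i]
        if flag == 0:
            if i + 2 >= len(vertices):
                raise ValueError("a vertex with f = 0 needs two more vertices")
            # The edge flags of the next two vertices are ignored.
            tri = (v, vertices[i + 1][1], vertices[i + 2][1])
            i += 3
        elif flag in (1, 2):
            if not triangles:
                raise ValueError("the first vertex must have f = 0")
            va, vb, vc = triangles[-1]
            tri = (vb, vc, v) if flag == 1 else (va, vc, v)
            i += 1
        else:
            raise ValueError("edge flag must be 0, 1, or 2")
        triangles.append(tri)
    return triangles


mesh = [(0, "a"), (0, "b"), (0, "c"), (1, "d"), (2, "e")]
print(build_triangles(mesh))
```

Note that each continuation is relative to the most recently formed triangle, so a flag of 2 after (b, c, d) reuses vertices b and d.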

FIGURE 4.21 Connecting triangles in a free-form Gouraud-shaded triangle mesh

Complex shapes can be created by using the edge flags to control the edge on which subsequent triangles are formed. Figure 4.22 shows two simple examples. Mesh 1 begins with triangle 1 and uses the following edge flags to draw each succeeding triangle:

TRIANGLE EDGE FLAG

    1      0 (for all three vertices)
    2      1
    3      1
    4      1
    5      1
    6      1
    7      2
    8      2
    9      2
    10     1
    11     1

Mesh 2 again begins with triangle 1 and uses the edge flags

TRIANGLE EDGE FLAG

    1      0 (for all three vertices)
    2      1
    3      2
    4      2
    5      2
    6      2

FIGURE 4.22 Varying the value of the edge flag to create different shapes (panels: Mesh 1, Mesh 2)

The stream must provide vertex data for a whole number of triangles with appropriate edge flags; otherwise, an error will occur.

The data for each vertex consists of the following items, reading in sequence from higher-order to lower-order bit positions:

• An edge flag, expressed in BitsPerFlag bits

• A pair of horizontal and vertical coordinates, each expressed in BitsPerCoordinate bits

• A set of n color components (where n is the number of components in the shading's color space), each expressed in BitsPerComponent bits, in the order expected by the sc operator

Each set of vertex data must occupy a whole number of bytes; if the total number

of bits required is not divisible by 8, the last data byte for each vertex is padded at

the end with extra bits, which are ignored. The coordinates and color values are

decoded according to the

Decode array in the same way as in an image dictionary

(see “Decode Arrays” on page 252).

If the shading dictionary contains a

Function entry, the color data for each vertex

must be speciﬁed by a single parametric value t, rather than by n separate color

components. All linear interpolation within the triangle mesh is done using the t

values; after interpolation, the results are passed to the function(s) speciﬁed in

the

Function entry to determine the color at each point.
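The vertex layout of a type 4 stream (edge flag, two coordinates, then color components or a single t value, padded to a whole number of bytes) can be unpacked as in the Python sketch below. It assumes the image-style decoding rule, in which a raw value of b bits is mapped to D_min + raw × (D_max − D_min) / (2^b − 1). The function and parameter names are illustrative, and the stream bytes are assumed to have already been passed through any stream filters.

```python
def read_vertices(data, bits_flag, bits_coord, bits_comp, decode, n_values):
    """Yield (flag, x, y, values) for each vertex of a type 4 shading stream.

    n_values is the number of color components, or 1 if a Function is present.
    """
    bits_per_vertex = bits_flag + 2 * bits_coord + n_values * bits_comp
    bytes_per_vertex = (bits_per_vertex + 7) // 8  # each vertex is byte-padded

    def scale(raw, bits, dmin, dmax):
        return dmin + raw * (dmax - dmin) / (2 ** bits - 1)

    for start in range(0, len(data) - bytes_per_vertex + 1, bytes_per_vertex):
        chunk = int.from_bytes(data[start:start + bytes_per_vertex], "big")
        remaining = 8 * bytes_per_vertex

        def take(bits):
            # Read the next field, from higher-order to lower-order bits.
            nonlocal remaining
            remaining -= bits
            return (chunk >> remaining) & ((1 << bits) - 1)

        flag = take(bits_flag) & 0b11  # only the low 2 bits of the flag are used
        x = scale(take(bits_coord), bits_coord, decode[0], decode[1])
        y = scale(take(bits_coord), bits_coord, decode[2], decode[3])
        values = tuple(
            scale(take(bits_comp), bits_comp, decode[4 + 2 * i], decode[5 + 2 * i])
            for i in range(n_values)
        )
        yield flag, x, y, values


sample = bytes([0x00, 0xFF, 0xFF, 0x00, 0x00, 0xFF, 0x00, 0x80])
print(list(read_vertices(sample, 8, 16, 8, [0, 100, 0, 100, 0, 1, 0, 1, 0, 1], 3)))
```

The decoded vertices could then be grouped into triangles according to their edge flags and colored by interpolation; trailing bytes that do not make up a complete vertex are simply ignored here, whereas the specification treats an incomplete triangle as an error.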

Type 5 Shadings (Lattice-Form Gouraud-Shaded Triangle Meshes)

Type 5 shadings (lattice-form Gouraud-shaded triangle meshes) are similar to

type 4, but instead of using free-form geometry, their vertices are arranged in a

pseudorectangular lattice, which is topologically equivalent to a rectangular grid.

The vertices are organized into rows, which need not be geometrically linear (see

Figure 4.23). Table 4.30 shows the shading dictionary entries speciﬁc to this type

of shading, in addition to those common to all shading dictionaries (Table 4.25

on page 216) and stream dictionaries (Table 3.4 on page 35).

FIGURE 4.23 Lattice-form triangular meshes (panels: ideal lattice, with vertices labeled (i, j), (i, j+1), (i+1, j), and (i+1, j+1); pseudorectangular lattice)

TABLE 4.30 Additional entries speciﬁc to a type 5 shading dictionary

KEY TYPE VALUE

BitsPerCoordinate integer (Required) The number of bits used to represent each vertex coordinate.

Valid values are 1, 2, 4, 8, 12, 16, 24, and 32.

BitsPerComponent integer (Required) The number of bits used to represent each color component.

Valid values are 1, 2, 4, 8, 12, and 16.

VerticesPerRow integer (Required) The number of vertices in each row of the lattice; the value

must be greater than or equal to 2. The number of rows need not be

speciﬁed.

Decode array (Required) An array of numbers specifying how to map vertex coordinates

and color components into the appropriate ranges of values. The decod-

ing method is similar to that used in image dictionaries (see “Decode Ar-

rays” on page 252). The ranges are speciﬁed as follows: