Glossary
4B
Index
–
A
compilation
of
indexes
developed
and
provided
by
each
of
the
tobacco
companies
that
were
defendants
in
the
State
of
Minnesota
vs
Philip
Morris
Inc.,
et
al.
The
4B
Index
contains
objective
indexing
information
about
the
27
million
pages
of
documents,
provided
during
litigation.
The
4B
can
be
searched
separately
by
company
at
the
Minnesota
Tobacco
Document
Depository
where
the
documents
are
now
housed.
Bates
Number
–
A
unique
identifying
number
that
was
assigned
by
each
producing
party
to
each
page
of
the
tobacco
industry
documents
when
the
documents
were
produced
in
litigation.
These
numbers
can
be
used
to
search
for
documents
in
the
4B
Index,
the
Minnesota
Select
Set,
and
on
the
Tobacco
Industry
Documents
Web
Site.
Bates
Numbers
are
either
numeric
or
alpha/numeric.
The
name
Bates
Number
comes
from
the
name
of
the
machine
–
Bates
–
that
is
used
to
stamp
numbers
onto
pages
of
documents.
GIF
Image
(Graphics
Interchange
Format)
–
A
standard
file
format
for
compressing
images
used
on
the
Internet,
because
the
images
compress
well.
It's
particularly
good
for
text,
art,
cartoons
and
line
drawings.
All
Internet
browsers
contain
software
that
allows
the
display
of
GIF
images.
GIF
images
are
used
on
this
Minnesota
Select
Set
database
to
enable
all
browsers
to
view
the
images
without
the
aid
of
plug-in
software.
Guildford
–
British
American
Tobacco
Documents
–
A
subset
of
the
Minnesota
Select
Set
that
contains
approximately
7
thousand
documents.
Out
of
a
universe
of
several
million
documents
at
that
depository,
the
attorneys
for
Minnesota
ultimately
chose
the
nearly
7
thousand
documents
that
they
felt
related
best
to
their
trial
and
to
the
population
of
Minnesota.
While
this
is
a
particularly
interesting
and
informative
cache
of
documents
with
global
relevance,
the
selection
process
means
that,
potentially,
whole
areas
of
document
types
are
not
represented.
For
instance,
according
to
attorneys
for
the
trial,
because
Minnesota
does
not
have
a
large
Hispanic
or
African
American
population,
few
documents
relating
to
marketing
to
and
targeting
of
these
groups
were
chosen.
Minnesota
Select
Set
–
The
discreet
set
of
documents,
approximately
380,000
pages,
deemed
most
relevant
by
the
attorneys
in
the
Minnesota
vs
Philip
Morris,
Inc.
et
al.
trial.
These
were
culled
by
the
attorneys
from
the
27
million
pages
that
make
up
the
total
body
of
documents
produced
during
litigation.
These
documents
are
housed
at
the
Minnesota
Tobacco
Document
Depository.
Minnesota
Tobacco
Document
Depository
–
Created
on
November
7,
1995,
during
the
Minnesota
vs
Philip
Morris,
Inc.
et
al.
trial
to
safely
house
the
paper
copies
of
the
27
million
pages
of
tobacco
industry
documents.
They
are
stored
and
sorted
by
Bates
Numbers
into
boxes.
The
depository
is
open
to
the
public
and
copies
of
particular
documents
can
be
requested
for
a
nominal
fee
directly
from
the
depository
or
by
linking
to
the
Depository's
e-mail
address
from
the
Tobacco
Industry
Documents
Web
Site.
OCR
(Optical
Character
Recognition)
–
Refers
to
the
process
of
machine
recognition
of
printed
characters.
Text
documents
scanned
into
the
computer
are
turned
into
a
bitmap,
which
creates
a
picture
of
the
text.
OCR
software
analyzes
the
light
and
dark
areas
of
the
bitmap
to
identify
alphabetic
letters
and
numeric
digits.
Once
recognized,
the
character
is
converted
into
ASCII
text.
Hand
printing
is
much
more
difficult
to
recognize
than
machine-printed
characters.
Yellowed,
worn,
old,
and
smudged
documents
are
also
quite
difficult.
Unfortunately,
the
latter
comprise
a
large
portion
of
the
tobacco
industry
documents.
|