db_load.html   [plain text]


<!--$Id: db_load.so,v 10.44 2007/10/26 15:02:55 bostic Exp $-->
<!--Copyright (c) 1997,2008 Oracle.  All rights reserved.-->
<!--See the file LICENSE for redistribution information.-->
<html>
<head>
<title>Berkeley DB: db_load</title>
<meta name="description" content="Berkeley DB: An embedded database programmatic toolkit.">
<meta name="keywords" content="embedded,database,programmatic,toolkit,btree,hash,hashing,transaction,transactions,locking,logging,access method,access methods,Java,C,C++">
</head>
<body bgcolor=white>
<table width="100%"><tr valign=top>
<td>
<b>db_load</b>
</td>
</tr></table>
<hr size=1 noshade>
<tt>
<b><pre>db_load [<b>-nTV</b>] [<b>-c name=value</b>] [<b>-f file</b>]
    [<b>-h home</b>] [<b>-P password</b>] [<b>-t btree | hash | queue | recno</b>] file
db_load [<b>-r lsn | fileid</b>] [<b>-h home</b>] [<b>-P password</b>] file</pre></b>
<b>Description</b>
<a name="2"><!--meow--></a>
<p>The db_load utility reads from the standard input and loads it
into the database <b>file</b>.  The database <b>file</b> is created if
it does not already exist.</p>
<p>The input to db_load must be in the output format specified by the
<a href="../utility/db_dump.html">db_dump</a> utility, utilities, or as specified for the <b>-T</b>
below.</p>
<p>The options are as follows:</p>
<br>
<b>-c</b><ul compact><li>Specify configuration options ignoring any value they may have based on
the input.  The command-line format is <b>name=value</b>.  See the
Supported Keywords section below for a list of keywords supported by
the <b>-c</b> option.</ul>
<b>-f</b><ul compact><li>Read from the specified <b>input</b> file instead of from the standard
input.</ul>
<b>-h</b><ul compact><li>Specify a home directory for the database environment.
<p>If a home directory is specified, the database environment is opened
using the <a href="../api_c/env_open.html#DB_INIT_LOCK">DB_INIT_LOCK</a>, <a href="../api_c/env_open.html#DB_INIT_LOG">DB_INIT_LOG</a>,
<a href="../api_c/env_open.html#DB_INIT_MPOOL">DB_INIT_MPOOL</a>, <a href="../api_c/env_open.html#DB_INIT_TXN">DB_INIT_TXN</a>, and <a href="../api_c/env_open.html#DB_USE_ENVIRON">DB_USE_ENVIRON</a>
flags to <a href="../api_c/env_open.html">DB_ENV-&gt;open</a>.  (This means that db_load can be
used to load data into databases while they are in use by other
processes.) If the <a href="../api_c/env_open.html">DB_ENV-&gt;open</a> call fails, or if no home
directory is specified, the database is still updated, but the
environment is ignored; for example, no locking is done.</p></ul>
<b>-n</b><ul compact><li>Do not overwrite existing keys in the database when loading into an
already existing database.  If a key/data pair cannot be loaded into the
database for this reason, a warning message is displayed on the standard
error output, and the key/data pair are skipped.</ul>
<b>-P</b><ul compact><li>Specify an environment password.  Although Berkeley DB utilities overwrite
password strings as soon as possible, be aware there may be a window of
vulnerability on systems where unprivileged users can see command-line
arguments or where utilities are not able to overwrite the memory
containing the command-line arguments.</ul>
<b>-r</b><ul compact><li>Reset the database's file ID or log sequence numbers (LSNs).
<p>All database pages in transactional environments contain references to
the environment's log records.  In order to copy a database into a
different database environment, database page references to the old
environment's log records must be reset, otherwise data corruption can
occur when the database is modified in the new environment.  The
<b>-r</b> <b>lsn</b> option resets a database's log sequence
numbers.</p>
<p>All databases contain an ID string used to identify the database in the
database environment cache.  If a database is copied, and used in the
same environment as another file with the same ID string, corruption can
occur.  The <b>-r</b> <b>fileid</b>  option resets a database's file
ID to a new value.</p>
<p><b>In both cases, the physical file specified by the <b>file</b> argument
is modified in-place.</b></p></ul>
<b>-T</b><ul compact><li>The <b>-T</b> option allows non-Berkeley DB applications to easily load text
files into databases.
<p>If the database to be created is of type Btree or Hash, or the keyword
<b>keys</b> is specified as set, the input must be paired lines of text,
where the first line of the pair is the key item, and the second line of
the pair is its corresponding data item.  If the database to be created
is of type Queue or Recno and the keywork <b>keys</b> is not set, the
input must be lines of text, where each line is a new data item for the
database.</p>
<p>A simple escape mechanism, where newline and backslash (\)
characters are special, is applied to the text input.  Newline characters
are interpreted as record separators.  Backslash characters in the text
will be interpreted in one of two ways: If the backslash character
precedes another backslash character, the pair will be interpreted as a
literal backslash.  If the backslash character precedes any other
character, the two characters following the backslash will be interpreted
as a hexadecimal specification of a single character; for example,
\0a is a newline character in the ASCII character set.</p>
<p>For this reason, any backslash or newline characters that naturally
occur in the text input must be escaped to avoid misinterpretation by
db_load.</p>
<p>If the <b>-T</b> option is specified, the underlying access method type
must be specified using the <b>-t</b> option.</p></ul>
<b>-t</b><ul compact><li>Specify the underlying access method.  If no <b>-t</b> option is
specified, the database will be loaded into a database of the same type
as was dumped; for example, a Hash database will be created if a Hash
database was dumped.
<p>Btree and Hash databases may be converted from one to the other.  Queue
and Recno databases may be converted from one to the other.  If the
<b>-k</b> option was specified on the call to <a href="../utility/db_dump.html">db_dump</a> then Queue
and Recno databases may be converted to Btree or Hash, with the key being
the integer record number.</p></ul>
<b>-V</b><ul compact><li>Write the library version number to the standard output, and exit.</ul>
<br>
<p>The db_load utility may be used with a Berkeley DB environment (as described for the
<b>-h</b> option, the environment variable <b>DB_HOME</b>, or
because the utility was run in a directory containing a Berkeley DB
environment).  In order to avoid environment corruption when using a
Berkeley DB environment, db_load should always be given the chance to
detach from the environment and exit gracefully.  To cause db_load
to release all environment resources and exit cleanly, send it an
interrupt signal (SIGINT).</p>
<p>The db_load utility exits 0 on success, 1 if one or more key/data
pairs were not loaded into the database because the key already existed,
and &gt;1 if an error occurs.</p>
<b>Examples</b>
<p>The db_load utility can be used to load text files into databases.
For example, the following command loads the standard UNIX
<i>/etc/passwd</i> file into a database, with the login name as the
key item and the entire password entry as the data item:</p>
<blockquote><pre>awk -F: '{print $1; print $0}' &lt; /etc/passwd |
    sed 's/\\/\\\\/g' | db_load -T -t hash passwd.db</pre></blockquote>
<p>Note that backslash characters naturally occurring in the text are escaped
to avoid interpretation as escape characters by db_load.</p>
<br><b>Environment Variables</b>
<br>
<b>DB_HOME</b><ul compact><li>If the <b>-h</b> option is not specified and the environment variable
DB_HOME is set, it is used as the path of the database home, as described
in <a href="../api_c/env_open.html">DB_ENV-&gt;open</a>.</ul>
<br>
<b>Supported Keywords</b>
The following keywords are supported for the <b>-c</b> command-line
option to the db_load utility.  See <a href="../api_c/db_open.html">DB-&gt;open</a> for further
discussion of these keywords and what values should be specified.
<p>The parenthetical listing specifies how the value part of the
<b>name=value</b> pair is interpreted.  Items listed as (boolean)
expect value to be <b>1</b> (set) or <b>0</b> (unset).  Items listed
as (number) convert value to a number.  Items listed as (string) use
the string value without modification.</p>
<br>
<b>bt_minkey (number)</b><ul compact><li>The minimum number of keys per page.</ul>
<b>chksum (boolean)</b><ul compact><li>Enable page checksums.</ul>
<b>database (string)</b><ul compact><li>The database to load.</ul>
<b>db_lorder (number)</b><ul compact><li>The byte order for integers in the stored database metadata.</ul>
<b>db_pagesize (number)</b><ul compact><li>The size of database pages, in bytes.</ul>
<b>duplicates (boolean)</b><ul compact><li>The value of the <a href="../api_c/db_set_flags.html#DB_DUP">DB_DUP</a> flag.</ul>
<b>dupsort (boolean)</b><ul compact><li>The value of the <a href="../api_c/db_set_flags.html#DB_DUPSORT">DB_DUPSORT</a> flag.</ul>
<b>extentsize (number)</b><ul compact><li>The size of database extents, in pages, for Queue databases configured
to use extents.</ul>
<b>h_ffactor (number)</b><ul compact><li>The density within the Hash database.</ul>
<b>h_nelem (number)</b><ul compact><li>The size of the Hash database.</ul>
<b>keys (boolean)</b><ul compact><li>Specify whether keys are present for Queue or Recno databases.</ul>
<b>re_len (number)</b><ul compact><li>Specify fixed-length records of the specified length.</ul>
<b>re_pad (string)</b><ul compact><li>Specify the fixed-length record pad character.</ul>
<b>recnum (boolean)</b><ul compact><li>The value of the <a href="../api_c/db_set_flags.html#DB_RECNUM">DB_RECNUM</a> flag.</ul>
<b>renumber (boolean)</b><ul compact><li>The value of the <a href="../api_c/db_set_flags.html#DB_RENUMBER">DB_RENUMBER</a> flag.</ul>
<b>subdatabase (string)</b><ul compact><li>The subdatabase to load.</ul>
<br>
</tt>
<p><font size=1>Copyright (c) 1996,2008 Oracle.  All rights reserved.</font>
</body>
</html>