faq.html   [plain text]


<!--$Id: faq.so,v 1.12 2006/09/07 19:26:44 alanb Exp $-->
<!--Copyright (c) 1997,2008 Oracle.  All rights reserved.-->
<!--See the file LICENSE for redistribution information.-->
<html>
<head>
<title>Berkeley DB Reference Guide: Replication FAQ</title>
<meta name="description" content="Berkeley DB: An embedded database programmatic toolkit.">
<meta name="keywords" content="embedded,database,programmatic,toolkit,btree,hash,hashing,transaction,transactions,locking,logging,access method,access methods,Java,C,C++">
</head>
<body bgcolor=white>
<table width="100%"><tr valign=top>
<td><b><dl><dt>Berkeley DB Reference Guide:<dd>Berkeley DB Replication</dl></b></td>
<td align=right><a href="../rep/partition.html"><img src="../../images/prev.gif" alt="Prev"></a><a href="../toc.html"><img src="../../images/ref.gif" alt="Ref"></a><a href="../rep/ex.html"><img src="../../images/next.gif" alt="Next"></a>
</td></tr></table>
<p align=center><b>Replication FAQ</b></p>
<ol>
<p><li><b>Does Berkeley DB provide support for forwarding write queries from
clients to masters?</b>
<p>No, it does not.  The Berkeley DB RPC server code could be modified to support
this functionality, but in general this protocol is left entirely to the
application.  Note, there is no reason not to use the communications
channels the application establishes for replication support to forward
database update messages to the master, since Berkeley DB does not require
those channels to be used exclusively for replication messages.
Replication Manager does not currently offer this service to the
application.</p>
<p><li><b>Can I use replication to partition my environment across
multiple sites?</b>
<p>No, this is not possible.  All replicated databases must be equally
shared by all environments in the replication group.</p>
<p><li><b>I'm running with replication but I don't see my databases
on the client.</b>
<p>This problem may be the result of the application using absolute path
names for its databases, and the pathnames are not valid on the client
system.</p>
<p><li><b>How can I distinguish Berkeley DB messages from application messages?</b>
<p>There is no way to distinguish Berkeley DB messages from application-specific
messages, nor does Berkeley DB offer any way to wrap application messages
inside of Berkeley DB messages.  Distributed applications exchanging their
own messages should either enclose Berkeley DB messages in their own wrappers,
or use separate network connections to send and receive Berkeley DB messages.
The one exception to this rule is connection information for new sites;
Berkeley DB offers a simple method for sites joining replication groups to
send connection information to the other database environments in the
group (see <a href="../../ref/rep/newsite.html">Connecting to a new site</a>
for more information).</p>
<p><li><b>How should I build my <b>send</b> function?</b>
<p>This depends on the specifics of the application.  One common way is to
write the <b>rec</b> and <b>control</b> arguments' sizes and data to
a socket connected to each remote site.  On a fast, local area net, the
simplest method is likely to be to construct broadcast messages.  Each
Berkeley DB message would be encapsulated inside an application specific
message, with header information specifying the intended recipient(s)
for the message.  This will likely require a global numbering scheme,
however, as the Berkeley DB library has to be able to send specific log
records to clients apart from the general broadcast of new log records
intended for all members of a replication group.</p>
<p><li><b>Does every one of my threads of control on the master have to
set up its own connection to every client?  And, does every one of my
threads of control on the client have to set up its own connection to
every master?</b>
<p>This is not always necessary.  In the Berkeley DB replication model, any
thread of control which modifies a database in the master environment
must be prepared to send a message to the client environments, and any
thread of control which delivers a message to a client environment must
be prepared to send a message to the master.  There are many ways in
which these requirements can be satisfied.</p>
<p>The simplest case is probably a single, multithreaded process running
on the master and clients.  The process running on the master would
require a single write connection to each client and a single read
connection from each client.  A process running on each client would
require a single read connection from the master and a single write
connection to the master.  Threads running in these processes on the
master and clients would use the same network connections to pass
messages back and forth.</p>
<p>A common complication is when there are multiple processes running on
the master and clients.  A straight-forward solution is to increase the
numbers of connections on the master -- each process running on the
master has its own write connection to each client.  However, this
requires only one additional connection for each possible client in the
master process.  The master environment still requires only a single
read connection from each client (this can be done by allocating a
separate thread of control which does nothing other than receive client
messages and forward them into the database).  Similarly, each client
still only requires a single thread of control that receives master
messages and forwards them into the database, and which also takes
database messages and forwards them back to the master.  This model
requires the networking infrastructure support many-to-one
writers-to-readers, of course.</p>
<p>If the number of network connections is a problem in the multiprocess
model, and inter-process communication on the system is inexpensive
enough, an alternative is have a single process which communicates
between the master the each client, and whenever a process'
<b>send</b> function is called, the process passes the message to the
communications process which is responsible for forwarding the message
to the appropriate client.  Alternatively, a broadcast mechanism will
simplify the entire networking infrastructure, as processes will likely
no longer have to maintain their own specific network connections.</p>
</ol>
<table width="100%"><tr><td><br></td><td align=right><a href="../rep/partition.html"><img src="../../images/prev.gif" alt="Prev"></a><a href="../toc.html"><img src="../../images/ref.gif" alt="Ref"></a><a href="../rep/ex.html"><img src="../../images/next.gif" alt="Next"></a>
</td></tr></table>
<p><font size=1>Copyright (c) 1996,2008 Oracle.  All rights reserved.</font>
</body>
</html>