Home | Trees | Indices | Help |
---|
|
This is the configuration file for the packege.
The variables PORTALS and PLUGINS together hold all sources to be included in the search index.
Author: Johannes Schwenk
Copyright: 2010, Johannes Schwenk
Version: 2.0
Date: 2010-09-15
|
|||
string |
USER_AGENT = u'Mozilla/5.0 (Windows; U; Windows NT 5.1; de; rv The user agent string the client should use to identify itself to servers. |
||
string |
PORTAL_PLUGIN_NAME = u'portal' The name of the generic plugin module name for Plone-Portals. |
||
int |
PORTAL_RETRY_WAIT = 10 If the portals server could not fulfill the request, wait for X seconds before a retry.
|
||
int |
MAX_PORTAL_RETRIES = 3 Number of retries before failure. |
||
int |
PORTAL_REQUEST_INCREMENT = 3 Number of entries to get from the portal's server for each incremental request. |
||
int |
REQUEST_TIMEOUT = 1200 Number of seconds to wait for the servers response. |
||
datetime |
LAST_QUERY_DEFAULT = datetime(1970, 1, 1) The date from which to start the querying of portals if no last update is specified, e.g. |
||
list of dict |
PORTALS = [{u'url': u'http://cmsdev.rektorat.uni-freiburg.de:2 List of portals to query. |
||
string |
PLUGIN_DIR_NAME = u'plugins' Name of the directory containing the plugins. |
||
list of dict |
PLUGINS = [{u'name': u'stb', u'url': u'http://info.verwaltung. List of plugins to load and query. |
||
int |
LOG_LEVEL = logging.DEBUG The debug level to be used by e.g. |
||
string |
LOG_FILE_DIR = u'./log' The directory of the logfile. |
||
string |
LOG_FILENAME = u'%s/getXML.log' % LOG_FILE_DIR The full path and filename of the logfile. |
||
int |
LOG_BACKUP_COUNT = 9 Number of logfile backups to keep. |
||
int |
LOG_ROLLOVER_SIZE = 0 If the logfile exceeds this size (in bytes), the logger will start a new logfile and keep up to LOG_BACKUP_COUNT old logfiles around. |
||
string |
STATE_FILE_DIR = u'./state' The directory where to save the state for portals and plugins. |
||
string |
STATE_FILE_EXT = u'dat' The extension of the state files written to STATE_FILE_DIR. |
||
string |
TEMP_DIR = u'./tmp' Name of the directory for temporary data, e.g. |
||
string |
TEMP_FILE_EXT = u'tmp' The extension of temporary files. |
||
string |
TEMPLATES_DIR = u'./templates' Name of a directory where to find templates and text snippets. |
||
string |
XML_FILENAME = u'unifr.xml' The filename of the resulting XML document ready to be fed to the parser for search index generation. |
||
string |
OUT_DIR = u'/home/schwenk/dipl/completesearch/databases/unifr' The file output file (XML_FILENAME) will be moved to this location once the retrieval process has finished successfully. |
||
bool |
ALWAYS_OUTPUT_STATS_ON_EXIT = True Whether to output the stats to stderr on exit of getXML.py, regardless of an error or warning has occured or not. |
||
string |
COMPLETION_SERVER_PROGRAM = u'./codebase/server/startCompletio Command to start the CompletionServer. |
||
list |
COMPLETION_SERVER = [COMPLETION_SERVER_PROGRAM, u'-d', u'0d', Join the arguments to start the CompletionServer so they can be passed to subprocess.call() .
|
||
string |
COMPLETION_SERVER_START_DIR = u'/home/schwenk/dipl/completesea Working directory from which to start the CompletionServer. |
||
string |
PARSER_DIR = u'/home/schwenk/dipl/completesearch/databases/unifr' The directory where the parser is located. |
||
list |
PARSER = [u'make', u'pall'] The command to start the parsing of the XML file. |
|
USER_AGENTThe user agent string the client should use to identify itself to servers.
|
MAX_PORTAL_RETRIESNumber of retries before failure. If the portals server could not
fulfill the request, wait for PORTAL_RETRY_WAIT seconds before retrying. Retry a
maximum of
|
LAST_QUERY_DEFAULTThe date from which to start the querying of portals if no last update is specified, e.g. on the first run.
See Also: BaseSyncPlugin and PortalSourceState
|
PORTALSList of portals to query. Each entry is a dictionary with
|
PLUGINSList of plugins to load and query. Each entry is a dictionary with
|
LOG_LEVELThe debug level to be used by e.g. BaseLogger. Can be one of
|
LOG_BACKUP_COUNTNumber of logfile backups to keep.
See Also: LOG_ROLLOVER_SIZE
|
STATE_FILE_DIRThe directory where to save the state for portals and plugins.
See Also: xmlgetter.state
|
STATE_FILE_EXTThe extension of the state files written to STATE_FILE_DIR. The name of the state files will be the name of the plugin or portal defined in PORTALS or PLUGINS
See Also: PortalSourceState
|
TEMP_DIRName of the directory for temporary data, e.g. retrieval data.
|
XML_FILENAMEThe filename of the resulting XML document ready to be fed to the parser for search index generation. It will be built in TEMP_DIR and on successful generation moved to OUT_DIR .
|
OUT_DIRThe file output file (XML_FILENAME) will be moved to this location once the retrieval process has finished successfully. Must be an absolute path!
|
ALWAYS_OUTPUT_STATS_ON_EXITWhether to output the stats to stderr on exit of getXML.py, regardless of an error or warning has occured or not. Useful if one wants to get notified about every completed acquisition process.
|
COMPLETION_SERVER_PROGRAMCommand to start the CompletionServer.
|
COMPLETION_SERVERJoin the arguments to start the CompletionServer so they can be passed
to
|
COMPLETION_SERVER_START_DIRWorking directory from which to start the CompletionServer.
|
PARSER_DIRThe directory where the parser is located. Absolute path!
|
PARSERThe command to start the parsing of the XML file. Also in this case (set to execute "make pall") it also rebuilds the index.
|
Home | Trees | Indices | Help |
---|
Generated by Epydoc 3.0.1 on Thu Sep 16 13:42:03 2010 | http://epydoc.sourceforge.net |