Commit 268e064e authored by Arnaud Saval's avatar Arnaud Saval

WEBLAB-650 rename old demo folder into bundle folder

git-svn-id: svn://svn.forge.objectweb.org/svnroot/weblab/trunk/WebLabApplications/bundle@2791 a803c184-d8de-4a0e-a52b-d3eef88212b6
parents
This diff is collapsed.
WEBLAB BUNDLE 1.2.2.1
OBJECTIVES
This bundle aims to gather coherent services and portlets around the WebLab platform to
demonstrate its capability in term of service integration and orchestration for
unstructured document processing and retrieval.
This bundle can crawl a local folder (toIndex) in order to analyze text based documents,
index them to finally offer access to them through a portal.
The processing capabilities are limited (only default rules for the named-entity extraction engine are used)
but it allows to have a complete processing chain and ease integration and test of
new components either on processing chain or on user interface.
This bundle is regularly released (http://weblab-project.org/index.php?title=Download)
and build nightly with latest services/portlets (see http://bamboo.ow2.org/browse/WEBLAB-BUNDLE).
CONTENT
This bundle presents an information retrieval system based on the complete WebLab architecture.
It is mainly composed of the following WebLab services:
- an homemade folder crawler able to listen and crawl the content of a given folder (http://weblab-project.org/index.php?title=Folder_Listener),
- a normaliser that will extract the text content of various files (ms-office, pdf, rtf, etc.) based on Apache Tika (http://weblab-project.org/index.php?title=Normaliser_using_Tika),
- a named entities extraction service that detects words in the document and annotate it in documents, based on gazetteer (http://weblab-project.org/index.php?title=Simple_Gazetteer),
- an indexer that will index the text content and make it searchable based on Apache SOLR (http://weblab-project.org/index.php?title=Solr_Indexer/Searcher_WebLab_Web_Service).
In addition to these services, we can found some technical services.
The demo also contains a WebLab chain,
- that chains the previously mentioned services;
and four WebLab portlets:
- a launchCrawl portlet that will launch and monitor the processing of documents with the chain,
- a search portlet that will launch query on the SOLR searcher,
- a result portlet that display the results of the query,
- a annotated document portlet that display the document annotated with the annotation added by the named entities extraction service.
PREREQUISITE
You should have a jdk1.6.25 or greater installed in order to run the WebLab demo,
JAVA_HOME must be declared and java must be available in your path
Ports 8080, 8005, 8009 (for Liferay), 8181, 8105, 8109 (for Tomcat) 8084, 7600, 7700, 7800, 7900 (for PEtALS) should be available on the computer that runs the WebLab
Your computer should have at least 3Go of RAM to run the WebLab demo but 4Go is recommended to run to process more efficiently.
Remember that WebLab is a server application and not a desktop one.
INSTALLATION
If you read this file, you already unzip the bundle archive, so installation is done.
The demo uses mainly the following ports: Liferay on 8181, Tomcat on 8080 and PEtALS on 8084.
Other ports are used to monitor shutdown/restart command, however since all is installed on one machine, there is no specific network configuration needed.
Only port 8080 needs to be opened if you want to connect an external client to the machine that support the WebLab.
LAUNCHING
1. Launch the script run.sh (Linux) or run.bat (Windows) regarding your OS, (It may takes several minutes)
2. Go with your favorite browser to http://localhost:8080/
3. Log in with email demo@weblab-project.org password demo,
4. You can now launch an indexation that will crawl and analyse the content of your "toIndex" folder.
By default, you will find inside some document about the Weblab project.
Then you can search on the acquired documents
(for example type in the search field weblab, to find document containing the word weblab)
If you want to restart a new indexing session from scratch, launch the script called reset.sh (Linux) or reset.bat (Windows).
It will clear the current index and repository and make the WebLab ready for new indexing session.
You don't need to stop the application before calling a reset.
You can stop the application by calling the stop.sh (Linux) or stop.bat (Windows).
Note that the servers may take several seconds to be stopped depending on your hardware resources.
GOING FURTHER
You may want to go further by adding some Services or Portlets to this demonstration.
To do so you can use the WebLab Development Dashboard (http://weblab-project.org/index.php?title=Developer_Dashboard)
to learn developing services, portlets and chains.
You can also ask your questions on the WebLab mailing list: user@weblab-project.org
NB: "toIndex" folder contains a file provided for testing purpose.
\ No newline at end of file
<assembly
xmlns="http://maven.apache.org/plugins/maven-assembly-plugin/assembly/1.1.2"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/plugins/maven-assembly-plugin/assembly/1.1.2 http://maven.apache.org/xsd/assembly-1.1.2.xsd">
<id>1.2.2.1</id>
<formats>
<format>zip</format>
</formats>
<includeBaseDirectory>true</includeBaseDirectory>
<fileSets>
<fileSet>
<directory>scripts</directory>
<outputDirectory>.</outputDirectory>
<filtered>true</filtered>
<fileMode>0755</fileMode>
</fileSet>
<fileSet>
<directory>configuration</directory>
<outputDirectory>configuration</outputDirectory>
</fileSet>
<fileSet>
<directory>services</directory>
<outputDirectory>${tomcat.path}</outputDirectory>
<filtered>true</filtered>
<fileMode>0755</fileMode>
</fileSet>
<fileSet>
<directory>toIndex</directory>
<outputDirectory>toIndex</outputDirectory>
</fileSet>
<fileSet>
<directory>warcs</directory>
<outputDirectory>warcs</outputDirectory>
</fileSet>
<fileSet>
<!-- Custom Liferay with installed portlets and WebLab messages -->
<directory>liferayData/data</directory>
<outputDirectory>${liferay.path}/data/</outputDirectory>
</fileSet>
<fileSet>
<!-- Remove this part if you want a "clean" Liferay -->
<directory>liferayData/html</directory>
<outputDirectory>${liferay.tomcat.path}/webapps/ROOT/html/</outputDirectory>
<fileMode>0755</fileMode>
</fileSet>
<!-- Put Commons libraries into Tomcat -->
<fileSet>
<directory>libraries/services/target/services-libs</directory>
<outputDirectory>${tomcat.path}/lib/</outputDirectory>
</fileSet>
<!-- Put Commons libraries into Liferay -->
<fileSet>
<directory>libraries/portlets/target/portlets-libs</directory>
<outputDirectory>${liferay.tomcat.path}/lib/ext</outputDirectory>
</fileSet>
<!-- Copy portlets into Liferay deploy directory -->
<fileSet>
<directory>libraries/portlets/wars</directory>
<outputDirectory>${liferay.path}/deploy/</outputDirectory>
</fileSet>
</fileSets>
<files>
<file>
<!-- README file -->
<source>README.txt</source>
</file>
<file>
<!-- Support inter pages events in portal-ext.properties -->
<source>liferayData/portal-ext.properties</source>
<outputDirectory>${liferay.tomcat.path}/webapps/ROOT/WEB-INF/classes/</outputDirectory>
</file>
<file>
<!-- Copy sa in the Petals install dir -->
<source>petals/target/sa-weblab-chain-distribution.zip</source>
<outputDirectory>${petals.path}/toInstall</outputDirectory>
</file>
</files>
<dependencySets>
<!-- Extract servers -->
<dependencySet>
<includes>
<include>org.apache.tomcat:apache-tomcat:zip</include>
<include>com.liferay:liferay-portal-tomcat:zip</include>
<include>org.ow2.petals:petals-platform:zip</include>
</includes>
<outputFileNameMapping>${artifact.artifactId}</outputFileNameMapping>
<unpack>true</unpack>
<unpackOptions>
<excludes>
<!-- Default configuration is overridden by the WebLab one -->
<!-- <exclude>${liferay.path}/conf/server.xml</exclude> -->
<exclude>${liferay.path}/data</exclude>
<exclude>${liferay.path}/data/hsql/lportal.script</exclude>
<exclude>${liferay.path}/data/hsql/lportal.properties</exclude>
<exclude>${liferay.path}/data/lucene</exclude>
<exclude>${liferay.path}/data/jackrabbit</exclude>
<exclude>${liferay.path}/data/jackrabbit</exclude>
<!-- Custom template -->
<exclude>${liferay.tomcat.path}/webapps/ROOT/html/common/themes/top_js-ext.jspf</exclude>
<exclude>${liferay.tomcat.path}/webapps/ROOT/html/themes/classic/css/main.css</exclude>
<exclude>${liferay.tomcat.path}/webapps/ROOT/html/themes/classic/images/favicon.ico</exclude>
<!-- Remove JVM -->
<exclude>${liferay.tomcat.path}/jre1.6.0_20</exclude>
<exclude>${tomcat.path}/bin/catalina.bat</exclude>
<exclude>${tomcat.path}/bin/catalina.sh</exclude>
<exclude>${tomcat.path}/conf/server.xml</exclude>
<exclude>${tomcat.path}/conf/tomcat-users.xml</exclude>
<exclude>${tomcat.path}/webapps/docs/</exclude>
<exclude>${tomcat.path}/webapps/examples/</exclude>
<exclude>${tomcat.path}/webapps/host-manager/</exclude>
<!-- Remove unused standard Liferay portlets -->
<exclude>${liferay.tomcat.path}/webapps/opensocial-portlet/</exclude>
<exclude>${liferay.tomcat.path}/webapps/mail-portlet/</exclude>
<exclude>${liferay.tomcat.path}/webapps/kaleo-web/</exclude>
<exclude>${liferay.tomcat.path}/webapps/chat-portlet/</exclude>
<exclude>${liferay.tomcat.path}/webapps/knowledge-base-portlet/</exclude>
<exclude>${liferay.tomcat.path}/webapps/social-networking-portlet/</exclude>
<exclude>${liferay.tomcat.path}/webapps/sevencogs-hook/</exclude>
<exclude>${liferay.tomcat.path}/webapps/sevencogs-theme/</exclude>
<exclude>${liferay.tomcat.path}/webapps/web-form-portlet/</exclude>
</excludes>
</unpackOptions>
<fileMode>0755</fileMode>
</dependencySet>
<!-- Put WebLab Services in Tomcat server -->
<dependencySet>
<includes>
<include>org.ow2.weblab.webservices:*:war:*</include>
</includes>
<outputDirectory>${tomcat.path}/webapps/${artifact.artifactId}</outputDirectory>
<outputFileNameMapping>${artifact.artifactId}${dashClassifier?}.${artifact.extension}</outputFileNameMapping>
<!-- Remove common libraries already existing into the server -->
<unpack>true</unpack>
<unpackOptions>
<excludes>
<exclude>WEB-INF/classes/services_configuration.xml</exclude>
<exclude>WEB-INF/lib/model-1.2.2*</exclude>
<exclude>WEB-INF/lib/extended-1.2.2*</exclude>
<exclude>WEB-INF/lib/annotator-1.2.*</exclude>
<exclude>WEB-INF/lib/dataoperator-*</exclude>
<exclude>WEB-INF/lib/log4j-*</exclude>
<exclude>WEB-INF/lib/commons-io-*</exclude>
<exclude>WEB-INF/lib/commons-logging-*</exclude>
<exclude>WEB-INF/lib/commons-codec-*</exclude>
<exclude>WEB-INF/lib/joda-time-*</exclude>
<exclude>WEB-INF/lib/xalan-*</exclude>
<exclude>WEB-INF/lib/cxf-*</exclude>
<exclude>WEB-INF/lib/aopalliance-*</exclude>
<exclude>WEB-INF/lib/spring-*</exclude>
<exclude>WEB-INF/lib/jena-2.6.4*</exclude>
<exclude>WEB-INF/lib/wsdl4j*</exclude>
<exclude>WEB-INF/lib/neethi*</exclude>
<exclude>WEB-INF/lib/rdf-helper-jena*</exclude>
<exclude>WEB-INF/lib/iri-*</exclude>
<exclude>WEB-INF/lib/stax2-api-*</exclude>
<exclude>WEB-INF/lib/xml-resolver-*</exclude>
<exclude>WEB-INF/lib/icu4j-*</exclude>
<exclude>WEB-INF/lib/asm-*</exclude>
<exclude>WEB-INF/lib/jaxb-impl-*</exclude>
<exclude>WEB-INF/lib/xml-apis-*</exclude>
<exclude>WEB-INF/lib/serializer-*</exclude>
<exclude>WEB-INF/lib/slf4j-*</exclude>
<exclude>WEB-INF/lib/xercesImpl-*</exclude>
<exclude>WEB-INF/lib/xmlschema-core-*</exclude>
<exclude>WEB-INF/lib/geronimo-javamail*</exclude>
<exclude>WEB-INF/lib/woodstox-core-asl-*</exclude>
</excludes>
</unpackOptions>
</dependencySet>
<!-- Put BC SOAP in Petals server -->
<dependencySet>
<includes>
<include>org.ow2.petals:petals-bc-soap:zip:*</include>
<include>org.ow2.petals:petals-se-pojo:zip:*</include>
</includes>
<outputDirectory>${petals.path}/toInstall</outputDirectory>
<outputFileNameMapping>${artifact.artifactId}${dashClassifier?}.${artifact.extension}</outputFileNameMapping>
</dependencySet>
</dependencySets>
</assembly>
<assembly
xmlns="http://maven.apache.org/plugins/maven-assembly-plugin/assembly/1.1.2"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/plugins/maven-assembly-plugin/assembly/1.1.2 http://maven.apache.org/xsd/assembly-1.1.2.xsd">
<id>libs</id>
<formats>
<format>dir</format>
</formats>
<includeBaseDirectory>false</includeBaseDirectory>
<dependencySets>
<dependencySet>
<outputDirectory>/</outputDirectory>
</dependencySet>
</dependencySets>
</assembly>
<project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/maven-v4_0_0.xsd">
<modelVersion>4.0.0</modelVersion>
<parent>
<groupId>org.ow2.weblab.bundle.portlet</groupId>
<artifactId>parent-portlets</artifactId>
<version>1.2.2.1-SNAPSHOT</version>
<relativePath>../parent-portlets.xml</relativePath>
</parent>
<artifactId>document-viewer-portlet</artifactId>
<packaging>war</packaging>
<dependencies>
<dependency>
<groupId>org.ow2.weblab.portlets</groupId>
<artifactId>${artifactId}</artifactId>
<version>${portlet.document-viewer-portlet.version}</version>
<type>war</type>
</dependency>
</dependencies>
</project>
<project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/maven-v4_0_0.xsd">
<modelVersion>4.0.0</modelVersion>
<parent>
<groupId>org.ow2.weblab.bundle.portlet</groupId>
<artifactId>parent-portlets</artifactId>
<version>1.2.2.1-SNAPSHOT</version>
<relativePath>../parent-portlets.xml</relativePath>
</parent>
<artifactId>weblab-facet</artifactId>
<packaging>war</packaging>
<dependencies>
<dependency>
<groupId>org.ow2.weblab.portlets</groupId>
<artifactId>${artifactId}</artifactId>
<version>${portlet.facet-portlet.version}</version>
<type>war</type>
</dependency>
</dependencies>
</project>
<project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/maven-v4_0_0.xsd">
<modelVersion>4.0.0</modelVersion>
<parent>
<groupId>org.ow2.weblab.bundle.portlet</groupId>
<artifactId>parent-portlets</artifactId>
<version>1.2.2.1-SNAPSHOT</version>
<relativePath>../parent-portlets.xml</relativePath>
</parent>
<artifactId>launch-chain-portlet</artifactId>
<packaging>war</packaging>
<dependencies>
<dependency>
<groupId>org.ow2.weblab.portlets</groupId>
<artifactId>${artifactId}</artifactId>
<version>${portlet.launch-chain-portlet.version}</version>
<type>war</type>
</dependency>
</dependencies>
</project>
<project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/maven-v4_0_0.xsd">
<modelVersion>4.0.0</modelVersion>
<parent>
<groupId>org.ow2.weblab.bundle.portlet</groupId>
<artifactId>parent-portlets</artifactId>
<version>1.2.2.1-SNAPSHOT</version>
<relativePath>../parent-portlets.xml</relativePath>
</parent>
<artifactId>weblab-metaview</artifactId>
<packaging>war</packaging>
<dependencies>
<dependency>
<groupId>org.ow2.weblab.portlets</groupId>
<artifactId>${artifactId}</artifactId>
<version>${portlet.metadata-portlet.version}</version>
<type>war</type>
</dependency>
</dependencies>
</project>
<project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/maven-v4_0_0.xsd">
<modelVersion>4.0.0</modelVersion>
<parent>
<groupId>org.ow2.weblab.bundle</groupId>
<artifactId>versions</artifactId>
<version>1.2.2.1-SNAPSHOT</version>
<relativePath>../../versions/pom.xml</relativePath>
</parent>
<groupId>org.ow2.weblab.bundle.portlet</groupId>
<artifactId>parent-portlets</artifactId>
<packaging>pom</packaging>
<build>
<plugins>
<plugin>
<groupId>org.apache.maven.plugins</groupId>
<artifactId>maven-assembly-plugin</artifactId>
<version>2.3</version>
<executions>
<execution>
<phase>package</phase>
<goals>
<goal>single</goal>
</goals>
</execution>
</executions>
<configuration>
<descriptor>../portlet-assembly.xml</descriptor>
<finalName>${artifactId}</finalName>
<encoding>UTF-8</encoding>
<appendAssemblyId>false</appendAssemblyId>
<outputDirectory>../wars</outputDirectory>
</configuration>
</plugin>
</plugins>
</build>
</project>
\ No newline at end of file
<project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/maven-v4_0_0.xsd">
<modelVersion>4.0.0</modelVersion>
<parent>
<groupId>org.ow2.weblab.bundle</groupId>
<artifactId>versions</artifactId>
<version>1.2.2.1-SNAPSHOT</version>
<relativePath>../../versions/pom.xml</relativePath>
</parent>
<artifactId>portlets-custom</artifactId>
<packaging>pom</packaging>
<name>WebLab Portlets assembly.</name>
<description>WebLab Portlets.</description>
<modules>
<module>launch-chain</module>
<module>result</module>
<module>docview</module>
<module>search</module>
<module>metadata</module>
<module>facet</module>
</modules>
<dependencies>
<!-- Common libraries -->
<dependency>
<groupId>org.ow2.weblab.core</groupId>
<artifactId>model</artifactId>
<version>${weblab.version}</version>
<type>jar</type>
</dependency>
<dependency>
<groupId>org.ow2.weblab.core</groupId>
<artifactId>extended</artifactId>
<version>${weblab.version}</version>
<type>jar</type>
</dependency>
<dependency>
<groupId>org.ow2.weblab.core</groupId>
<artifactId>annotator</artifactId>
<version>${annotator.version}</version>
<type>jar</type>
</dependency>
<dependency>
<groupId>org.ow2.weblab.core.helpers</groupId>
<artifactId>rdf-helper-jena</artifactId>
<version>1.3.2</version>
<type>jar</type>
</dependency>
<dependency>
<groupId>org.ow2.weblab.core.helpers</groupId>
<artifactId>rdf-helper-jena-selection</artifactId>
<version>1.5.3</version>
<type>jar</type>
</dependency>
<dependency>
<groupId>org.ow2.weblab.services</groupId>
<artifactId>dataoperator</artifactId>
<version>${weblab.version}</version>
<type>jar</type>
</dependency>
<dependency>
<groupId>org.ow2.weblab.portlets</groupId>
<artifactId>weblab-portlet</artifactId>
<version>${weblab.portlet.version}</version>
<type>jar</type>
</dependency>
<dependency>
<groupId>log4j</groupId>
<artifactId>log4j</artifactId>
<version>${log4j.version}</version>
<type>jar</type>
</dependency>
<dependency>
<groupId>com.googlecode.json-simple</groupId>
<artifactId>json-simple</artifactId>
<version>${json-simple.version}</version>
<type>jar</type>
</dependency>
<dependency>
<groupId>commons-io</groupId>
<artifactId>commons-io</artifactId>
<version>${commons-io.version}</version>
<type>jar</type>
</dependency>
<dependency>
<groupId>commons-codec</groupId>
<artifactId>commons-codec</artifactId>
<version>${commons-codec.version}</version>
<type>jar</type>
</dependency>
<dependency>
<groupId>commons-logging</groupId>
<artifactId>commons-logging</artifactId>
<version>${commons-logging.version}</version>
<type>jar</type>
</dependency>
<dependency>
<groupId>joda-time</groupId>
<artifactId>joda-time</artifactId>
<version>${joda-time.version}</version>
<type>jar</type>
</dependency>
<dependency>
<groupId>xalan</groupId>
<artifactId>xalan</artifactId>
<version>${xalan.version}</version>
<type>jar</type>
</dependency>
<dependency>
<groupId>com.ibm.icu</groupId>
<artifactId>icu4j</artifactId>
<version>${icu4j.version}</version>
<type>jar</type>
</dependency>
<dependency>
<groupId>org.springframework</groupId>
<artifactId>spring-beans</artifactId>
<version>${spring.version}</version>
<type>jar</type>
</dependency>
<dependency>
<groupId>taglibs</groupId>
<artifactId>standard</artifactId>
<version>${taglib.version}</version>
<type>jar</type>
</dependency>
<dependency>
<groupId>jstl</groupId>
<artifactId>jstl</artifactId>
<version>${jstl.version}</version>
<type>jar</type>
</dependency>
</dependencies>
<build>
<finalName>portlets</finalName>
<plugins>
<plugin>
<groupId>org.apache.maven.plugins</groupId>
<artifactId>maven-assembly-plugin</artifactId>
<version>2.3</version>
<executions>
<execution>
<phase>package</phase>
<goals>
<goal>single</goal>
</goals>
</execution>
</executions>
<configuration>
<descriptor>assembly.xml</descriptor>
<finalName>portlets</finalName>
<encoding>UTF-8</encoding>
</configuration>
</plugin>
</plugins>
</build>
<repositories>
<repository>
<id>released third party</id>
<name>WebLab released third party</name>
<layout>default</layout>