Features
Main technical features :
- P2P network architecture
- Cross-platform : runs on any operating system that can run a JVM
- Parsing capabilities :
  - TXT clear text
  - CSV tabular data
  - RTF rich text
  - XML structured data
  - HTML web documents
  - RSS, RDF, Atom news feeds
  - MS Office Excel, Word, PowerPoint
  - MS Visio diagrams
  - ODF OpenDocument
  - PDF Portable Document
  - PS PostScript
  - SWF Adobe Flash
  - vCard Electronic Business Card
  - Archives : 7zip, zip, bz2, tbz, tbz2, tar.gz, rpm, jar, apk
  - Images : png, jpg, gif, svg, ico, bmp, tif, psd
  - CAD Drawings : dwg
  - MM FreeMind mind maps
  - Audio : mp3, ogg, oga, m4a, m4p, flac, wma, sid
  - Torrent metadata
- OpenSearch interface
- Load balancing
- Automated index redundancy distribution
- Direct import of external databases (Surrogate Harvester API)
- Automatic indexing through a proxy filter
- Embedded web server
- Internal domain names ending in .yacy
- P2P bootstrap from central seed lists
- Spell check
- Search filter expressions, e.g. filetype:pdf
- UTF-8 encoding
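
As a rough illustration of the filter expression feature listed above, the sketch below appends a filetype:pdf filter to a set of search terms and URL-encodes the result as UTF-8. The host, port, endpoint path, and the `query` parameter name are assumptions that do not come from this page, and `FilterQueryExample` and `buildSearchUrl` are hypothetical names, so adjust them to match the peer actually being queried.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

public class FilterQueryExample {

    // Assumed address of a locally running peer; not stated in this document.
    static final String ASSUMED_BASE = "http://localhost:8090/yacysearch.json";

    /**
     * Appends a "filetype:" filter to the search terms and URL-encodes the
     * whole query as UTF-8. The "query" parameter name is an assumption.
     */
    public static String buildSearchUrl(String base, String terms, String fileType) {
        String query = terms + " filetype:" + fileType;
        return base + "?query=" + URLEncoder.encode(query, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // Prints the URL that a client could then fetch with any HTTP library.
        System.out.println(buildSearchUrl(ASSUMED_BASE, "open source search", "pdf"));
    }
}
```

The filter is kept as part of the query string itself rather than as a separate parameter, mirroring the filetype:pdf form shown in the feature list.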
Technologies
Below are the main technologies used in the project :
- Java 11
- XHTML, CSS, JavaScript
- JSON
- Dublin Core
- External components :
  - Apache Commons Toolkit
  - Apache HttpComponents
  - Apache Jakarta Oro RegExp
  - Apache POI API for Microsoft Documents
  - Apache James Mime4j
  - Apache Lucene
  - Apache Solr
  - Apache PDFBox and FontBox
  - Apache Xerces XML Parser
  - Apache XML APIs
  - Bouncy Castle Crypto APIs : Provider, Mail
  - GlassFish Servlets
  - Guava
  - ICU International Components for Unicode
  - J7Zip
  - Java CIFS Client Library
  - Jazzy Spelling API
  - Jaudiotagger
  - JSch Java Secure Channel
  - JDBC
  - Jetty Web server
  - jQuery JavaScript library
  - JSONIC JSON encoder/decoder
  - json-simple toolkit
  - jsoup Java HTML Parser
  - language-detection library
  - metadata-extractor API
  - Mozilla charset detector
  - Noggit JSON parser
  - Restlet Framework
  - SLF4J Simple Logging Facade for Java
  - Spatial4j spatial/geospatial library
  - Stax2 API
  - TwelveMonkeys ImageIO plugins : BMP, TIFF
  - Giant Java Tree TAR Package
  - WebCat SWF Package
  - Weupnp tiny UPnP client
  - Woodstox XML processor
  - XMP Adobe's Extensible Metadata Platform
- Build and test utilities :
  - Apache Ant Building Environment
  - JRPM
  - JUnit testing framework
Issues
- Stability
- Performance
- Language support
- Very simple stemming
Converted from http://wiki.yacy.de/index.php?title=En:Features, may be outdated