Uses of Class
org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.DocumentURLFilter
Uses of WebcrawlerConnector.DocumentURLFilter in org.apache.manifoldcf.crawler.connectors.webcrawler
Fields in org.apache.manifoldcf.crawler.connectors.webcrawler declared as WebcrawlerConnector.DocumentURLFilter

  Modifier and Type: protected WebcrawlerConnector.DocumentURLFilter
  Field: WebcrawlerConnector.ProcessActivityLinkHandler.filter

Methods in org.apache.manifoldcf.crawler.connectors.webcrawler with parameters of type WebcrawlerConnector.DocumentURLFilter

  Modifier and Type: protected java.lang.String
  Method: WebcrawlerConnector.doCanonicalization(WebcrawlerConnector.DocumentURLFilter filter, WebURL url)
  Description: Code to canonicalize a URL.

  Modifier and Type: protected boolean
  Method: WebcrawlerConnector.extractLinks(java.lang.String documentIdentifier, org.apache.manifoldcf.crawler.interfaces.IProcessActivity activities, WebcrawlerConnector.DocumentURLFilter filter)
  Description: Code to extract links from an already-fetched document.

  Modifier and Type: protected java.lang.String
  Method: WebcrawlerConnector.makeDocumentIdentifier(java.lang.String parentIdentifier, java.lang.String rawURL, WebcrawlerConnector.DocumentURLFilter filter, org.apache.manifoldcf.crawler.interfaces.IHistoryActivity activities)
  Description: Convert an absolute or relative URL to a document identifier.

  Modifier and Type: protected void
  Method: WebcrawlerConnector.processDocument(org.apache.manifoldcf.crawler.interfaces.IProcessActivity activities, java.lang.String documentIdentifier, java.lang.String versionString, boolean indexDocument, java.util.Map<java.lang.String,java.util.Set<java.lang.String>> metaHash, java.lang.String[] acls, WebcrawlerConnector.DocumentURLFilter filter)

Constructors in org.apache.manifoldcf.crawler.connectors.webcrawler with parameters of type WebcrawlerConnector.DocumentURLFilter

  Constructor: ProcessActivityHTMLHandler(java.lang.String documentIdentifier, org.apache.manifoldcf.crawler.interfaces.IProcessActivity activities, WebcrawlerConnector.DocumentURLFilter filter, int metaRobotTagsUsage)
  Description: Constructor.

  Constructor: ProcessActivityLinkHandler(java.lang.String documentIdentifier, org.apache.manifoldcf.crawler.interfaces.IProcessActivity activities, WebcrawlerConnector.DocumentURLFilter filter, java.lang.String contextDescription, java.lang.String linkType)
  Description: Constructor.

  Constructor: ProcessActivityRedirectionHandler(java.lang.String documentIdentifier, org.apache.manifoldcf.crawler.interfaces.IProcessActivity activities, WebcrawlerConnector.DocumentURLFilter filter)
  Description: Constructor.

  Constructor: ProcessActivityXMLHandler(java.lang.String documentIdentifier, org.apache.manifoldcf.crawler.interfaces.IProcessActivity activities, WebcrawlerConnector.DocumentURLFilter filter)
  Description: Constructor.