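# py/cl_website_collector/docroot_processor.py
#
# Three plain `import` statements precede these; their module names are
# not recoverable from the source.
from pathlib import Path
from typing import Dict, List, Optional, Any

# Also imported from project modules whose names are not recoverable:
#   drop_privileges
#   DOCROOT_EXCLUDE_DIRS, DOCROOT_MAX_DEPTH


class DocrootProcessor:
    """
    Processes individual docroot to collect .htaccess files and metadata.
    """

    def __init__(self, logger):
        self.logger = logger

    # Four further methods follow. Their names are not recoverable; only
    # these signature details survive:
    #   1. (self, docroot, domains, username, timeout) -> Dict[str, Any]
    #      The body uses drop_privileges, DOCROOT_MAX_DEPTH and the timeout
    #      value; the rest is truncated in the source.
    #   2. Three parameters, one defaulting to DOCROOT_MAX_DEPTH; returns a List.
    #   3. Two parameters.
    #   4. Two parameters.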