Project

General

Profile

Actions

Epic DOMAINCLAW-1

closed
RA NK

Mail-Hound Prototype — Domain Probe, Redirect Tracking, Contact Extraction

Epic DOMAINCLAW-1: Mail-Hound Prototype — Domain Probe, Redirect Tracking, Contact Extraction

Added by Redmine Admin about 1 month ago. Updated about 1 month ago.

Status:
Closed
Priority:
Normal
Start date:
05/06/2026
Due date:
% Done:

100%

Estimated time:
20:00 h (Total: 30:00 h)
Spent time:

Description

Goal

Build a working end-to-end MVP flow for Mail-Hound.

The expected workflow is:

Input domains → Fast Precheck → Select domains → Deep Crawl → Review Results → Export CSV/JSON → Review Logs → Run locally and with Docker.

The MVP should allow a user to input a list of domains, quickly validate them, select valid domains for deeper crawling, extract crawl/contact data, and export the results in a structured format.

Scope

This sprint focuses on stability, usability, and producing usable output.

The goal is not to fully solve advanced anti-bot, TLS fingerprinting, or complex crawler-hardening cases in this sprint. Those improvements can be handled in a later sprint.

Expected Outcome

By the end of the sprint, the prototype should:

  • Run from the Streamlit UI.
  • Accept multiple domains as input.
  • Perform a fast precheck for each domain.
  • Allow the user to select domains for deep crawling.
  • Crawl selected domains and extract pages, redirects, and contacts.
  • Export CSV and JSON files per run.
  • Write logs per run.
  • Run locally.
  • Run with Docker Compose.

Execution Priority

  1. Fast Precheck and Domain Selection
  2. Deep Crawl Selected Domains and Extract Emails
  3. Export CSV/JSON and Per-Run Logging
  4. Verify Local and Docker Run

Final Deliverable

A usable Mail-Hound MVP that can run end-to-end and produce real crawl data, contact/email extraction results, export files, logs, and a basic Docker deployment setup.


Subtasks 4 (0 open4 closed)

Feature DOMAINCLAW-2: Fast Precheck and Domain SelectionClosedNguyen tuan kiet05/06/2026

Actions
Feature DOMAINCLAW-3: Export CSV/JSON and Per-Run LoggingClosedNguyen tuan kiet05/08/2026

Actions
Feature DOMAINCLAW-4: Verify Local and Docker RunClosedNguyen tuan kiet05/08/2026

Actions
Feature DOMAINCLAW-5: Deep Crawl Selected Domains and Extract EmailsClosedNguyen tuan kiet05/08/2026

Actions

RA Updated by Redmine Admin about 1 month ago Actions #1

  • Subtask #97 added

RA Updated by Redmine Admin about 1 month ago Actions #2

  • Subtask #98 added

RA Updated by Redmine Admin about 1 month ago Actions #3

  • Subtask #99 added

RA Updated by Redmine Admin about 1 month ago Actions #4

  • Subtask #100 added

RA Updated by Redmine Admin about 1 month ago Actions #5

  • Assignee set to Nguyen tuan kiet

NK Updated by Nguyen tuan kiet about 1 month ago Actions #6

  • Estimated time set to 20:00 h

NK Updated by Nguyen tuan kiet about 1 month ago Actions #7

  • Status changed from New to Closed
Actions

Also available in: PDF Atom