Python实现对网站目录扫描

发布时间:2019-09-10 09:20:43编辑:auto阅读(1827)

    一个很简单的版本,以后会做进一步的修改:多线程,从文件中读取,跟据Head头判断等等.

    需要提供一个网站和不存在页面的错误提示

    CODE:

    #!/usr/bin/env python
    # -*- coding:utf-8 -*-
    import urllib
    import urlparse
    import HTMLParser

    class CheckHTML(HTMLParser.HTMLParser):
        available = True
        def check_data(self,data,ErrorSym):
            error = ErrorSym
            if error in data:
                self.available = False

    target = raw_input("Please enter your target:")
    errorsyb = raw_input("Enter the errorsyb:(eg:404 Error)")
    check_list = ['admin','inc','data','include','count']

    for li in check_list:
        new_urls = urlparse.urljoin("http://"+target+"/",li)
        d = urllib.urlopen(new_urls)
        data = d.read()
        check = CheckHTML()
        check.check_data(data,errorsyb)
        if check.available:
            print "%s------------->Found!"%new_urls
        else:
            print "%s------------->Not Found!"%new_urls

    --------------------------------------------------------------------------------------------

    The Result maybe like this :

    Please enter your target:www.fuck.com
    Enter the errorsyb:(eg:404 Error)HTTP 错误 404
    http://www.fuck.com/admin------------->Found!
    http://www.fuck.com/inc------------->Not Found!
    http://www.fuck.com/data------------->Found!
    http://www.fuck.com/include------------->Found!
    http://www.fuck.com/count------------->Found!


       

关键字