Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

请问这是什么问题? #64

Open
dyytian opened this issue Nov 17, 2019 · 1 comment
Open

请问这是什么问题? #64

dyytian opened this issue Nov 17, 2019 · 1 comment

Comments

@dyytian
Copy link

dyytian commented Nov 17, 2019

dirname:E:\BaiduNetdiskDownload\bustag_win_0.2.0\bustag
Bustag server starting: version: 0.2.0

CWD: E:\BaiduNetdiskDownload\bustag_win_0.2.0\bustag
Bottle v0.12.17 server starting up (using PasteServer())...
Listening on http://0.0.0.0:8000/
Hit Ctrl-C to quit.

system error
Press Enter to continue ...
start download
args : Namespace(count='100', exclude=None, include=None, level=1, max_redirect=10, max_tasks=5, max_tries=4, no_parse_links=False, output=None, proxy=None, roots=('https://www.busdmm.work',), strict=True)
还没有训练好的模型, 无法推荐
process page 5
process page 10
process page 9
process page 3
process page 8
process page 4
process page 2
process page 7
process page 6
item NASH-184 is processed
item GNAX-017 is processed
item YOPI-001 is processed
item LMPI-014 is processed
item MASA-001 is processed
process page 12
process page 14
process page 11
item IPX-407 is processed
process page 15
item MEYD-547 is processed
item IPX-395 is processed
item IPX-393 is processed
item IPX-398 is processed
item IPX-399 is processed
item JUFE-116 is processed
item JUFE-117 is processed
item MEYD-545 is processed
item JUFE-115 is processed
item IPX-401 is processed
item JUFE-114 is processed
item JUFE-118 is processed
item MIAA-181 is processed
item EBOD-721 is processed
item IPX-394 is processed
item IPX-403 is processed
item EYAN-144 is processed
item IPX-404 is processed
process page 13
item IPX-405 is processed
item IPX-392 is processed
item IPX-406 is processed
item IPX-400 is processed
item MIAA-180 is processed
item IPX-397 is processed
item IPX-402 is processed
item MEYD-546 is processed
item IPX-396 is processed
item MEYD-548 is processed
item MEYD-544 is processed
item AP-712 is processed
item DVAJ-422 is processed
item MDBK-066 is processed
item HGOT-016 is processed
item VENU-896 is processed
item SQTE-273 is processed
item UMD-707 is processed
item SQTE-272 is processed
item SABA-575 is processed
item APOD-013 is processed
item OIGS-030 is processed
item UMD-710 is processed
item VAGU-219 is processed
item MUDR-089 is processed
item KRHK-009 is processed
item MEYD-543 is processed
item OFKU-132 is processed
item UMD-709 is processed
item VENU-895 is processed
item GMEM-001 is processed
item HJMO-418 is processed
item HJMO-419 is processed
item AVSA-109 is processed
item SABA-574 is processed
item HGOT-017 is processed
item MEYD-542 is processed
item CEAD-275 is processed
item VENU-894 is processed
item SABA-576 is processed
item ARM-814 is processed
item SPZ-1051 is processed
item TIKC-039 is processed
item SOAN-043 is processed
item UMD-708 is processed
item DVDMS-468 is processed
item ZEX-383 is processed
item VEC-392 is processed
item DOCP-184 is processed
item CHN-178 is processed
item AMA-053 is processed
item TKBN-001 is processed
2019-11-17 18:23:03,616 - bustag - ERROR - db.py - saveit
UNIQUE constraint failed: item_tag.item_id, item_tag.tag_id
Traceback (most recent call last):
File "lib\site-packages\peewee.py", line 2949, in execute_sql
sqlite3.IntegrityError: UNIQUE constraint failed: item_tag.item_id, item_tag.tag_id

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "bustag\spider\db.py", line 130, in saveit
File "lib\site-packages\peewee.py", line 6012, in create
File "lib\site-packages\peewee.py", line 6201, in save
File "lib\site-packages\peewee.py", line 1785, in inner
File "lib\site-packages\peewee.py", line 1856, in execute
File "lib\site-packages\peewee.py", line 2572, in _execute
File "lib\site-packages\peewee.py", line 2320, in _execute
File "lib\site-packages\peewee.py", line 2962, in execute
File "lib\site-packages\peewee.py", line 2956, in execute_sql
File "lib\site-packages\peewee.py", line 2732, in exit
File "lib\site-packages\peewee.py", line 183, in reraise
File "lib\site-packages\peewee.py", line 2949, in execute_sql
peewee.IntegrityError: UNIQUE constraint failed: item_tag.item_id, item_tag.tag_id
item RKI-504 is processed
item SPZ-1049 is processed
item SUJI-113 is processed
item TUS-078 is processed
item YMDD-172 is processed
item SPZ-1050 is processed
item TIKF-039 is processed
item ATOM-393 is processed
item DNW-058 is processed
2019-11-17 18:23:08,292 - aspider - WARNING - crawling.py - exit_on_empty_queue
reach count: 100, now quit
2019-11-17 18:23:08,293 - aspider - WARNING - crawling.py - crawl
closing the crawler
2019-11-17 18:23:08,296 - aspider - ERROR - crawling.py - fetch

Traceback (most recent call last):
File "lib\site-packages\aspider\crawling.py", line 276, in fetch
File "lib\site-packages\aspider\crawling.py", line 151, in parse_links
File "lib\site-packages\aiohttp\client_reqrep.py", line 969, in read
File "lib\site-packages\aiohttp\streams.py", line 359, in read
File "lib\site-packages\aiohttp\streams.py", line 381, in readany
File "lib\site-packages\aiohttp\streams.py", line 297, in _wait
concurrent.futures._base.CancelledError
2019-11-17 18:23:08,297 - aspider - WARNING - crawling.py - work
canceling the worker
2019-11-17 18:23:08,298 - aspider - WARNING - crawling.py - work
canceling the worker
2019-11-17 18:23:08,304 - aspider - WARNING - crawling.py - work
canceling the worker
*** Report ***
https://www.busdmm.work 200 text/html UTF-8 67907 61/62
https://www.busdmm.work/AMA-053 200 text/html UTF-8 57731 12/44
https://www.busdmm.work/AP-712 200 text/html UTF-8 58475 13/45
https://www.busdmm.work/APOD-013 200 text/html UTF-8 57767 12/45
https://www.busdmm.work/ARM-814 200 text/html UTF-8 61238 21/52
https://www.busdmm.work/ATOM-393 200 text/html UTF-8 55826 12/45
https://www.busdmm.work/AVSA-109 200 text/html UTF-8 56540 11/45
https://www.busdmm.work/CEAD-275 200 text/html UTF-8 60944 17/50
https://www.busdmm.work/CHN-178 200 text/html UTF-8 55068 14/46
https://www.busdmm.work/DNW-058 200 text/html UTF-8 53404 8/45
https://www.busdmm.work/DOCP-184 200 text/html UTF-8 57452 12/45
https://www.busdmm.work/DVAJ-422 200 text/html UTF-8 57755 12/46
https://www.busdmm.work/DVDMS-468 200 text/html UTF-8 60200 15/46
https://www.busdmm.work/EBOD-721 200 text/html UTF-8 52965 11/44
https://www.busdmm.work/EYAN-144 200 text/html UTF-8 53705 11/44
https://www.busdmm.work/GMEM-001 200 text/html UTF-8 59396 16/48
https://www.busdmm.work/GNAX-017 200 text/html UTF-8 55738 19/48
https://www.busdmm.work/HGOT-016 200 text/html UTF-8 49829 11/44
https://www.busdmm.work/HGOT-017 200 text/html UTF-8 50850 12/44
https://www.busdmm.work/HJMO-418 200 text/html UTF-8 51947 13/48
https://www.busdmm.work/HJMO-419 200 text/html UTF-8 57452 10/47
https://www.busdmm.work/IPX-392 200 text/html UTF-8 56593 10/48
https://www.busdmm.work/IPX-393 200 text/html UTF-8 57404 14/48
https://www.busdmm.work/IPX-394 200 text/html UTF-8 55214 11/48
https://www.busdmm.work/IPX-395 200 text/html UTF-8 55739 14/48
https://www.busdmm.work/IPX-396 200 text/html UTF-8 56253 8/47
https://www.busdmm.work/IPX-397 200 text/html UTF-8 56800 11/46
https://www.busdmm.work/IPX-398 200 text/html UTF-8 54915 12/48
https://www.busdmm.work/IPX-399 200 text/html UTF-8 55349 13/47
https://www.busdmm.work/IPX-400 200 text/html UTF-8 55651 7/46
https://www.busdmm.work/IPX-401 200 text/html UTF-8 56269 11/46
https://www.busdmm.work/IPX-402 200 text/html UTF-8 55304 10/49
https://www.busdmm.work/IPX-403 200 text/html UTF-8 57244 9/48
https://www.busdmm.work/IPX-404 200 text/html UTF-8 54809 11/48
https://www.busdmm.work/IPX-405 200 text/html UTF-8 56384 11/48
https://www.busdmm.work/IPX-406 200 text/html UTF-8 55784 8/45
https://www.busdmm.work/IPX-407 200 text/html UTF-8 56050 19/49
https://www.busdmm.work/JUFE-114 200 text/html UTF-8 59527 15/49
https://www.busdmm.work/JUFE-115 200 text/html UTF-8 55459 10/44
https://www.busdmm.work/JUFE-116 200 text/html UTF-8 56070 15/47
https://www.busdmm.work/JUFE-117 200 text/html UTF-8 56385 12/48
https://www.busdmm.work/JUFE-118 200 text/html UTF-8 55841 12/48
https://www.busdmm.work/KRHK-009 200 text/html UTF-8 55735 15/48
https://www.busdmm.work/LMPI-014 200 text/html UTF-8 52125 14/44
https://www.busdmm.work/MASA-001 200 text/html UTF-8 53294 14/44
https://www.busdmm.work/MDBK-066 200 text/html UTF-8 60968 16/48
https://www.busdmm.work/MEYD-542 200 text/html UTF-8 54584 5/48
https://www.busdmm.work/MEYD-543 200 text/html UTF-8 53532 8/48
https://www.busdmm.work/MEYD-544 200 text/html UTF-8 55701 11/47
https://www.busdmm.work/MEYD-545 200 text/html UTF-8 54167 13/48
https://www.busdmm.work/MEYD-546 200 text/html UTF-8 53726 11/47
https://www.busdmm.work/MEYD-547 200 text/html UTF-8 54809 15/48
https://www.busdmm.work/MEYD-548 200 text/html UTF-8 53734 9/48
https://www.busdmm.work/MIAA-180 200 text/html UTF-8 55311 10/48
https://www.busdmm.work/MIAA-181 200 text/html UTF-8 54819 16/48
https://www.busdmm.work/MUDR-089 200 text/html UTF-8 53588 13/47
https://www.busdmm.work/NASH-184 200 text/html UTF-8 59298 28/46
https://www.busdmm.work/OFKU-132 200 text/html UTF-8 55648 15/45
https://www.busdmm.work/OIGS-030 200 text/html UTF-8 57883 17/48
https://www.busdmm.work/RKI-504 200 text/html UTF-8 55262 15/48
https://www.busdmm.work/SABA-574 200 text/html UTF-8 60248 11/46
https://www.busdmm.work/SABA-575 200 text/html UTF-8 55051 15/46
https://www.busdmm.work/SABA-576 200 text/html UTF-8 55870 9/44
https://www.busdmm.work/SOAN-043 200 text/html UTF-8 53136 12/42
https://www.busdmm.work/SPZ-1049 200 text/html UTF-8 55585 7/44
https://www.busdmm.work/SPZ-1050 200 text/html UTF-8 54964 11/45
https://www.busdmm.work/SPZ-1051 200 text/html UTF-8 55136 12/44
https://www.busdmm.work/SQTE-272 200 text/html UTF-8 57823 10/47
https://www.busdmm.work/SQTE-273 200 text/html UTF-8 57696 14/47
https://www.busdmm.work/SUJI-113 200 text/html UTF-8 54959 11/44
https://www.busdmm.work/TIKC-039 200 text/html UTF-8 53775 11/44
https://www.busdmm.work/TIKF-039 200 text/html UTF-8 55307 11/45
https://www.busdmm.work/TKBN-001 200 text/html UTF-8 55282 10/44
https://www.busdmm.work/TUS-078 200 text/html UTF-8 57152 11/45
https://www.busdmm.work/UMD-707 200 text/html UTF-8 66187 15/48
https://www.busdmm.work/UMD-708 200 text/html UTF-8 60076 12/49
https://www.busdmm.work/UMD-709 200 text/html UTF-8 63578 13/48
https://www.busdmm.work/UMD-710 200 text/html UTF-8 59073 16/49
https://www.busdmm.work/VAGU-219 200 text/html UTF-8 57966 12/47
https://www.busdmm.work/VEC-392 200 text/html UTF-8 56356 9/47
https://www.busdmm.work/VENU-894 200 text/html UTF-8 56639 11/47
https://www.busdmm.work/VENU-895 200 text/html UTF-8 58193 12/47
https://www.busdmm.work/VENU-896 200 text/html UTF-8 56944 10/47
https://www.busdmm.work/YMDD-172 200 text/html UTF-8 56489 15/46
https://www.busdmm.work/YOPI-001 200 text/html UTF-8 54717 16/43
https://www.busdmm.work/ZEX-383 200 text/html UTF-8 53797 13/46
https://www.busdmm.work/page/1 301 redirect https://www.busdmm.work
https://www.busdmm.work/page/10 200 text/html UTF-8 69013 38/61
https://www.busdmm.work/page/11 200 text/html UTF-8 68994 33/62
https://www.busdmm.work/page/12 200 text/html UTF-8 65999 35/62
https://www.busdmm.work/page/13 200 text/html UTF-8 66832 33/62
https://www.busdmm.work/page/14 200 text/html UTF-8 64066 35/62
https://www.busdmm.work/page/15 200 text/html UTF-8 67269 34/61
https://www.busdmm.work/page/2 200 text/html UTF-8 68825 33/61
https://www.busdmm.work/page/3 200 text/html UTF-8 68961 33/61
https://www.busdmm.work/page/4 200 text/html UTF-8 68159 33/61
https://www.busdmm.work/page/5 200 text/html UTF-8 68445 33/62
https://www.busdmm.work/page/6 200 text/html UTF-8 70620 33/61
https://www.busdmm.work/page/7 200 text/html UTF-8 68353 33/61
https://www.busdmm.work/page/8 200 text/html UTF-8 68686 33/61
https://www.busdmm.work/page/9 200 text/html UTF-8 71562 33/61

   100 html

5803200 html_bytes
1 redirect
Finished 101 urls in 62.958 secs (max_tasks=5) (0.321 urls/sec/task)
Todo: 194
Done: 101
Date: Sun Nov 17 18:23:09 2019 local time

*** ALL DONE NOW ***

2019-11-17 18:23:09,341 - aspider - WARNING - crawling.py - exit_on_empty_queue
reach count: 100, now quit

@gxtrobot
Copy link
Owner

你使用的老数据库把

新版和老版数据库结构不一样

删掉数据库试试

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants