asrada

Real-time deep learning total solution for smart car / smart home on Jetson TX2

I have converted Keypoint R-CNN from Detectron/Caffe2 to Caffe: https://github.com/dedoogong/caffe-keypoint-rcnn


Light-Head R-CNN TensorFlow-to-Caffe conversion (still under test); see tf2caffe.py and weight_extractor.py. A rough weight-copy sketch follows.
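
The conversion boils down to reading each tensor out of the TensorFlow checkpoint and copying it into the matching Caffe blob, transposing convolution kernels from TF's HWIO layout to Caffe's OIHW. The snippet below is only a minimal sketch of that idea; the checkpoint path, prototxt name, and the layer-name mapping are placeholders, not the actual names used in tf2caffe.py / weight_extractor.py.

```python
# Minimal sketch of a TF-checkpoint -> Caffe weight copy (paths and layer names are placeholders).
import caffe
import tensorflow as tf

reader = tf.train.NewCheckpointReader('light_head_rcnn.ckpt')   # hypothetical checkpoint
net = caffe.Net('light_head_rcnn.prototxt', caffe.TEST)         # hypothetical prototxt

# Map TF variable names to Caffe layer names (must be filled in by hand per model).
name_map = {'conv1/weights': 'conv1', 'conv1/biases': 'conv1'}

for tf_name, caffe_name in name_map.items():
    w = reader.get_tensor(tf_name)
    if w.ndim == 4:      # conv kernel: HWIO (TF) -> OIHW (Caffe)
        net.params[caffe_name][0].data[...] = w.transpose(3, 2, 0, 1)
    elif w.ndim == 1:    # bias vector
        net.params[caffe_name][1].data[...] = w

net.save('light_head_rcnn.caffemodel')
```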


To test car/hand/face/traffic-sign detection, gesture control, or free talking, connect one USB webcam and an I2S mic/speaker (e.g. a ReSpeaker or breakout board) to the TX2.

Download asurada_detector.zip and darknet.zip, unzip both, and move the darknet folder into the asurada_detector folder.

Run r.sh; it installs everything (I referred to pyyolo):

$./r.sh

Download the pretrained models and save each one to its project's model folder, then download the detection model for darknet and all the other files (.names, .cfg, .data, .avi):

https://www.dropbox.com/sh/9r0lju9ju2nlof4/AACxeIxOOZMhrTc23p6RVXmOa?dl=0

UPDATE

I succeeded in converting the YOLOv2 model to a Caffe version and uploaded it to Naver Cloud as well. I am now trying to convert it to TensorRT (I expect a Caffe-based TensorRT engine to be faster than a TensorFlow-based one), and I am working on extracting the detection boxes in Python again. YOLOv2 caffemodel: http://naver.me/GWGiBG8R / YOLOv2 prototxt: http://naver.me/5JoGu38j
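
Building the TensorRT engine from the converted Caffe files should follow the usual Caffe-parser path. Below is a rough sketch using the Caffe parser from the TensorRT 5/6-era Python API; 'conv_reg' is a placeholder for whatever the final region layer in the converted prototxt is actually named.

```python
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

# Sketch: build a TensorRT engine directly from the converted Caffe files.
with trt.Builder(TRT_LOGGER) as builder, \
     builder.create_network() as network, \
     trt.CaffeParser() as parser:
    builder.max_workspace_size = 1 << 28
    # Parse the prototxt/caffemodel pair produced by the YOLOv2 conversion.
    blob_name_to_tensor = parser.parse(deploy='yolov2.prototxt',
                                       model='yolov2.caffemodel',
                                       network=network,
                                       dtype=trt.float32)
    # 'conv_reg' is a placeholder; mark whatever the last layer's top blob is named.
    network.mark_output(blob_name_to_tensor.find('conv_reg'))
    engine = builder.build_cuda_engine(network)
```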


I recommend using Python 3, not Python 2:

$python3 ./test_2webcam.py

Roughly, it runs at 25 FPS on the Jetson TX2 (the exact rate depends on how many objects are found, since get_region_box and NMS scale with the number of detections). A sketch of the NMS step is shown below.
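
For reference, the post-processing cost mentioned above comes mostly from non-maximum suppression over the decoded region boxes. This is a minimal NumPy sketch of greedy NMS, not the exact code in test_2webcam.py; the [x1, y1, x2, y2] box format and the 0.45 threshold are assumptions.

```python
import numpy as np

def nms(boxes, scores, iou_threshold=0.45):
    """Greedy NMS over boxes given as [x1, y1, x2, y2]; returns indices of kept boxes."""
    x1, y1, x2, y2 = boxes[:, 0], boxes[:, 1], boxes[:, 2], boxes[:, 3]
    areas = (x2 - x1) * (y2 - y1)
    order = scores.argsort()[::-1]          # highest score first
    keep = []
    while order.size > 0:
        i = order[0]
        keep.append(i)
        # Intersection of the top box with the remaining boxes.
        xx1 = np.maximum(x1[i], x1[order[1:]])
        yy1 = np.maximum(y1[i], y1[order[1:]])
        xx2 = np.minimum(x2[i], x2[order[1:]])
        yy2 = np.minimum(y2[i], y2[order[1:]])
        inter = np.maximum(0.0, xx2 - xx1) * np.maximum(0.0, yy2 - yy1)
        iou = inter / (areas[i] + areas[order[1:]] - inter)
        # Keep only boxes that overlap the winner less than the threshold.
        order = order[1:][iou < iou_threshold]
    return keep
```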

For the other components, clone each repository and follow its installation instructions:

  1. DeepSpeech : https://github.com/mozilla/DeepSpeech
  2. DeepHand : https://github.com/lmb-freiburg/hand3d
  3. DeepAlignmentNetwork (Theano): https://github.com/MarekKowalski/DeepAlignmentNetwork (if you don't want to use DAN, you can use OpenFace instead: https://github.com/TadasBaltrusaitis/OpenFace)
  4. OpenPose : https://github.com/CMU-Perceptual-Computing-Lab/openpose

For DAN, I replaced the existing Theano-based DAN with a TensorFlow-based one for optimization. Please refer to DataPrepare.py for setting up the dataset in npz format, and run DAN.py for training. I uploaded a pre-built image set in npz format to Dropbox. A sketch of the npz packing step is shown below.
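
The npz preparation is essentially packing the face crops and their landmark coordinates into one compressed archive that the training script can load in a single call. This is a minimal sketch under assumed array names and shapes (imgs, landmarks, 112x112 crops, 68 points); the real keys used by DataPrepare.py may differ.

```python
import numpy as np

# Assumed shapes: N grayscale face crops of 112x112 and N sets of 68 (x, y) landmarks.
imgs = np.zeros((1000, 112, 112), dtype=np.float32)       # placeholder image stack
landmarks = np.zeros((1000, 68, 2), dtype=np.float32)     # placeholder landmark stack

# Pack everything into one compressed .npz archive for training.
np.savez_compressed('train_set.npz', imgs=imgs, landmarks=landmarks)

# Loading it back inside the training script.
data = np.load('train_set.npz')
imgs, landmarks = data['imgs'], data['landmarks']
```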

UPDATE

It now runs at more than 25 FPS. I modified the existing network architecture heavily: I removed the entire stage-2 network and replaced every standard convolution with a depthwise-separable convolution (270 MB -> 7 MB), which makes it about 2x faster. Please use DAN_stage1_spdw.py instead of DAN.py. A sketch of such a separable block is shown below.
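
For reference, a depthwise-separable block factors a k x k convolution into a per-channel depthwise convolution followed by a 1x1 pointwise convolution, which is where the parameter savings come from. A minimal TensorFlow 1.x sketch; the layer names and sizes are illustrative, not the ones in DAN_stage1_spdw.py.

```python
import tensorflow as tf

def separable_block(x, out_channels, name):
    """Depthwise 3x3 + pointwise 1x1 replacement for a standard 3x3 conv."""
    with tf.variable_scope(name):
        # depth_multiplier=1: one 3x3 filter per input channel (the cheap spatial part),
        # followed internally by a 1x1 pointwise conv to out_channels.
        return tf.layers.separable_conv2d(x, out_channels, kernel_size=3, padding='same',
                                          depth_multiplier=1, activation=tf.nn.relu)

# Usage: x is an NHWC feature map, e.g. a batch of 112x112 grayscale face crops.
x = tf.placeholder(tf.float32, [None, 112, 112, 1])
feat = separable_block(x, 64, 'sep_conv1')
```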

For DeepHand, please download quantized_graph.pb for testing. I use only the middle part of the whole model, 'PoseNet': I removed 'HandSegNet', 'PosePriorNet', and 'ViewPointNet', and quantized the 'PoseNet' part to get a smaller, faster model. The size went from 188.4 MB (2 pickles) to 70 MB (1 frozen .pb) to 17.6 MB (1 quantized_graph.pb). A sketch of loading the frozen graph is shown below.
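
Running the stripped PoseNet boils down to importing the frozen graph and feeding a hand crop through it. A minimal TensorFlow 1.x sketch; the tensor names 'input_image:0' and 'posenet_out:0' are assumptions and need to be replaced by the real names inside quantized_graph.pb.

```python
import numpy as np
import tensorflow as tf

# Load the frozen (quantized) PoseNet graph.
graph_def = tf.GraphDef()
with tf.gfile.GFile('quantized_graph.pb', 'rb') as f:
    graph_def.ParseFromString(f.read())

with tf.Graph().as_default() as graph:
    tf.import_graph_def(graph_def, name='')

# Tensor names below are placeholders; inspect the graph to find the real ones.
inp = graph.get_tensor_by_name('input_image:0')
out = graph.get_tensor_by_name('posenet_out:0')

with tf.Session(graph=graph) as sess:
    crop = np.zeros((1, 256, 256, 3), dtype=np.float32)   # dummy hand crop
    keypoint_maps = sess.run(out, feed_dict={inp: crop})
```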

For DeepSpeech, you need to modify some sources to build it on the TX2; please read the instructions very carefully, as I had a hard time getting it to build successfully. Because of the RNN (bidirectional connections), quantization fails in both TF and TensorRT, but fortunately I can still run it on the GPU (the original version supports only the ARMv6 CPU).

TODO: more optimization for speed!

  1. replace tiny-yolo's feature extractor with a MobileNet-based darknet backbone.
  2. change inference from FP32 to FP16 (see the TensorRT sketch after this list).
  3. apply pruning.
  4. optimize the imread/resize path with NVX.

I expect that once all of these optimizations are applied it will run at more than 50 FPS on the TX2, with a model size under 10 MB and the same mAP (around 70~80).
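
On FP16 (item 2): with TensorRT on the TX2, half precision is mostly a builder flag plus a check that the platform supports fast FP16. A minimal sketch in the older TensorRT Python API used on JetPack-era boards; the network is assumed to have been populated already, e.g. by the Caffe parser shown earlier.

```python
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

def build_fp16_engine(builder, network):
    """Build a CUDA engine in FP16 if the device (e.g. TX2) supports fast half precision."""
    builder.max_workspace_size = 1 << 28
    if builder.platform_has_fast_fp16:
        builder.fp16_mode = True   # older TRT API; newer versions use BuilderFlag.FP16 instead
    return builder.build_cuda_engine(network)
```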
