Remove support for camera due to Visual Recognition API not being available anymore #180

robertoetcheverryr · 2021-10-10T14:44:35Z

As the title says, May I make a PR removing the config.js option for the camera or at least adding a comment that it's not available anymore? And the same with the ibm-credentials.env file? Maybe delete the switch case regarding the camera?

jweisz · 2021-10-21T14:21:48Z

@robertoetcheverryr I originally decided to leave it in just in case someone had an already-provisioned visual recognition service, but the likelihood of this keeps going down over time. That said, I'm not fully convinced we should remove the camera support, as it's a huge part of what makes TJBot fun! But, it's not clear to me what the other options might be. Are there any vision models we might be able to run locally on the Pi?

robertoetcheverryr · 2021-10-22T00:49:34Z

I haven't used it yet, but Google has a Vision API with 1000 monthly queries. I'm not sure if that works with the free account or with the "you must put your CC and only then you get the free tier"...

jweisz · 2021-10-22T12:59:50Z

We would not able to officially support the use of Google APIs with TJBot.

andycitron · 2021-12-13T18:18:42Z

I do have an already-provisioned visual recognition service provisioned, but it seems to have stopped working. I'm having trouble figuring out why. The reply tjbot returns says: Error: <TITLE>Error</TITLE>
An error occurred while processing your request.

Reference #30.713a2f17.1639419281.1a1a60f6

Do I have any hope of working around this?

jweisz · 2021-12-14T15:25:02Z

Likely not. There's no support for the Watson Visual Recognition service anymore as it's been discontinued, and it seems I really should go ahead with removing it from the TJBot library. That said, I haven't yet found a viable replacement, since I'd hate for TJBot to lose the ability to see. 😢

I'm definitely open to someone submitting a PR to replace Visual Recognition with something else. Preferably something on-device, though that might up the hardware requirements...

andycitron · 2021-12-24T17:47:05Z

Justin, I reimplemented my tjbot code using Microsoft Azure visual services. I uploaded the code fragments to github in case you were interested in incorporating it into your tjbot node js implementation. You can find it here:
https://github.com/andycitron/tjBotFragmentThatUsesAzureVisualServices

Note that it does introduce a dependency that the user has a Microsoft Azure account.

Also, if you want to incorporate it into tj.see(), you'd want to structure it a bit differently. tj.see() takes a photo. I did not want my Microsoft functions to have a dependency on tj.takePhoto. So I put the 'take a photo' part into the intent processing for 'see' and passed the photo into the code that uses Microsoft functions.

The code I put out there includes additional methods that invoke Microsoft facial recognition. That is not part of the 'tj.see()' functionality. That code requires pre-training of the facial recognition models. I included that because it might be useful to someone.

jweisz · 2022-01-04T15:39:53Z

Hey @andycitron, happy new year. :)

Thanks for the effort you put into TJBot, this is a really great contribution. Unfortunately, I won't be able to make this part of the official library because it uses a competitor's cloud service. But, I will put this on our Featured Recipes page to showcase your work.

andycitron · 2022-01-05T02:52:51Z

Cool. Yesterday I posted a video to Youtube illustrating how TJ works with Azure. Perhaps you want to include the 4 minute video along with the featured recipe: https://youtu.be/B92efwFqXSs

Could you give me the link to the Featured Recipes page where you referenced my code? I'd like to include a link to that on my home page. thx.

jweisz · 2022-01-05T14:56:43Z

Here's the link: https://github.com/ibmtjbot/tjbot/tree/master/featured#microsoft-azure-visual-services-by-andycitron

andycitron · 2022-05-30T15:28:42Z

Justin, Sorry to bother you, but I can’t figure out where to ask this question. It’s not a tjbot issue, just a question. Where does tjbot store the audio file it uses for speech to text? What format? I see that Microsoft Azure has ‘person voice recognition’ and I’m thinking about trying that out. Seems it wants a wav audio file as input. My tjbot gets confused when multiple speakers are talking at the same time. Those utterances usually end up being ignored by my implementation. But every once in a while it’ll try to respond. Because I know who I’m talking to (facial recognition) I can ignore utterances from a different person….or at least I can try. Do you know the answer? Or can you tell me the proper place to ask this. I actually think an implementation using multiple microphones and detecting voice based on location in the room makes sense, but that seems very hard. Thx, Andy From: Justin Weisz ***@***.*** Sent: Tuesday, January 4, 2022 10:40 AM To: ibmtjbot/tjbot Cc: andycitron; Mention Subject: Re: [ibmtjbot/tjbot] Remove support for camera due to Visual Recognition API not being available anymore (#180) Hey @andycitron<https://github.com/andycitron>, happy new year. :) Thanks for the effort you put into TJBot, this is a really great contribution. Unfortunately, I won't be able to make this part of the official library because it uses a competitor's cloud service. But, I will put this on our Featured Recipes page to showcase your work. — Reply to this email directly, view it on GitHub<#180 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AIG4X2MDFOYAVO4Q4IXE52LUUMIFLANCNFSM5FWRME3A>. Triage notifications on the go with GitHub Mobile for iOS<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675> or Android<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>. You are receiving this because you were mentioned.Message ID: ***@***.***>

jweisz · 2022-05-31T15:34:18Z

Hi @andycitron -- take a look at tjbot.js:792, where listen() is defined:
https://github.com/ibmtjbot/tjbotlib/blob/4fe0263bd0050f910752ae589d3b33cdb9cb93ae/src/tjbot.js#L792

The audio isn't stored locally, the data is streamed through a pipe between the microphone and a web socket. So it would need some modification to save the output to a file first, before uploading to the Microsoft service. Maybe check to see if their API supports WebSockets?

jweisz · 2023-01-13T15:41:52Z

Closing as this is now an issue in the tjbotlib repo: ibmtjbot/node-tjbotlib#73

jweisz closed this as completed Jan 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove support for camera due to Visual Recognition API not being available anymore #180

Remove support for camera due to Visual Recognition API not being available anymore #180

robertoetcheverryr commented Oct 10, 2021

jweisz commented Oct 21, 2021

robertoetcheverryr commented Oct 22, 2021

jweisz commented Oct 22, 2021

andycitron commented Dec 13, 2021

jweisz commented Dec 14, 2021

andycitron commented Dec 24, 2021

jweisz commented Jan 4, 2022

andycitron commented Jan 5, 2022

jweisz commented Jan 5, 2022

andycitron commented May 30, 2022 via email

jweisz commented May 31, 2022

jweisz commented Jan 13, 2023

Remove support for camera due to Visual Recognition API not being available anymore #180

Remove support for camera due to Visual Recognition API not being available anymore #180

Comments

robertoetcheverryr commented Oct 10, 2021

jweisz commented Oct 21, 2021

robertoetcheverryr commented Oct 22, 2021

jweisz commented Oct 22, 2021

andycitron commented Dec 13, 2021

jweisz commented Dec 14, 2021

andycitron commented Dec 24, 2021

jweisz commented Jan 4, 2022

andycitron commented Jan 5, 2022

jweisz commented Jan 5, 2022

andycitron commented May 30, 2022 via email

jweisz commented May 31, 2022

jweisz commented Jan 13, 2023