Re: INPUT (sound vision meetup @ CDG)

On Aug 19, 2015, at 3:40 PM, Toby Schachman wrote:

Food: for the math group I would order on Eat24, usually Mehfil Indian. I'd get like samosas, naan, saag paneer, lamb korma, chicken tikka masala in whatever quantity made sense. All the things come with rice so don't order more rice. Allow 45-60 min for delivery.

Word to the wise: “whatever quantity made sense” turns out to be roughly one main dish for every two attendees. That is to say, I ordered about twice as much food as we needed. The fridge is full of delicious Mehfil Indian food—you are welcome to as much as you can bear.

INPUT.1 turned out pretty well. About a dozen people showed up, with varied but overlapping backgrounds and interests, and we stayed pretty closely to the agenda I had prepared.

The Demo (photo by Götz Bachmann)

I tried to get down most of the proper nouns when we opened up to discussion at the end. Here’s a decoded form:

• Google’s cloud-based ASR systems are secretive and closed, but work well (they don’t give timing info).

• Mechanical Turk-based realtime ASR is now a thing, with <2sec latency.

• A few people really liked the Asus Xtion for RGB-D imaging, and think that our CV problems would be a lot easier with depth info.

• There were some good pointers for 3-d scanning from RGB-D cameras, including Kinect Fusion and a fellow from Oxford mentioned a SLAM system that, he kept saying, “does everything” but I can find no traces of the so-called Infinita system online.

• The trigram “deformation of people” is rather unfortunate. And saying “there’s a lot of literature on it” could mean more than one thing. This, of course, is in reference to our “skeleton tracking” investigations. Someone was really into this University of Kentucky research paper on the Real-time Simultaneous Pose and Shape Estimation for Articulated Objects Using a Single Depth Camera.

• Google Glass and egocentric perspective.

• I swear Neeraj was talking about research in “knowledge spaces,” but the term of art is “knowledge bases,” so either I misheard or CDG slipped into his subconscious. Neeraj thinks Luke Zettelmoyer’s research on “intersections of natural language processing, machine learning, and decision making under uncertainty” may be relevant as we progress.

• I think SEMPRE was the intended referent when discussing Percy Liang’s work. I’m imaging a booming God-like voice:

Utterance: Which college did Obama go to?

Denotation: Occidental College, Columbia University

People seemed excited about coming back next month, and some had ideas for other people who should be in our orbits. There were some interesting conversations and observations—Götz took some notes and may send out some of his impressions.

Onward!

R.M.O.

Date:	Wed, 19 Aug 2015 23:25:41 -0700
From:	Robert M Ochshorn
Subject:	Re: INPUT (sound & vision meetup @ CDG)