23 Tue

[AI ์Šค์ฟจ 1๊ธฐ] 11์ฃผ์ฐจ DAY 2

Visual Recognition

Object Detection ๋ฌธ์ œ์ •์˜

  • Classification : ์ด ์˜์ƒ์€ ์ž๋™์ฐจ ์˜์ƒ์ผ ํ™•๋ฅ ์ด ๊ฐ€์žฅ ๋†’๋‹ค

  • Classification + Localization : ์ด ์˜์ƒ์€ ์ž๋™์ฐจ ์˜์ƒ์ผ ํ™•๋ฅ ์ด ๊ฐ€์žฅ ๋†’๊ณ  ์ด ์ž๋™์ฐจ๋Š” ์ด ์œ„์น˜์ฏค ์žˆ๋‹ค

  • Object Detection : ์˜์ƒ์•ˆ์— ์žˆ๋Š” ๋ชจ๋“  ๋ฌผ์ฒด๋“ค๊ณผ ๊ทธ๊ฒƒ๋“ค์˜ ์œ„์น˜ ์˜์—ญ์„ ํ•จ๊ป˜ ์ฐพ๋Š”๋‹ค

  • ์•„๋ž˜๋กœ ๊ฐˆ์ˆ˜๋ก ๋‚œ์ด๋„๊ฐ€ ์–ด๋ ค์›€. ํŠนํžˆ Object Detection ๊ฐ™์€ ๊ฒฝ์šฐ๋Š” ๋ช‡ ๊ฐœ์˜ object๊ฐ€ ์กด์žฌํ•˜๋Š”์ง€ ์•Œ ์ˆ˜ ์—†๊ธฐ ๋•Œ๋ฌธ

  • [๋ฌผ์ฒด์˜ ์กด์žฌ, ์ค‘์‹ฌ์ ์˜ x์ขŒํ‘œ, ์ค‘์‹ฌ์ ์˜ y์ขŒํ‘œ, ๋ฌผ์ฒด์˜ ๋„ˆ๋น„, ๋ฌผ์ฒด์˜ ๋†’์ด] + [class]

  • object๋ณ„๋กœ ๋ฒกํ„ฐ๊ฐ€ ํ•˜๋‚˜์”ฉ ํ•„์š”ํ•˜๋‹ค

    • ์–ด๋А ์œ„์น˜์— Object๊ฐ€ ์žˆ๋Š”์ง€ ์ฐพ์•„์•ผํ•จ

    • Object์˜ ์˜์—ญ๊ณผ ํฌ๊ธฐ๋„ ๊ฒฐ์ •

    • Object๋ฅผ ๋ถ„๋ฅ˜ํ•ด์•ผํ•จ

Object Detection ์š”์†Œ ๊ธฐ์ˆ 

  • Region Proposal : Object๊ฐ€ ์žˆ์„๋งŒํ•œ ์˜์—ญ์„ ๋‹ค์ˆ˜ ์ถ”์ฒœ

    • ๋ชจ๋“  ์˜์—ญ์„ ๋ณผ ์ˆ˜๊ฐ€ ์—†์œผ๋ฏ€๋กœ ์žˆ์„๋งŒํ•œ ์˜์—ญ์„ ์ถ”์ฒœ๋ฐ›์Œ

  • Non-max Suppression : ๊ฒน์น˜๋Š” ์˜์—ญ์„ ์ œ๊ฑฐ

  • Classification : ์˜์—ญ์†์˜ object๋ฅผ ๋ถ„๋ฅ˜

  • Bounding Box Regression : Object์˜์—ญ์„ ๋ฏธ์„ธ์กฐ์ •

Faster RCNN ๊ธฐ๋ณธ flow

  • RCNN -> Fast RCNN -> Faster RCNN

  • ์ดˆ๊ธฐ์— Pre-trained๋œ CNN์„ ํ†ต๊ณผ์‹œ์ผœ Feature map์„ ์–ป๋Š”๋‹ค

    • Base Network๋ผ๊ณ ๋„ ๋ถ€๋ฆ„

  • ์–ป์€ feature map์„ ํ†ตํ•ด region proposal์„ ์ฐธ๊ณ ํ•˜์—ฌ ์–ป์–ด์ง„ region์„ ๊ฐ€์ง€๊ณ  ์ž…๋ ฅ๋ฒกํ„ฐ๋กœ ์„ค์ •ํ•œ๋‹ค

IoU measure

  • Intersection over Union

  • IoU = ๊ต์ง‘ํ•ฉ / ํ•ฉ์ง‘ํ•ฉ

Objectness๋ฒกํ„ฐ

  • 2๊ฐœ๋กœ ๊ตฌ์„ฑ

  • 1๊ฐœ๋Š” Object์ผ ํ™•๋ฅ , ๋‹ค๋ฅธ 1๊ฐœ๋Š” Object๊ฐ€ ์•„๋‹ ํ™•๋ฅ 

  • Ground Truth

    • ์‚ฌ๋žŒ์ด ๋ผ๋ฒจ๋ง ํ•œ ์ •๋ณด. ์ค‘์‹ฌ์ , ์œ„์น˜, ๋„ˆ๋น„ ๋“ฑ

  • Ground Truth์™€ object์˜์—ญ์˜ IoU๋ฅผ ๊ณ„์‚ฐํ•˜์—ฌ threshold๋ณด๋‹ค ํฌ๋ฉด object ์˜์—ญ์ด๋ผ๊ณ  ํŒ๋‹จ

    • ์‹ค์ œ๋กœ Ground Truth๊ฐ€ ๋„ˆ๋ฌด ํด ๊ฒฝ์šฐ object ์˜์—ญ์ด ์•„๋‹ˆ๋ผ๊ณ  ํŒ๋‹จํ•  ๊ฐ€๋Šฅ์„ฑ์ด ๋†’์Œ(์‹ค์ œ๋กœ๋Š” ๋งž์ง€๋งŒ)

  • ๊ทธ๋ ‡๋‹ค๋ฉด, ๊ต์ง‘ํ•ฉ์œผ๋กœ๋งŒ ํŒ๋‹จํ•˜๋ฉด ๋˜์ง€ ์•Š๋Š”๊ฐ€?

    • object์˜์—ญ์„ ์ฐพ๊ธฐ๋Š” ์‰ฌ์›Œ์ง„๋‹ค

    • ๊ทธ๋Ÿฌ๋‚˜ Box Boundary Regressor๊ฐ€ ํ•  ์ผ์ด ์–ด๋ ค์›Œ์ง

  • Box Boundary Regressor

    • object์˜ ๊ฒฝ๊ณ„์ƒ์ž์˜ ์ขŒํ‘œ๋ฅผ ์ฐพ๋Š” ์ผ์„ ํ•จ

Bounding Box Regressor

  • ์ด์ „์— ๋งํ–ˆ๋˜ [๋†’์ด, ๋„ˆ๋น„, x ์ค‘์‹ฌ์ , y์ค‘์‹ฌ์ ] ์ด ์•„๋‹ˆ๋ผ ์‹ค์ œ๋กœ๋Š” [์…€๋†’์ด / ์ „์ฒด๋†’์ด, ์…€๋„ˆ๋น„ / ์ „์ฒด๋„ˆ๋น„, x์ค‘์‹ฌ์ ์œผ๋กœ๋ถ€ํ„ฐ ๋ฒ—์–ด๋‚œ ํฌ๊ธฐ, y์ค‘์‹ฌ์ ์œผ๋กœ๋ถ€ํ„ฐ ๋ฒ—์–ด๋‚œ ํฌ๊ธฐ] ๋ฅผ ๋‹ค๋ฃฌ๋‹ค

  • anchor box๊ฐ€ ์—†๋‹ค๋ฉด?

    • ๊ฐ ์…€๊ฐ„์˜ ํŒŒ๋ผ๋ฏธํ„ฐ ๊ฐ’์ด ์„œ๋กœ ๋‹ฌ๋ผ์„œ ํ•™์Šต์ด ๋˜๊ธฐ ์–ด๋ ค์›€

    • object์˜ ์ผ๋ถ€๋ถ„๋งŒ์„ object๋ผ๊ณ  ํŒ๋‹จํ•œ๋‹ค. => ์ž˜ ํ•™์Šต๋˜์—ˆ๋‹ค๊ณ  ํ•  ์ˆ˜๋Š” ์—†๋‹ค.

Anchor Box

  • ๋ฏธ๋ฆฌ ์ •ํ•ด์ง„ ํฌ๊ธฐ์™€ ๋น„์œจ์„ ๊ฐ€์ง„ ์ •์˜๋œ ํ›„๋ณด box

  • ๊ฐ box์˜ ํฌ๊ธฐ๋Š” ์ •ํ•˜๊ธฐ ๋‚˜๋ฆ„์ด๋ฉฐ ์ฃผ๋กœ ๋ฐฐ์ˆ˜๋ฅผ ํ•ด์„œ ํฌ๊ธฐ๋ฅผ ์ฆ๊ฐ€์‹œํ‚จ๋‹ค. => ์–ด๋–ค๊ฒƒ์ด ํšจ์œจ์ ์ธ์ง€๋Š” ์•„์ง ์—ฐ๊ตฌ ์ค‘

  • ๊ฐ€์žฅ ์ž‘์€ box๋ฅผ IoU์™€ ๋น„๊ตํ•ด์„œ object๋ฅผ ํŒ๋‹จ

  • non-object๋กœ ํŒ๋‹จ๋˜๋ฉด ๋” ํฐ box์™€ ๋น„๊ต. ์ด ๊ณผ์ •์„ ๋ฐ˜๋ณต

  • ์ด ํ›„ ํŒ๋‹จ ๋  ๊ฒฝ์šฐ ๋ฉˆ์ถค

  • BAB : Background Anchor Box

  • OAB : Object Anchor Box

  • ์ด ํ›„ ๋ชจ๋“  Anchor box์˜ BAB/OAB ๊ฒฐ๊ณผ๋ฅผ ํ•˜๋‚˜์˜ ๋ฒกํ„ฐ๋กœ ๋งŒ๋“ ๋‹ค.

    • ํ•ด๋‹น ์œ„์น˜์— ๋Œ€ํ•ด์„œ๋Š” N * M ์‚ฌ์ด์ฆˆ์˜ AB ๋ฐ•์Šค๋Š” Object๋กœ ํŒ๋ณ„ํ•  ์ˆ˜ ์žˆ๋‹ค๋Š” ์˜๋ฏธ ์ „๋‹ฌ

    • ๊ธธ์ด๋Š” 2 x K

  • ๊ฐ ์œ„์น˜๋งˆ๋‹ค ์ด ๋ฒกํ„ฐ๊ฐ€ ์กด์žฌํ•˜๋ฏ€๋กœ image size * k ๊ฐœ์˜ outpu t

Last updated

Was this helpful?