Paper Archive

[CVPR'10] H. Jégou, et al.

Aggregating local descriptors into a compact image representation

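Abstract This work proposes a simple yet efficient way to aggregate local image descriptors into a vector of limited dimension, which can be viewed as a simplification of the Fisher kernel representation, and shows how to jointly optimize the dimension reduction and the indexing algorithm. We jointly consider three constraints, namely search accuracy, efficiency, and the memory usage of the representation, for very large-scale image...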

Posted by H. Jégou, et al. on June 13, 2010

[PACT'08] Hyunchul Park, et al.

Edge-centric modulo scheduling for coarse-grained reconfigurable architectures

Abstract Experiments on a wide variety of compute-intensive loops from the multimedia domain show that EMS improves throughput by 25% over traditional iterative modulo scheduling, and achieves 98%...

Posted by Hyunchul Park, et al. on October 25, 2008

[PLDI'08] Pluto

A practical automatic polyhedral parallelizer and locality optimizer

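Abstract We present the design and implementation of an automatic polyhedral source-to-source transformation framework that can optimize regular programs (sequences of possibly imperfectly nested loops) for parallelism and locality simultaneously. Through this work, we show the practicality of analytical model-driven automatic transformation in the polyhedral model, far beyond what is possible in current production compilers. Unlike previous work, our approach...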

Posted by Uday Bondhugula on June 7, 2008

['07] CHiLL

CHiLL: A Framework for Composing High-Level Loop Transformations

Abstract A general and robust loop transformation framework that enables compilers to generate efficient code on complex loop nests and shows performance results on automatically generated code for...

Posted by Chun Chen, et al. on January 1, 2007

['07] J. Bennett, et al.

The Netflix Prize

Abstract Netflix released a dataset containing 100 million anonymous movie ratings and challenged the data mining, machine learning and computer science communities to develop systems that could b...

Posted by J. Bennett, et al. on January 1, 2007

[I'92] P. Feautrier

Some efficient solutions to the affine scheduling problem. Part II. Multidimensional time

Abstract This paper extends the algorithms which were developed in Part I to cases in which there is no affine schedule, i.e. to problems whose parallel complexity is polynomial but not linear, an...

Posted by P. Feautrier on December 1, 1992

[MASSP'85] S. Kung

VLSI Array processors

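Abstract This paper provides a general overview of VLSI array processors and a unified treatment from algorithm, architecture, and application perspectives. High-speed signal processing depends critically on parallel processor technology. In most applications, general-purpose parallel computers cannot offer satisfactory real-time processing speed because of severe system overhead. Therefore, real-time digital...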

Posted by S. Kung on January 1, 1985