Title: Directions and concepts of data mining
1. Directions and concepts of data mining
- Siti Norul Huda Sheikh Abdullah
Data Mining Seminar, 1 October 2002, Hotel Equatorial, Bangi
3.
- 8.45 morning tea
- 10.00-10.15 arh
- 10.15-10.45 snsha
- 10.45-11.15 norlailanie (nn)
- 11.15-11.45 amzari (ga)
- break
- 11.45-12.15 arhusain (evolution)
- 12.15-12.45 noranisah (rs)
- 12.30 break
- 1415-1445 mzm (overview method classification)
- 1445-1515 sharizan (kbr)
- 1515-1545 discussion
- 1545-1615 afternoon break
4. Introduction
- Purpose
- Definition of data mining
- Development of data mining to date
- Problems
- Research directions
5. Data mining process
[Figure: process diagram, based on Fayyad 1996. Labels recovered from the diagram:]
- Sampling / selection, giving target data
- Preprocessing / cleaning, giving clean data
- Transformation / reduction, giving transformed data
- Data mining, giving patterns / models
- Visualisation
- Secrets, effects, predictions, actions
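The stages in the diagram can be read as a chain of small functions, each consuming the output of the previous one. The following is only a minimal sketch under stated assumptions: the records, the field names (`weather`, `sales`), the threshold of 100 and all four function names are hypothetical, invented for illustration, and the "mining" step is a trivial co-occurrence count rather than any particular published algorithm.

```python
from collections import Counter

# Hypothetical raw records; one has a missing value on purpose.
raw = [
    {"day": 1, "weather": "rain", "sales": 40},
    {"day": 2, "weather": "sun", "sales": 150},
    {"day": 3, "weather": "rain", "sales": None},
    {"day": 4, "weather": "rain", "sales": 60},
    {"day": 5, "weather": "sun", "sales": 120},
]

def select(records, fields):
    """Selection / sampling: keep only the fields of interest (target data)."""
    return [{f: r.get(f) for f in fields} for r in records]

def clean(records):
    """Preprocessing / cleaning: drop records with missing values (clean data)."""
    return [r for r in records if all(v is not None for v in r.values())]

def transform(records, threshold=100):
    """Transformation / reduction: reduce numeric sales to a coarse label."""
    return [
        {"weather": r["weather"],
         "sales": "high" if r["sales"] >= threshold else "low"}
        for r in records
    ]

def mine(records):
    """Data mining: count co-occurring (weather, sales) pairs as simple patterns."""
    return Counter((r["weather"], r["sales"]) for r in records)

patterns = mine(transform(clean(select(raw, ["weather", "sales"]))))

# Interpretation step: list the patterns, most frequent first.
for (weather, sales), count in patterns.most_common():
    print(f"{weather} -> {sales} sales: {count} records")
```

Keeping each stage as a separate function mirrors the diagram: any stage can be swapped for a more serious technique without touching the others.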
6. Definitions of data mining
- Grabmeier et al 2001
- "..is the notion of all methods and techniques, which allow to analyse very large data sets to extract and discover previously unknown structures and relations out of such huge heaps of details. These information is filtered, prepared and classified so that it will be a valuable aid for decisions and strategies."
- Witten 2000
- "Data mining is about solving problems by analyzing data already present in databases"
- Fayyad et al 1994
- "KDD means a nontrivial process of identifying valid, novel, potentially useful, and ultimately understandable patterns in data."
7. Recent developments
- Temporal mining
- Data mining and 3D visualisation
- Cluster algorithms
- Discretisation techniques
- WEKA "garam galian" (mineral salts)
8. Paradigms and Methods in Temporal Knowledge Discovery
- Types of temporal data
- Static
- Sequence
- Timestamped
- Fully temporal
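The four kinds of temporal data listed above differ mainly in how much time information each record carries. The sketch below shows one possible way to represent each kind in Python. All sample values, the field names (`valid_from`, `valid_to`) and the helper `valid_on` are hypothetical, and the reading of each category is an interpretation of the slide, not a definition taken from it.

```python
from datetime import date

# Static: no temporal context at all, just a set of facts.
static = {"bread", "milk", "eggs"}

# Sequence: events are ordered, but carry no explicit time.
sequence = ["Committee", "Board", "Council"]

# Timestamped: each observation is tagged with the time it was taken.
timestamped = [
    (date(2002, 10, 1), 1013.2),
    (date(2002, 10, 2), 1009.8),
    (date(2002, 10, 3), 1004.1),
]

# Fully temporal: each record carries its own time dimension,
# here a validity interval (assumed field names).
fully_temporal = [
    {"status": "single", "valid_from": date(1990, 1, 1), "valid_to": date(1999, 6, 30)},
    {"status": "married", "valid_from": date(1999, 7, 1), "valid_to": date(9999, 12, 31)},
]

def valid_on(records, day):
    """Return the records whose validity interval contains the given day."""
    return [r for r in records if r["valid_from"] <= day <= r["valid_to"]]

current = valid_on(fully_temporal, date(2002, 10, 1))
print(current[0]["status"])
```

Only the last two representations can answer questions about when something held, which is why they support richer temporal rules.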
9. Taxonomy of temporal mining concepts by Roddick et al 2002
Roddick, J.F., Spiliopoulou, M. A Survey of Temporal Knowledge Discovery Paradigms and Methods. IEEE Transactions on Knowledge and Data Engineering. Vol 14. No 4, July/August 2002.
10. Examples of temporal rules
- A drop in atmospheric pressure precedes rainfall in 60% of cases.
- The sequence Committee → Board → Council occurs approximately every month.
- Marital status is becoming less of a determinant of voting behaviour.
- Beachside flooding only occurs during spring high tides.
- There is a higher incidence of earthquakes during and soon after periods of higher atmospheric pressure.
- Some patients tend to develop reactions after two months with this combination of drugs.
- The introduction of Euro caused a different pattern of buying behaviour in offshore markets.
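A rule of the form "A precedes B in some fraction of cases" can be checked against a timestamped event log by counting how often each occurrence of A is followed by B within a time window. The sketch below is one simple way to do that. The event log, the window of 2 time units and the function name `precedes_confidence` are assumptions made for illustration; the slide does not say how such rules were actually derived.

```python
def precedes_confidence(events, antecedent, consequent, window):
    """Fraction of antecedent events followed by a consequent event
    within `window` time units. `events` is a list of (time, label) pairs."""
    starts = [t for t, label in events if label == antecedent]
    if not starts:
        return 0.0
    hits = sum(
        1
        for t in starts
        if any(label == consequent and t < u <= t + window for u, label in events)
    )
    return hits / len(starts)

# Hypothetical log: times are arbitrary units (for example, hours).
log = [
    (1, "pressure_drop"), (2, "rain"),
    (5, "pressure_drop"),
    (9, "pressure_drop"), (10, "rain"),
    (14, "pressure_drop"),
    (20, "pressure_drop"), (21, "rain"),
]

confidence = precedes_confidence(log, "pressure_drop", "rain", window=2)
print(f"pressure_drop precedes rain in {confidence:.0%} of cases")
```

The choice of window matters: too short and genuine follow-ups are missed, too long and unrelated events are counted as hits.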
11. Data mining and 3D visualisation
- Especially for scientific data such as
- protein classification
- Protein data bank
- Clustering compounds
- Specifically for chemical compounds
- Technique
- Geometric Hashing
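Geometric hashing stores each model's points in coordinates relative to a basis chosen from the model itself, so that a transformed copy of the model hashes to the same table entries. The following is a minimal two-dimensional sketch only: protein work would use 3D coordinates and larger bases, and the models `A` and `B`, the cell size `STEP` and all function names are hypothetical. Points are written as complex numbers, so `(r - p) / (q - p)` is unchanged by translation, rotation and uniform scaling.

```python
from collections import defaultdict
from itertools import permutations

STEP = 0.25  # quantisation cell size (assumed value)

def frame_key(p, q, r, step=STEP):
    """Quantised coordinates of r in the frame defined by the basis pair (p, q)."""
    z = (r - p) / (q - p)
    return (round(z.real / step), round(z.imag / step))

def build_table(models, step=STEP):
    """Preprocessing: for every model and every ordered basis pair,
    record where the remaining points fall."""
    table = defaultdict(set)
    for name, points in models.items():
        for p, q in permutations(points, 2):
            for r in points:
                if r != p and r != q:
                    table[frame_key(p, q, r, step)].add((name, p, q))
    return table

def recognise(table, scene, step=STEP):
    """Recognition: use the first two scene points as the basis and let the
    other points vote for (model, basis) entries. A fuller version would
    try several scene bases."""
    p, q = scene[0], scene[1]
    votes = defaultdict(int)
    for r in scene[2:]:
        for entry in table.get(frame_key(p, q, r, step), ()):
            votes[entry] += 1
    if not votes:
        return None
    (name, _, _), count = max(votes.items(), key=lambda kv: kv[1])
    return name, count

models = {
    "A": [0j, 1 + 0j, 0.5 + 1j, 2 + 0.5j],
    "B": [0j, 1 + 0j, 1 + 2j, -1 + 1j],
}
table = build_table(models)

# Scene: model A rotated by 90 degrees, scaled by 3 and shifted.
scene = [3j * z + (5 + 2j) for z in models["A"]]
result = recognise(table, scene)
print(result)
```

The table is built once per model library, so recognition cost depends on the scene size rather than on the number of stored models.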
12. [Figure] a. A 3D protein; b. 3 substructures of the protein in (a)
13. Cluster algorithms
Grabmeier, J., Rudolph, A. 2002. Techniques of Cluster Algorithms in Data Mining. Data Mining and Knowledge Discovery. Vol 6 Pg 340. Netherlands: Kluwer Academic Publishers.
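As a concrete instance of a cluster algorithm, the sketch below implements plain k-means in pure Python. This is an illustrative choice, not necessarily the algorithm shown at the seminar. The sample points and the deterministic initialisation (the first k points become the starting centroids) are assumptions made so the example is reproducible.

```python
def dist2(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def mean(points):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))

def nearest(point, centroids):
    """Index of the centroid closest to the point."""
    return min(range(len(centroids)), key=lambda i: dist2(point, centroids[i]))

def kmeans(points, k, max_iterations=50):
    """Return (centroids, labels). Initial centroids are the first k points."""
    centroids = [tuple(p) for p in points[:k]]
    for _ in range(max_iterations):
        clusters = [[] for _ in range(k)]
        for p in points:
            clusters[nearest(p, centroids)].append(p)
        # An empty cluster keeps its previous centroid.
        updated = [mean(c) if c else centroids[i] for i, c in enumerate(clusters)]
        if updated == centroids:
            break
        centroids = updated
    labels = [nearest(p, centroids) for p in points]
    return centroids, labels

# Hypothetical data: two visibly separate groups.
data = [(1, 1), (1.5, 2), (1, 0.5), (8, 8), (9, 9.5), (8.5, 9)]
centroids, labels = kmeans(data, k=2)
print(labels)
```

In practice the result depends on the initial centroids, so real tools usually run several random initialisations and keep the best one.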
14. Discretisation process
15. Discretisation techniques
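Discretisation turns a continuous attribute into a small number of intervals. The slides give no details of the techniques discussed, so the sketch below uses equal-width binning, one of the simplest approaches, purely as an illustration. The function name `equal_width_bins` and the sample values are hypothetical.

```python
def equal_width_bins(values, n_bins):
    """Assign each value to one of n_bins intervals of equal width
    spanning [min(values), max(values)]. Returns a list of bin indices."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins
    if width == 0:
        # All values are identical: everything falls in the first bin.
        return [0 for _ in values]
    # The maximum value would land in bin n_bins, so clamp it to the last bin.
    return [min(int((v - lo) / width), n_bins - 1) for v in values]

readings = [0, 2.5, 5, 7.5, 10]
bins = equal_width_bins(readings, n_bins=4)
print(bins)
```

Equal-width binning is sensitive to outliers, which stretch the range and crowd most values into a few bins. Equal-frequency or class-aware methods are common alternatives.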
16. WEKA
- Let us look at the program
- WEKA
17. Conclusion
- Produce a data mining tool that can be used by various parties
- Bring together various techniques in a single tool, to be updated in stages.
- Provide a service to those who need it, possibly free of charge.
18. May we all succeed!