-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathclustering-code-R.html
More file actions
2127 lines (2057 loc) · 208 KB
/
clustering-code-R.html
File metadata and controls
2127 lines (2057 loc) · 208 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
<!DOCTYPE html>
<html xmlns="http://www.w3.org/1999/xhtml" lang="en" xml:lang="en"><head>
<meta charset="utf-8">
<meta name="generator" content="quarto-1.7.32">
<meta name="viewport" content="width=device-width, initial-scale=1.0, user-scalable=yes">
<title>clustering-code-R</title>
<style>
code{white-space: pre-wrap;}
span.smallcaps{font-variant: small-caps;}
div.columns{display: flex; gap: min(4vw, 1.5em);}
div.column{flex: auto; overflow-x: auto;}
div.hanging-indent{margin-left: 1.5em; text-indent: -1.5em;}
ul.task-list{list-style: none;}
ul.task-list li input[type="checkbox"] {
width: 0.8em;
margin: 0 0.8em 0.2em -1em; /* quarto-specific, see https://github.com/quarto-dev/quarto-cli/issues/4556 */
vertical-align: middle;
}
/* CSS for syntax highlighting */
html { -webkit-text-size-adjust: 100%; }
pre > code.sourceCode { white-space: pre; position: relative; }
pre > code.sourceCode > span { display: inline-block; line-height: 1.25; }
pre > code.sourceCode > span:empty { height: 1.2em; }
.sourceCode { overflow: visible; }
code.sourceCode > span { color: inherit; text-decoration: inherit; }
div.sourceCode { margin: 1em 0; }
pre.sourceCode { margin: 0; }
@media screen {
div.sourceCode { overflow: auto; }
}
@media print {
pre > code.sourceCode { white-space: pre-wrap; }
pre > code.sourceCode > span { text-indent: -5em; padding-left: 5em; }
}
pre.numberSource code
{ counter-reset: source-line 0; }
pre.numberSource code > span
{ position: relative; left: -4em; counter-increment: source-line; }
pre.numberSource code > span > a:first-child::before
{ content: counter(source-line);
position: relative; left: -1em; text-align: right; vertical-align: baseline;
border: none; display: inline-block;
-webkit-touch-callout: none; -webkit-user-select: none;
-khtml-user-select: none; -moz-user-select: none;
-ms-user-select: none; user-select: none;
padding: 0 4px; width: 4em;
}
pre.numberSource { margin-left: 3em; padding-left: 4px; }
div.sourceCode
{ }
@media screen {
pre > code.sourceCode > span > a:first-child::before { text-decoration: underline; }
}
</style>
<script src="clustering-code-R_files/libs/clipboard/clipboard.min.js"></script>
<script src="clustering-code-R_files/libs/quarto-html/quarto.js" type="module"></script>
<script src="clustering-code-R_files/libs/quarto-html/tabsets/tabsets.js" type="module"></script>
<script src="clustering-code-R_files/libs/quarto-html/popper.min.js"></script>
<script src="clustering-code-R_files/libs/quarto-html/tippy.umd.min.js"></script>
<script src="clustering-code-R_files/libs/quarto-html/anchor.min.js"></script>
<link href="clustering-code-R_files/libs/quarto-html/tippy.css" rel="stylesheet">
<link href="clustering-code-R_files/libs/quarto-html/quarto-syntax-highlighting-37eea08aefeeee20ff55810ff984fec1.css" rel="stylesheet" id="quarto-text-highlighting-styles">
<script src="clustering-code-R_files/libs/bootstrap/bootstrap.min.js"></script>
<link href="clustering-code-R_files/libs/bootstrap/bootstrap-icons.css" rel="stylesheet">
<link href="clustering-code-R_files/libs/bootstrap/bootstrap-81267100e462c21b3d6c0d5bf76a3417.min.css" rel="stylesheet" append-hash="true" id="quarto-bootstrap" data-mode="light">
<link href="clustering-code-R_files/libs/htmltools-fill-0.5.8.1/fill.css" rel="stylesheet">
<script src="clustering-code-R_files/libs/htmlwidgets-1.6.4/htmlwidgets.js"></script>
<script src="clustering-code-R_files/libs/plotly-binding-4.11.0/plotly.js"></script>
<script src="clustering-code-R_files/libs/typedarray-0.1/typedarray.min.js"></script>
<script src="clustering-code-R_files/libs/jquery-3.5.1/jquery.min.js"></script>
<link href="clustering-code-R_files/libs/crosstalk-1.2.2/css/crosstalk.min.css" rel="stylesheet">
<script src="clustering-code-R_files/libs/crosstalk-1.2.2/js/crosstalk.min.js"></script>
<link href="clustering-code-R_files/libs/plotly-htmlwidgets-css-2.11.1/plotly-htmlwidgets.css" rel="stylesheet">
<script src="clustering-code-R_files/libs/plotly-main-2.11.1/plotly-latest.min.js"></script>
</head>
<body class="fullcontent quarto-light">
<div id="quarto-content" class="page-columns page-rows-contents page-layout-article">
<main class="content" id="quarto-document-content">
<header id="title-block-header" class="quarto-title-block default">
<div class="quarto-title">
<h1 class="title">clustering-code-R</h1>
</div>
<div class="quarto-title-meta">
</div>
</header>
<section id="libraries" class="level1">
<h1>Libraries</h1>
<div class="cell">
<div class="sourceCode cell-code" id="cb1"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb1-1"><a href="#cb1-1" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(tidymodels)</span>
<span id="cb1-2"><a href="#cb1-2" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(tidyverse)</span>
<span id="cb1-3"><a href="#cb1-3" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(tidyclust)</span>
<span id="cb1-4"><a href="#cb1-4" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(rio)</span>
<span id="cb1-5"><a href="#cb1-5" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(GGally)</span>
<span id="cb1-6"><a href="#cb1-6" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(plotly)</span>
<span id="cb1-7"><a href="#cb1-7" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(cluster)</span>
<span id="cb1-8"><a href="#cb1-8" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(factoextra)</span>
<span id="cb1-9"><a href="#cb1-9" aria-hidden="true" tabindex="-1"></a><span class="fu">library</span>(NbClust)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
</div>
</section>
<section id="base-de-datos-customer-personality-analysis" class="level1">
<h1>Base de Datos: Customer Personality Analysis</h1>
<section id="contenido" class="level2">
<h2 class="anchored" data-anchor-id="contenido">Contenido</h2>
<p>Base de datos de <strong>perfiles de clientes</strong> con información demográfica, histórico de compras, interacciones con campañas de marketing y patrones de comportamiento.</p>
</section>
<section id="variables-originales" class="level2">
<h2 class="anchored" data-anchor-id="variables-originales">Variables originales</h2>
<ul>
<li>Datos demográficos (edad, educación, ingresos, estado civil)</li>
<li>Gasto por categoría de producto (vinos, frutas, carnes, pescado, dulces, oro)</li>
<li>Canales de compra (web, catálogo, tienda física)</li>
<li>Respuesta a campañas de marketing</li>
<li>Recencia de compras e interacciones</li>
</ul>
</section>
<section id="variables-seleccionadas-para-la-práctica" class="level2">
<h2 class="anchored" data-anchor-id="variables-seleccionadas-para-la-práctica">Variables seleccionadas para la práctica</h2>
<ul>
<li><strong>Gasto total:</strong> Suma de todas las categorías de productos</li>
<li><strong>Cantidad de compra total:</strong> Suma de todas las transacciones</li>
<li><strong>Edad:</strong> Derivada del año de nacimiento</li>
</ul>
</section>
</section>
<section id="importacion-y-tratamiento-de-la-data" class="level1">
<h1>Importacion y tratamiento de la data</h1>
<div class="cell">
<div class="sourceCode cell-code" id="cb2"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb2-1"><a href="#cb2-1" aria-hidden="true" tabindex="-1"></a>data <span class="ot"><-</span> <span class="fu">import</span>(<span class="st">"marketing_campaign.csv"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
</div>
<div class="cell">
<div class="sourceCode cell-code" id="cb3"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb3-1"><a href="#cb3-1" aria-hidden="true" tabindex="-1"></a><span class="fu">str</span>(data)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
<div class="cell-output cell-output-stdout">
<pre><code>'data.frame': 2240 obs. of 29 variables:
$ ID : int 5524 2174 4141 6182 5324 7446 965 6177 4855 5899 ...
$ Year_Birth : int 1957 1954 1965 1984 1981 1967 1971 1985 1974 1950 ...
$ Education : chr "Graduation" "Graduation" "Graduation" "Graduation" ...
$ Marital_Status : chr "Single" "Single" "Together" "Together" ...
$ Income : int 58138 46344 71613 26646 58293 62513 55635 33454 30351 5648 ...
$ Kidhome : int 0 1 0 1 1 0 0 1 1 1 ...
$ Teenhome : int 0 1 0 0 0 1 1 0 0 1 ...
$ Dt_Customer : chr "04-09-2012" "08-03-2014" "21-08-2013" "10-02-2014" ...
$ Recency : int 58 38 26 26 94 16 34 32 19 68 ...
$ MntWines : int 635 11 426 11 173 520 235 76 14 28 ...
$ MntFruits : int 88 1 49 4 43 42 65 10 0 0 ...
$ MntMeatProducts : int 546 6 127 20 118 98 164 56 24 6 ...
$ MntFishProducts : int 172 2 111 10 46 0 50 3 3 1 ...
$ MntSweetProducts : int 88 1 21 3 27 42 49 1 3 1 ...
$ MntGoldProds : int 88 6 42 5 15 14 27 23 2 13 ...
$ NumDealsPurchases : int 3 2 1 2 5 2 4 2 1 1 ...
$ NumWebPurchases : int 8 1 8 2 5 6 7 4 3 1 ...
$ NumCatalogPurchases: int 10 1 2 0 3 4 3 0 0 0 ...
$ NumStorePurchases : int 4 2 10 4 6 10 7 4 2 0 ...
$ NumWebVisitsMonth : int 7 5 4 6 5 6 6 8 9 20 ...
$ AcceptedCmp3 : int 0 0 0 0 0 0 0 0 0 1 ...
$ AcceptedCmp4 : int 0 0 0 0 0 0 0 0 0 0 ...
$ AcceptedCmp5 : int 0 0 0 0 0 0 0 0 0 0 ...
$ AcceptedCmp1 : int 0 0 0 0 0 0 0 0 0 0 ...
$ AcceptedCmp2 : int 0 0 0 0 0 0 0 0 0 0 ...
$ Complain : int 0 0 0 0 0 0 0 0 0 0 ...
$ Z_CostContact : int 3 3 3 3 3 3 3 3 3 3 ...
$ Z_Revenue : int 11 11 11 11 11 11 11 11 11 11 ...
$ Response : int 1 0 0 0 0 0 0 0 1 0 ...</code></pre>
</div>
</div>
<p>Exploramos las variables categoricas:</p>
<div class="cell">
<div class="sourceCode cell-code" id="cb5"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb5-1"><a href="#cb5-1" aria-hidden="true" tabindex="-1"></a>data<span class="sc">$</span>Education <span class="sc">|></span> <span class="fu">n_distinct</span>()</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
<div class="cell-output cell-output-stdout">
<pre><code>[1] 5</code></pre>
</div>
<div class="sourceCode cell-code" id="cb7"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb7-1"><a href="#cb7-1" aria-hidden="true" tabindex="-1"></a>data<span class="sc">$</span>Marital_Status <span class="sc">|></span> <span class="fu">n_distinct</span>()</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
<div class="cell-output cell-output-stdout">
<pre><code>[1] 8</code></pre>
</div>
<div class="sourceCode cell-code" id="cb9"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb9-1"><a href="#cb9-1" aria-hidden="true" tabindex="-1"></a>data<span class="sc">$</span>Dt_Customer <span class="sc">|></span> <span class="fu">n_distinct</span>()</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
<div class="cell-output cell-output-stdout">
<pre><code>[1] 663</code></pre>
</div>
</div>
<div class="cell">
<div class="sourceCode cell-code" id="cb11"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb11-1"><a href="#cb11-1" aria-hidden="true" tabindex="-1"></a>data<span class="sc">$</span>Education <span class="sc">|></span> <span class="fu">table</span>()</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
<div class="cell-output cell-output-stdout">
<pre><code>
2n Cycle Basic Graduation Master PhD
203 54 1127 370 486 </code></pre>
</div>
</div>
<div class="cell">
<div class="sourceCode cell-code" id="cb13"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb13-1"><a href="#cb13-1" aria-hidden="true" tabindex="-1"></a>data<span class="sc">$</span>Marital_Status <span class="sc">|></span> <span class="fu">table</span>()</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
<div class="cell-output cell-output-stdout">
<pre><code>
Absurd Alone Divorced Married Single Together Widow YOLO
2 3 232 864 480 580 77 2 </code></pre>
</div>
</div>
<div class="cell">
<div class="sourceCode cell-code" id="cb15"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb15-1"><a href="#cb15-1" aria-hidden="true" tabindex="-1"></a>data<span class="sc">$</span>Dt_Customer <span class="sc">|></span> <span class="fu">table</span>()</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
<div class="cell-output cell-output-stdout">
<pre><code>
01-01-2013 01-01-2014 01-02-2013 01-02-2014 01-03-2013 01-03-2014 01-04-2013
4 3 3 1 3 9 5
01-04-2014 01-05-2013 01-05-2014 01-06-2013 01-07-2013 01-08-2012 01-08-2013
3 3 2 2 3 4 5
01-09-2012 01-09-2013 01-10-2012 01-10-2013 01-11-2012 01-11-2013 01-12-2012
1 3 5 3 2 1 4
01-12-2013 02-01-2013 02-01-2014 02-02-2013 02-02-2014 02-03-2013 02-03-2014
5 9 3 4 5 1 4
02-04-2013 02-04-2014 02-05-2013 02-05-2014 02-06-2013 02-06-2014 02-07-2013
2 1 5 7 2 2 3
02-08-2012 02-08-2013 02-09-2012 02-09-2013 02-10-2012 02-10-2013 02-11-2012
3 4 1 2 1 7 2
02-11-2013 02-12-2012 02-12-2013 03-01-2013 03-01-2014 03-02-2013 03-02-2014
5 2 3 1 3 2 1
03-03-2013 03-03-2014 03-04-2013 03-04-2014 03-05-2013 03-05-2014 03-06-2013
6 3 3 3 3 2 8
03-06-2014 03-07-2013 03-08-2012 03-08-2013 03-09-2012 03-10-2013 03-11-2012
2 5 5 2 4 5 8
03-11-2013 03-12-2012 03-12-2013 04-01-2013 04-01-2014 04-02-2013 04-02-2014
4 4 1 2 2 2 1
04-03-2013 04-03-2014 04-04-2014 04-05-2013 04-05-2014 04-06-2013 04-06-2014
4 3 2 3 4 7 1
04-07-2013 04-08-2012 04-08-2013 04-09-2012 04-09-2013 04-10-2012 04-10-2013
6 4 1 4 2 6 7
04-11-2012 04-11-2013 04-12-2012 04-12-2013 05-01-2013 05-01-2014 05-02-2014
1 3 2 4 4 2 3
05-03-2013 05-03-2014 05-04-2013 05-04-2014 05-05-2013 05-05-2014 05-06-2014
1 3 8 9 3 1 3
05-07-2013 05-08-2012 05-08-2013 05-09-2012 05-09-2013 05-10-2012 05-10-2013
5 1 4 1 4 6 2
05-11-2012 05-11-2013 05-12-2012 05-12-2013 06-01-2013 06-01-2014 06-02-2013
5 4 4 2 2 1 5
06-02-2014 06-03-2013 06-03-2014 06-04-2013 06-04-2014 06-05-2013 06-05-2014
2 2 2 2 3 5 2
06-06-2013 06-06-2014 06-07-2013 06-08-2012 06-08-2013 06-09-2012 06-09-2013
3 1 4 3 3 4 3
06-10-2012 06-10-2013 06-11-2012 06-11-2013 06-12-2012 06-12-2013 07-01-2013
2 4 2 5 3 2 2
07-01-2014 07-02-2013 07-02-2014 07-03-2013 07-03-2014 07-04-2013 07-04-2014
3 2 7 4 2 1 4
07-05-2013 07-05-2014 07-06-2014 07-07-2013 07-08-2012 07-08-2013 07-09-2012
3 5 3 3 4 7 6
07-09-2013 07-10-2012 07-11-2012 07-11-2013 07-12-2012 07-12-2013 08-01-2013
3 1 5 8 5 5 5
08-02-2013 08-02-2014 08-03-2013 08-03-2014 08-04-2013 08-04-2014 08-05-2013
2 1 3 5 4 2 4
08-05-2014 08-06-2013 08-06-2014 08-07-2013 08-08-2012 08-08-2013 08-09-2012
4 5 4 4 7 2 6
08-10-2013 08-11-2012 08-11-2013 08-12-2012 08-12-2013 09-01-2013 09-01-2014
6 3 1 1 2 2 1
09-02-2013 09-02-2014 09-03-2013 09-03-2014 09-04-2013 09-04-2014 09-05-2013
2 3 1 2 1 2 3
09-05-2014 09-06-2013 09-06-2014 09-07-2013 09-08-2012 09-08-2013 09-09-2012
1 1 3 3 2 4 3
09-09-2013 09-10-2012 09-10-2013 09-11-2012 09-11-2013 09-12-2012 09-12-2013
5 1 3 8 1 2 5
10-01-2013 10-01-2014 10-02-2013 10-02-2014 10-03-2013 10-03-2014 10-04-2013
7 4 3 5 5 5 3
10-04-2014 10-05-2013 10-05-2014 10-06-2013 10-06-2014 10-08-2012 10-08-2013
1 4 4 4 3 2 1
10-09-2012 10-09-2013 10-10-2012 10-10-2013 10-11-2012 10-11-2013 10-12-2012
4 4 5 2 6 3 2
10-12-2013 11-01-2013 11-01-2014 11-02-2013 11-02-2014 11-03-2013 11-03-2014
2 5 5 4 3 4 2
11-04-2013 11-04-2014 11-05-2013 11-05-2014 11-06-2013 11-06-2014 11-07-2013
3 4 8 8 2 1 3
11-08-2012 11-08-2013 11-09-2012 11-09-2013 11-10-2012 11-10-2013 11-11-2012
3 4 3 6 1 4 3
11-12-2012 11-12-2013 12-01-2013 12-01-2014 12-02-2013 12-02-2014 12-03-2013
3 4 8 2 5 3 5
12-03-2014 12-04-2013 12-04-2014 12-05-2013 12-05-2014 12-06-2013 12-06-2014
5 1 2 4 11 3 1
12-07-2013 12-08-2012 12-08-2013 12-09-2012 12-09-2013 12-10-2012 12-10-2013
4 6 1 11 6 5 6
12-11-2012 12-11-2013 12-12-2012 12-12-2013 13-01-2013 13-01-2014 13-02-2013
3 2 6 3 6 1 7
13-02-2014 13-03-2014 13-04-2013 13-04-2014 13-05-2013 13-05-2014 13-06-2013
4 1 7 6 1 3 5
13-06-2014 13-07-2013 13-08-2012 13-08-2013 13-09-2013 13-10-2012 13-10-2013
1 3 2 1 1 6 3
13-11-2012 13-11-2013 13-12-2013 14-01-2013 14-01-2014 14-02-2013 14-02-2014
6 3 2 1 2 11 1
14-03-2013 14-04-2013 14-04-2014 14-05-2013 14-06-2013 14-06-2014 14-07-2013
2 1 2 1 3 1 4
14-08-2012 14-08-2013 14-09-2012 14-09-2013 14-10-2012 14-10-2013 14-11-2012
2 1 6 1 5 4 1
14-11-2013 14-12-2012 14-12-2013 15-01-2013 15-01-2014 15-02-2013 15-02-2014
6 1 2 2 4 3 5
15-03-2013 15-04-2013 15-04-2014 15-05-2013 15-05-2014 15-06-2014 15-07-2013
2 4 1 4 1 4 2
15-08-2012 15-08-2013 15-09-2012 15-09-2013 15-10-2012 15-10-2013 15-11-2013
1 5 2 4 4 2 6
15-12-2012 15-12-2013 16-01-2013 16-02-2013 16-02-2014 16-03-2013 16-03-2014
2 2 7 7 4 2 2
16-04-2013 16-04-2014 16-05-2013 16-05-2014 16-06-2013 16-06-2014 16-07-2013
2 4 1 2 2 4 3
16-08-2012 16-08-2013 16-09-2013 16-10-2012 16-10-2013 16-11-2012 16-12-2012
3 3 5 4 6 2 4
16-12-2013 17-01-2013 17-01-2014 17-02-2013 17-02-2014 17-03-2013 17-03-2014
4 6 4 3 5 5 7
17-04-2013 17-04-2014 17-05-2013 17-05-2014 17-06-2013 17-06-2014 17-07-2013
2 2 6 2 4 3 2
17-08-2012 17-08-2013 17-09-2012 17-09-2013 17-10-2012 17-10-2013 17-11-2012
8 4 3 5 2 4 5
17-11-2013 17-12-2012 17-12-2013 18-01-2014 18-02-2013 18-02-2014 18-03-2013
4 1 4 3 5 3 5
18-03-2014 18-04-2013 18-04-2014 18-05-2013 18-05-2014 18-06-2013 18-06-2014
5 6 5 3 5 2 3
18-07-2013 18-08-2012 18-08-2013 18-09-2012 18-09-2013 18-10-2012 18-10-2013
3 5 1 6 5 5 3
18-11-2012 18-11-2013 18-12-2012 18-12-2013 19-01-2013 19-01-2014 19-02-2013
1 1 2 1 4 3 5
19-02-2014 19-03-2013 19-03-2014 19-04-2013 19-04-2014 19-05-2013 19-05-2014
2 5 3 1 2 2 5
19-06-2013 19-06-2014 19-07-2013 19-08-2012 19-08-2013 19-09-2012 19-09-2013
4 4 5 2 3 1 3
19-10-2012 19-10-2013 19-11-2012 19-11-2013 19-12-2012 19-12-2013 20-01-2013
1 2 3 3 4 2 5
20-02-2013 20-03-2013 20-03-2014 20-04-2013 20-04-2014 20-05-2013 20-06-2013
7 3 4 3 4 6 5
20-06-2014 20-07-2013 20-08-2012 20-08-2013 20-09-2012 20-09-2013 20-10-2012
2 1 3 10 2 3 2
20-10-2013 20-11-2012 20-11-2013 20-12-2012 20-12-2013 21-01-2013 21-01-2014
4 2 3 1 3 3 5
21-02-2013 21-03-2013 21-03-2014 21-04-2013 21-04-2014 21-05-2013 21-06-2013
2 2 3 5 4 2 4
21-06-2014 21-07-2013 21-08-2012 21-08-2013 21-09-2012 21-09-2013 21-10-2012
2 5 2 4 4 1 1
21-10-2013 21-11-2012 21-11-2013 21-12-2012 21-12-2013 22-01-2013 22-01-2014
6 2 2 1 2 2 5
22-02-2013 22-02-2014 22-03-2013 22-03-2014 22-04-2013 22-04-2014 22-05-2013
4 1 3 1 6 1 7
22-05-2014 22-06-2013 22-06-2014 22-07-2013 22-08-2012 22-08-2013 22-09-2012
10 3 5 2 8 3 6
22-09-2013 22-10-2012 22-10-2013 22-11-2012 22-11-2013 22-12-2012 22-12-2013
4 3 3 3 3 5 1
23-01-2013 23-01-2014 23-02-2013 23-02-2014 23-03-2013 23-03-2014 23-04-2013
5 5 2 2 3 9 3
23-04-2014 23-05-2013 23-05-2014 23-06-2013 23-06-2014 23-07-2013 23-08-2012
5 3 8 4 2 3 1
23-08-2013 23-09-2012 23-09-2013 23-10-2012 23-10-2013 23-11-2012 23-11-2013
4 3 2 2 2 8 7
23-12-2012 23-12-2013 24-01-2013 24-01-2014 24-02-2014 24-03-2013 24-03-2014
2 4 1 3 5 3 4
24-04-2013 24-04-2014 24-05-2013 24-05-2014 24-06-2013 24-06-2014 24-07-2013
2 6 1 5 3 2 2
24-08-2012 24-08-2013 24-09-2012 24-09-2013 24-10-2012 24-10-2013 24-11-2012
2 1 2 2 1 4 3
24-12-2012 24-12-2013 25-01-2013 25-01-2014 25-02-2013 25-02-2014 25-03-2013
5 3 4 5 3 5 1
25-03-2014 25-04-2013 25-04-2014 25-05-2013 25-05-2014 25-06-2013 25-06-2014
3 2 2 1 3 2 5
25-07-2013 25-08-2012 25-08-2013 25-09-2012 25-09-2013 25-10-2012 25-10-2013
3 3 3 4 3 2 7
25-11-2012 25-11-2013 25-12-2012 25-12-2013 26-01-2013 26-01-2014 26-02-2013
2 3 3 4 3 7 2
26-02-2014 26-03-2013 26-03-2014 26-04-2014 26-05-2013 26-05-2014 26-06-2013
2 4 2 1 4 3 1
26-06-2014 26-07-2013 26-08-2012 26-08-2013 26-09-2012 26-09-2013 26-10-2012
4 2 3 3 3 3 3
26-10-2013 26-11-2012 26-11-2013 26-12-2012 26-12-2013 27-01-2013 27-01-2014
1 2 3 1 4 2 4
27-02-2013 27-02-2014 27-03-2013 27-03-2014 27-04-2013 27-04-2014 27-05-2013
3 2 4 1 5 1 1
27-05-2014 27-06-2013 27-06-2014 27-07-2013 27-08-2012 27-08-2013 27-09-2012
1 4 3 3 3 1 2
27-09-2013 27-10-2012 27-10-2013 27-11-2012 27-11-2013 27-12-2012 27-12-2013
3 2 3 1 2 1 4
28-01-2014 28-02-2013 28-02-2014 28-03-2013 28-03-2014 28-04-2013 28-04-2014
4 2 2 5 1 2 4
28-05-2013 28-05-2014 28-06-2013 28-06-2014 28-07-2013 28-08-2012 28-08-2013
4 4 3 3 1 2 5
28-09-2012 28-09-2013 28-10-2012 28-10-2013 28-11-2012 28-11-2013 28-12-2012
4 6 4 8 3 2 4
28-12-2013 29-01-2013 29-01-2014 29-03-2013 29-03-2014 29-04-2013 29-04-2014
1 3 1 5 6 3 4
29-05-2013 29-05-2014 29-06-2013 29-06-2014 29-07-2013 29-08-2012 29-08-2013
4 6 3 2 5 8 3
29-09-2012 29-10-2012 29-10-2013 29-11-2012 29-11-2013 29-12-2012 29-12-2013
2 9 3 4 4 2 5
30-01-2014 30-03-2013 30-03-2014 30-04-2013 30-04-2014 30-05-2013 30-05-2014
4 4 7 1 4 2 1
30-06-2013 30-07-2012 30-07-2013 30-08-2012 30-08-2013 30-09-2012 30-09-2013
3 1 5 4 3 1 5
30-10-2012 30-10-2013 30-11-2012 30-11-2013 30-12-2012 30-12-2013 31-01-2013
1 3 1 1 5 3 2
31-01-2014 31-03-2013 31-03-2014 31-05-2013 31-05-2014 31-07-2012 31-07-2013
1 5 5 3 2 1 2
31-08-2012 31-08-2013 31-10-2012 31-12-2012 31-12-2013
12 8 5 1 3 </code></pre>
</div>
</div>
<p>Transformamos las variables</p>
<ul>
<li>Calculamos la edad</li>
</ul>
<div class="cell">
<div class="sourceCode cell-code" id="cb17"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb17-1"><a href="#cb17-1" aria-hidden="true" tabindex="-1"></a>data <span class="ot"><-</span> data <span class="sc">|></span> </span>
<span id="cb17-2"><a href="#cb17-2" aria-hidden="true" tabindex="-1"></a> <span class="fu">mutate</span>(<span class="at">Edad =</span> <span class="dv">2021</span> <span class="sc">-</span> Year_Birth)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
</div>
<ul>
<li>Gasto total</li>
</ul>
<div class="cell">
<div class="sourceCode cell-code" id="cb18"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb18-1"><a href="#cb18-1" aria-hidden="true" tabindex="-1"></a>data <span class="ot"><-</span> data <span class="sc">|></span> </span>
<span id="cb18-2"><a href="#cb18-2" aria-hidden="true" tabindex="-1"></a> <span class="fu">mutate</span>(<span class="at">Gasto_total =</span> MntWines <span class="sc">+</span> MntFruits <span class="sc">+</span> MntMeatProducts <span class="sc">+</span> MntFishProducts <span class="sc">+</span> MntSweetProducts <span class="sc">+</span> MntGoldProds)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
</div>
<ul>
<li>Estado civil</li>
</ul>
<div class="cell">
<div class="sourceCode cell-code" id="cb19"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb19-1"><a href="#cb19-1" aria-hidden="true" tabindex="-1"></a>data <span class="ot"><-</span> data <span class="sc">|></span> </span>
<span id="cb19-2"><a href="#cb19-2" aria-hidden="true" tabindex="-1"></a> <span class="fu">mutate</span>(<span class="at">estado_civil =</span> <span class="fu">case_when</span>(</span>
<span id="cb19-3"><a href="#cb19-3" aria-hidden="true" tabindex="-1"></a> Marital_Status <span class="sc">%in%</span> <span class="fu">c</span>(<span class="st">"Married"</span>, <span class="st">"Together"</span>) <span class="sc">~</span> <span class="st">"Casado"</span>,</span>
<span id="cb19-4"><a href="#cb19-4" aria-hidden="true" tabindex="-1"></a> Marital_Status <span class="sc">%in%</span> <span class="fu">c</span>(<span class="st">"Absurd"</span>, <span class="st">"Widow"</span>, <span class="st">"YOLO"</span>, <span class="st">"Divorced"</span>, <span class="st">"Single"</span>) <span class="sc">~</span> <span class="st">"Soltero"</span>,</span>
<span id="cb19-5"><a href="#cb19-5" aria-hidden="true" tabindex="-1"></a> <span class="cn">TRUE</span> <span class="sc">~</span> Marital_Status</span>
<span id="cb19-6"><a href="#cb19-6" aria-hidden="true" tabindex="-1"></a> ))</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
</div>
<ul>
<li>Cantidad de hijos</li>
</ul>
<div class="cell">
<div class="sourceCode cell-code" id="cb20"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb20-1"><a href="#cb20-1" aria-hidden="true" tabindex="-1"></a>data <span class="ot"><-</span> data <span class="sc">|></span> </span>
<span id="cb20-2"><a href="#cb20-2" aria-hidden="true" tabindex="-1"></a> <span class="fu">mutate</span>(<span class="at">Cantidad_hijos =</span> Kidhome <span class="sc">+</span> Teenhome)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
</div>
<ul>
<li>Tamaño de la familia</li>
</ul>
<div class="cell">
<div class="sourceCode cell-code" id="cb21"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb21-1"><a href="#cb21-1" aria-hidden="true" tabindex="-1"></a>data <span class="ot"><-</span> data <span class="sc">|></span> </span>
<span id="cb21-2"><a href="#cb21-2" aria-hidden="true" tabindex="-1"></a> <span class="fu">mutate</span>(Tamaño<span class="at">_Familia =</span> <span class="fu">case_when</span>(</span>
<span id="cb21-3"><a href="#cb21-3" aria-hidden="true" tabindex="-1"></a> estado_civil <span class="sc">==</span> <span class="st">"Soltero"</span> <span class="sc">~</span> <span class="dv">1</span> <span class="sc">+</span> Cantidad_hijos,</span>
<span id="cb21-4"><a href="#cb21-4" aria-hidden="true" tabindex="-1"></a> estado_civil <span class="sc">==</span> <span class="st">"Casado"</span> <span class="sc">~</span> <span class="dv">2</span> <span class="sc">+</span> Cantidad_hijos</span>
<span id="cb21-5"><a href="#cb21-5" aria-hidden="true" tabindex="-1"></a> ))</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
</div>
<ul>
<li>Padre de familia</li>
</ul>
<div class="cell">
<div class="sourceCode cell-code" id="cb22"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb22-1"><a href="#cb22-1" aria-hidden="true" tabindex="-1"></a>data <span class="ot"><-</span> data <span class="sc">|></span> </span>
<span id="cb22-2"><a href="#cb22-2" aria-hidden="true" tabindex="-1"></a> <span class="fu">mutate</span>(<span class="at">Padre_de_Familia =</span> <span class="fu">ifelse</span>(Cantidad_hijos <span class="sc">></span> <span class="dv">0</span>, <span class="dv">1</span>, <span class="dv">0</span>))</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
</div>
<ul>
<li>Educacion</li>
</ul>
<div class="cell">
<div class="sourceCode cell-code" id="cb23"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb23-1"><a href="#cb23-1" aria-hidden="true" tabindex="-1"></a>data <span class="ot"><-</span> data <span class="sc">|></span> </span>
<span id="cb23-2"><a href="#cb23-2" aria-hidden="true" tabindex="-1"></a> <span class="fu">mutate</span>(<span class="at">Educacion =</span> <span class="fu">case_when</span>(</span>
<span id="cb23-3"><a href="#cb23-3" aria-hidden="true" tabindex="-1"></a> Education <span class="sc">%in%</span> <span class="fu">c</span>(<span class="st">"Basic"</span>, <span class="st">"2n Cycle"</span>) <span class="sc">~</span> <span class="st">"básica"</span>,</span>
<span id="cb23-4"><a href="#cb23-4" aria-hidden="true" tabindex="-1"></a> Education <span class="sc">==</span> <span class="st">"Graduation"</span> <span class="sc">~</span> <span class="st">"Pre_grado"</span>,</span>
<span id="cb23-5"><a href="#cb23-5" aria-hidden="true" tabindex="-1"></a> Education <span class="sc">%in%</span> <span class="fu">c</span>(<span class="st">"Master"</span>, <span class="st">"PhD"</span>) <span class="sc">~</span> <span class="st">"Pos_grado"</span>,</span>
<span id="cb23-6"><a href="#cb23-6" aria-hidden="true" tabindex="-1"></a> <span class="cn">TRUE</span> <span class="sc">~</span> Education</span>
<span id="cb23-7"><a href="#cb23-7" aria-hidden="true" tabindex="-1"></a> ))</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
</div>
<ul>
<li>Cambio de nombre a columnas de interes</li>
</ul>
<div class="cell">
<div class="sourceCode cell-code" id="cb24"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb24-1"><a href="#cb24-1" aria-hidden="true" tabindex="-1"></a>data <span class="ot"><-</span> data <span class="sc">|></span> </span>
<span id="cb24-2"><a href="#cb24-2" aria-hidden="true" tabindex="-1"></a> <span class="fu">rename</span>(</span>
<span id="cb24-3"><a href="#cb24-3" aria-hidden="true" tabindex="-1"></a> <span class="at">Vinos =</span> MntWines,</span>
<span id="cb24-4"><a href="#cb24-4" aria-hidden="true" tabindex="-1"></a> <span class="at">Frutas =</span> MntFruits,</span>
<span id="cb24-5"><a href="#cb24-5" aria-hidden="true" tabindex="-1"></a> <span class="at">Carne =</span> MntMeatProducts,</span>
<span id="cb24-6"><a href="#cb24-6" aria-hidden="true" tabindex="-1"></a> <span class="at">Pescado =</span> MntFishProducts,</span>
<span id="cb24-7"><a href="#cb24-7" aria-hidden="true" tabindex="-1"></a> <span class="at">Dulces =</span> MntSweetProducts,</span>
<span id="cb24-8"><a href="#cb24-8" aria-hidden="true" tabindex="-1"></a> <span class="at">Gold =</span> MntGoldProds,</span>
<span id="cb24-9"><a href="#cb24-9" aria-hidden="true" tabindex="-1"></a> <span class="at">Ingreso =</span> Income,</span>
<span id="cb24-10"><a href="#cb24-10" aria-hidden="true" tabindex="-1"></a> Niño<span class="at">_en_casa =</span> Kidhome,</span>
<span id="cb24-11"><a href="#cb24-11" aria-hidden="true" tabindex="-1"></a> <span class="at">Adolecente_en_casa =</span> Teenhome,</span>
<span id="cb24-12"><a href="#cb24-12" aria-hidden="true" tabindex="-1"></a> <span class="at">Dias_ultima_compra =</span> Recency,</span>
<span id="cb24-13"><a href="#cb24-13" aria-hidden="true" tabindex="-1"></a> <span class="at">gasto_vinos =</span> MntWines,</span>
<span id="cb24-14"><a href="#cb24-14" aria-hidden="true" tabindex="-1"></a> <span class="at">gasto_frutas =</span> MntFruits,</span>
<span id="cb24-15"><a href="#cb24-15" aria-hidden="true" tabindex="-1"></a> <span class="at">gasto_carne =</span> MntMeatProducts,</span>
<span id="cb24-16"><a href="#cb24-16" aria-hidden="true" tabindex="-1"></a> <span class="at">gasto_pescado =</span> MntFishProducts,</span>
<span id="cb24-17"><a href="#cb24-17" aria-hidden="true" tabindex="-1"></a> <span class="at">gasto_dulces =</span> MntSweetProducts,</span>
<span id="cb24-18"><a href="#cb24-18" aria-hidden="true" tabindex="-1"></a> <span class="at">gasto_gold =</span> MntGoldProds,</span>
<span id="cb24-19"><a href="#cb24-19" aria-hidden="true" tabindex="-1"></a> <span class="at">cantidad_compra_con_descuento =</span> NumDealsPurchases,</span>
<span id="cb24-20"><a href="#cb24-20" aria-hidden="true" tabindex="-1"></a> <span class="at">cantidad_compra_web =</span> NumWebPurchases,</span>
<span id="cb24-21"><a href="#cb24-21" aria-hidden="true" tabindex="-1"></a> <span class="at">cantidad_compra_catalogo =</span> NumCatalogPurchases,</span>
<span id="cb24-22"><a href="#cb24-22" aria-hidden="true" tabindex="-1"></a> <span class="at">cantidad_compra_tienda =</span> NumStorePurchases, </span>
<span id="cb24-23"><a href="#cb24-23" aria-hidden="true" tabindex="-1"></a> <span class="at">cantidad_visitas_web_mes =</span> NumWebVisitsMonth, </span>
<span id="cb24-24"><a href="#cb24-24" aria-hidden="true" tabindex="-1"></a> acepto_campaña<span class="at">_1 =</span> AcceptedCmp1,</span>
<span id="cb24-25"><a href="#cb24-25" aria-hidden="true" tabindex="-1"></a> acepto_campaña<span class="at">_2 =</span> AcceptedCmp2,</span>
<span id="cb24-26"><a href="#cb24-26" aria-hidden="true" tabindex="-1"></a> acepto_campaña<span class="at">_3 =</span> AcceptedCmp3,</span>
<span id="cb24-27"><a href="#cb24-27" aria-hidden="true" tabindex="-1"></a> acepto_campaña<span class="at">_4 =</span> AcceptedCmp4,</span>
<span id="cb24-28"><a href="#cb24-28" aria-hidden="true" tabindex="-1"></a> acepto_campaña<span class="at">_5 =</span> AcceptedCmp5,</span>
<span id="cb24-29"><a href="#cb24-29" aria-hidden="true" tabindex="-1"></a> <span class="at">registra_quejas =</span> Complain,</span>
<span id="cb24-30"><a href="#cb24-30" aria-hidden="true" tabindex="-1"></a> respondio_a_campaña <span class="ot">=</span> Response</span>
<span id="cb24-31"><a href="#cb24-31" aria-hidden="true" tabindex="-1"></a> </span>
<span id="cb24-32"><a href="#cb24-32" aria-hidden="true" tabindex="-1"></a> )</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
</div>
<ul>
<li>Variables descartadas</li>
</ul>
<div class="cell">
<div class="sourceCode cell-code" id="cb25"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb25-1"><a href="#cb25-1" aria-hidden="true" tabindex="-1"></a>data <span class="ot"><-</span> data <span class="sc">|></span> </span>
<span id="cb25-2"><a href="#cb25-2" aria-hidden="true" tabindex="-1"></a> <span class="fu">select</span>(<span class="sc">-</span><span class="fu">c</span>(Marital_Status, Dt_Customer, Z_CostContact, Z_Revenue, Year_Birth, ID, Education))</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
</div>
<div class="cell">
<div class="sourceCode cell-code" id="cb26"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb26-1"><a href="#cb26-1" aria-hidden="true" tabindex="-1"></a><span class="fu">head</span>(data)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
<div class="cell-output cell-output-stdout">
<pre><code> Ingreso Niño_en_casa Adolecente_en_casa Dias_ultima_compra gasto_vinos
1 58138 0 0 58 635
2 46344 1 1 38 11
3 71613 0 0 26 426
4 26646 1 0 26 11
5 58293 1 0 94 173
6 62513 0 1 16 520
gasto_frutas gasto_carne gasto_pescado gasto_dulces gasto_gold
1 88 546 172 88 88
2 1 6 2 1 6
3 49 127 111 21 42
4 4 20 10 3 5
5 43 118 46 27 15
6 42 98 0 42 14
cantidad_compra_con_descuento cantidad_compra_web cantidad_compra_catalogo
1 3 8 10
2 2 1 1
3 1 8 2
4 2 2 0
5 5 5 3
6 2 6 4
cantidad_compra_tienda cantidad_visitas_web_mes acepto_campaña_3
1 4 7 0
2 2 5 0
3 10 4 0
4 4 6 0
5 6 5 0
6 10 6 0
acepto_campaña_4 acepto_campaña_5 acepto_campaña_1 acepto_campaña_2
1 0 0 0 0
2 0 0 0 0
3 0 0 0 0
4 0 0 0 0
5 0 0 0 0
6 0 0 0 0
registra_quejas respondio_a_campaña Edad Gasto_total estado_civil
1 0 1 64 1617 Soltero
2 0 0 67 27 Soltero
3 0 0 56 776 Casado
4 0 0 37 53 Casado
5 0 0 40 422 Casado
6 0 0 54 716 Casado
Cantidad_hijos Tamaño_Familia Padre_de_Familia Educacion
1 0 1 0 Pre_grado
2 2 3 1 Pre_grado
3 0 2 0 Pre_grado
4 1 3 1 Pre_grado
5 1 3 1 Pos_grado
6 1 3 1 Pos_grado</code></pre>
</div>
</div>
<section id="buscamos-valores-faltantes-y-duplicados" class="level2">
<h2 class="anchored" data-anchor-id="buscamos-valores-faltantes-y-duplicados">Buscamos valores faltantes y duplicados</h2>
<div class="cell">
<div class="sourceCode cell-code" id="cb28"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb28-1"><a href="#cb28-1" aria-hidden="true" tabindex="-1"></a>na_dup <span class="ot"><-</span> <span class="cf">function</span>(data) {</span>
<span id="cb28-2"><a href="#cb28-2" aria-hidden="true" tabindex="-1"></a> missing_values <span class="ot"><-</span> <span class="fu">colSums</span>(<span class="fu">is.na</span>(data))</span>
<span id="cb28-3"><a href="#cb28-3" aria-hidden="true" tabindex="-1"></a> missing_values_perc <span class="ot"><-</span> missing_values <span class="sc">/</span> <span class="fu">nrow</span>(data) <span class="sc">*</span> <span class="dv">100</span></span>
<span id="cb28-4"><a href="#cb28-4" aria-hidden="true" tabindex="-1"></a> duplicated_values <span class="ot"><-</span> <span class="fu">sum</span>(<span class="fu">duplicated</span>(data))</span>
<span id="cb28-5"><a href="#cb28-5" aria-hidden="true" tabindex="-1"></a> duplicated_values_perc <span class="ot"><-</span> duplicated_values <span class="sc">/</span> <span class="fu">nrow</span>(data) <span class="sc">*</span> <span class="dv">100</span></span>
<span id="cb28-6"><a href="#cb28-6" aria-hidden="true" tabindex="-1"></a> data_types <span class="ot"><-</span> <span class="fu">sapply</span>(data, class)</span>
<span id="cb28-7"><a href="#cb28-7" aria-hidden="true" tabindex="-1"></a> </span>
<span id="cb28-8"><a href="#cb28-8" aria-hidden="true" tabindex="-1"></a> resumen <span class="ot"><-</span> <span class="fu">data.frame</span>(</span>
<span id="cb28-9"><a href="#cb28-9" aria-hidden="true" tabindex="-1"></a> <span class="at">Valores_Faltantes =</span> missing_values,</span>
<span id="cb28-10"><a href="#cb28-10" aria-hidden="true" tabindex="-1"></a> <span class="at">Valores_Faltantes_Porcentaje =</span> missing_values_perc,</span>
<span id="cb28-11"><a href="#cb28-11" aria-hidden="true" tabindex="-1"></a> <span class="at">Valores_Duplicados =</span> duplicated_values,</span>
<span id="cb28-12"><a href="#cb28-12" aria-hidden="true" tabindex="-1"></a> <span class="at">Valores_Duplicados_Porcentaje =</span> duplicated_values_perc,</span>
<span id="cb28-13"><a href="#cb28-13" aria-hidden="true" tabindex="-1"></a> <span class="at">Tipo_de_Variable =</span> data_types,</span>
<span id="cb28-14"><a href="#cb28-14" aria-hidden="true" tabindex="-1"></a> <span class="at">stringsAsFactors =</span> <span class="cn">FALSE</span></span>
<span id="cb28-15"><a href="#cb28-15" aria-hidden="true" tabindex="-1"></a> )</span>
<span id="cb28-16"><a href="#cb28-16" aria-hidden="true" tabindex="-1"></a> </span>
<span id="cb28-17"><a href="#cb28-17" aria-hidden="true" tabindex="-1"></a> <span class="fu">return</span>(resumen)</span>
<span id="cb28-18"><a href="#cb28-18" aria-hidden="true" tabindex="-1"></a>}</span>
<span id="cb28-19"><a href="#cb28-19" aria-hidden="true" tabindex="-1"></a></span>
<span id="cb28-20"><a href="#cb28-20" aria-hidden="true" tabindex="-1"></a><span class="fu">na_dup</span>(data)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
<div class="cell-output cell-output-stdout">
<pre><code> Valores_Faltantes Valores_Faltantes_Porcentaje
Ingreso 24 1.0714286
Niño_en_casa 0 0.0000000
Adolecente_en_casa 0 0.0000000
Dias_ultima_compra 0 0.0000000
gasto_vinos 0 0.0000000
gasto_frutas 0 0.0000000
gasto_carne 0 0.0000000
gasto_pescado 0 0.0000000
gasto_dulces 0 0.0000000
gasto_gold 0 0.0000000
cantidad_compra_con_descuento 0 0.0000000
cantidad_compra_web 0 0.0000000
cantidad_compra_catalogo 0 0.0000000
cantidad_compra_tienda 0 0.0000000
cantidad_visitas_web_mes 0 0.0000000
acepto_campaña_3 0 0.0000000
acepto_campaña_4 0 0.0000000
acepto_campaña_5 0 0.0000000
acepto_campaña_1 0 0.0000000
acepto_campaña_2 0 0.0000000
registra_quejas 0 0.0000000
respondio_a_campaña 0 0.0000000
Edad 0 0.0000000
Gasto_total 0 0.0000000
estado_civil 0 0.0000000
Cantidad_hijos 0 0.0000000
Tamaño_Familia 3 0.1339286
Padre_de_Familia 0 0.0000000
Educacion 0 0.0000000
Valores_Duplicados Valores_Duplicados_Porcentaje
Ingreso 183 8.169643
Niño_en_casa 183 8.169643
Adolecente_en_casa 183 8.169643
Dias_ultima_compra 183 8.169643
gasto_vinos 183 8.169643
gasto_frutas 183 8.169643
gasto_carne 183 8.169643
gasto_pescado 183 8.169643
gasto_dulces 183 8.169643
gasto_gold 183 8.169643
cantidad_compra_con_descuento 183 8.169643
cantidad_compra_web 183 8.169643
cantidad_compra_catalogo 183 8.169643
cantidad_compra_tienda 183 8.169643
cantidad_visitas_web_mes 183 8.169643
acepto_campaña_3 183 8.169643
acepto_campaña_4 183 8.169643
acepto_campaña_5 183 8.169643
acepto_campaña_1 183 8.169643
acepto_campaña_2 183 8.169643
registra_quejas 183 8.169643
respondio_a_campaña 183 8.169643
Edad 183 8.169643
Gasto_total 183 8.169643
estado_civil 183 8.169643
Cantidad_hijos 183 8.169643
Tamaño_Familia 183 8.169643
Padre_de_Familia 183 8.169643
Educacion 183 8.169643
Tipo_de_Variable
Ingreso integer
Niño_en_casa integer
Adolecente_en_casa integer
Dias_ultima_compra integer
gasto_vinos integer
gasto_frutas integer
gasto_carne integer
gasto_pescado integer
gasto_dulces integer
gasto_gold integer
cantidad_compra_con_descuento integer
cantidad_compra_web integer
cantidad_compra_catalogo integer
cantidad_compra_tienda integer
cantidad_visitas_web_mes integer
acepto_campaña_3 integer
acepto_campaña_4 integer
acepto_campaña_5 integer
acepto_campaña_1 integer
acepto_campaña_2 integer
registra_quejas integer
respondio_a_campaña integer
Edad numeric
Gasto_total integer
estado_civil character
Cantidad_hijos integer
Tamaño_Familia numeric
Padre_de_Familia numeric
Educacion character</code></pre>
</div>
</div>
<ul>
<li>No se identificaron valores duplicados</li>
<li>Se identificó que la columna <strong>Ingreso</strong> tiene valores faltantes, por lo cual se procesera a la imputacion de los datos</li>
</ul>
<div class="cell">
<div class="sourceCode cell-code" id="cb30"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb30-1"><a href="#cb30-1" aria-hidden="true" tabindex="-1"></a>data <span class="ot"><-</span> data <span class="sc">|></span></span>
<span id="cb30-2"><a href="#cb30-2" aria-hidden="true" tabindex="-1"></a> <span class="fu">mutate</span>(<span class="at">Ingreso =</span> <span class="fu">replace</span>(Ingreso, <span class="fu">is.na</span>(Ingreso), <span class="fu">mean</span>(Ingreso, <span class="at">na.rm =</span> <span class="cn">TRUE</span>)))</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
</div>
<p>Verificamos que no haya datos faltantes</p>
<div class="cell">
<div class="sourceCode cell-code" id="cb31"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb31-1"><a href="#cb31-1" aria-hidden="true" tabindex="-1"></a><span class="fu">na_dup</span>(data)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
<div class="cell-output cell-output-stdout">
<pre><code> Valores_Faltantes Valores_Faltantes_Porcentaje
Ingreso 0 0.0000000
Niño_en_casa 0 0.0000000
Adolecente_en_casa 0 0.0000000
Dias_ultima_compra 0 0.0000000
gasto_vinos 0 0.0000000
gasto_frutas 0 0.0000000
gasto_carne 0 0.0000000
gasto_pescado 0 0.0000000
gasto_dulces 0 0.0000000
gasto_gold 0 0.0000000
cantidad_compra_con_descuento 0 0.0000000
cantidad_compra_web 0 0.0000000
cantidad_compra_catalogo 0 0.0000000
cantidad_compra_tienda 0 0.0000000
cantidad_visitas_web_mes 0 0.0000000
acepto_campaña_3 0 0.0000000
acepto_campaña_4 0 0.0000000
acepto_campaña_5 0 0.0000000
acepto_campaña_1 0 0.0000000
acepto_campaña_2 0 0.0000000
registra_quejas 0 0.0000000
respondio_a_campaña 0 0.0000000
Edad 0 0.0000000
Gasto_total 0 0.0000000
estado_civil 0 0.0000000
Cantidad_hijos 0 0.0000000
Tamaño_Familia 3 0.1339286
Padre_de_Familia 0 0.0000000
Educacion 0 0.0000000
Valores_Duplicados Valores_Duplicados_Porcentaje
Ingreso 183 8.169643
Niño_en_casa 183 8.169643
Adolecente_en_casa 183 8.169643
Dias_ultima_compra 183 8.169643
gasto_vinos 183 8.169643
gasto_frutas 183 8.169643
gasto_carne 183 8.169643
gasto_pescado 183 8.169643
gasto_dulces 183 8.169643
gasto_gold 183 8.169643
cantidad_compra_con_descuento 183 8.169643
cantidad_compra_web 183 8.169643
cantidad_compra_catalogo 183 8.169643
cantidad_compra_tienda 183 8.169643
cantidad_visitas_web_mes 183 8.169643
acepto_campaña_3 183 8.169643
acepto_campaña_4 183 8.169643
acepto_campaña_5 183 8.169643
acepto_campaña_1 183 8.169643
acepto_campaña_2 183 8.169643
registra_quejas 183 8.169643
respondio_a_campaña 183 8.169643
Edad 183 8.169643
Gasto_total 183 8.169643
estado_civil 183 8.169643
Cantidad_hijos 183 8.169643
Tamaño_Familia 183 8.169643
Padre_de_Familia 183 8.169643
Educacion 183 8.169643
Tipo_de_Variable
Ingreso numeric
Niño_en_casa integer
Adolecente_en_casa integer
Dias_ultima_compra integer
gasto_vinos integer
gasto_frutas integer
gasto_carne integer
gasto_pescado integer
gasto_dulces integer
gasto_gold integer
cantidad_compra_con_descuento integer
cantidad_compra_web integer
cantidad_compra_catalogo integer
cantidad_compra_tienda integer
cantidad_visitas_web_mes integer
acepto_campaña_3 integer
acepto_campaña_4 integer
acepto_campaña_5 integer
acepto_campaña_1 integer
acepto_campaña_2 integer
registra_quejas integer
respondio_a_campaña integer
Edad numeric
Gasto_total integer
estado_civil character
Cantidad_hijos integer
Tamaño_Familia numeric
Padre_de_Familia numeric
Educacion character</code></pre>
</div>
</div>
<ul>
<li>Los valores faltantes fueron imputados correctamente</li>
</ul>
</section>
<section id="convertimos-a-factores" class="level2">
<h2 class="anchored" data-anchor-id="convertimos-a-factores">convertimos a factores</h2>
<div class="cell">
<div class="sourceCode cell-code" id="cb33"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb33-1"><a href="#cb33-1" aria-hidden="true" tabindex="-1"></a>char_a_factores <span class="ot"><-</span> <span class="cf">function</span>(data) {</span>
<span id="cb33-2"><a href="#cb33-2" aria-hidden="true" tabindex="-1"></a> data <span class="ot"><-</span> data <span class="sc">|></span> </span>
<span id="cb33-3"><a href="#cb33-3" aria-hidden="true" tabindex="-1"></a> <span class="fu">mutate</span>(<span class="fu">across</span>(<span class="fu">where</span>(is.character), as.factor))</span>
<span id="cb33-4"><a href="#cb33-4" aria-hidden="true" tabindex="-1"></a> <span class="fu">return</span>(data)</span>
<span id="cb33-5"><a href="#cb33-5" aria-hidden="true" tabindex="-1"></a>}</span>
<span id="cb33-6"><a href="#cb33-6" aria-hidden="true" tabindex="-1"></a></span>
<span id="cb33-7"><a href="#cb33-7" aria-hidden="true" tabindex="-1"></a>data <span class="ot"><-</span> <span class="fu">char_a_factores</span>(data)</span>
<span id="cb33-8"><a href="#cb33-8" aria-hidden="true" tabindex="-1"></a><span class="fu">str</span>(data)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
<div class="cell-output cell-output-stdout">
<pre><code>'data.frame': 2240 obs. of 29 variables:
$ Ingreso : num 58138 46344 71613 26646 58293 ...
$ Niño_en_casa : int 0 1 0 1 1 0 0 1 1 1 ...
$ Adolecente_en_casa : int 0 1 0 0 0 1 1 0 0 1 ...
$ Dias_ultima_compra : int 58 38 26 26 94 16 34 32 19 68 ...
$ gasto_vinos : int 635 11 426 11 173 520 235 76 14 28 ...
$ gasto_frutas : int 88 1 49 4 43 42 65 10 0 0 ...
$ gasto_carne : int 546 6 127 20 118 98 164 56 24 6 ...
$ gasto_pescado : int 172 2 111 10 46 0 50 3 3 1 ...
$ gasto_dulces : int 88 1 21 3 27 42 49 1 3 1 ...
$ gasto_gold : int 88 6 42 5 15 14 27 23 2 13 ...
$ cantidad_compra_con_descuento: int 3 2 1 2 5 2 4 2 1 1 ...
$ cantidad_compra_web : int 8 1 8 2 5 6 7 4 3 1 ...
$ cantidad_compra_catalogo : int 10 1 2 0 3 4 3 0 0 0 ...
$ cantidad_compra_tienda : int 4 2 10 4 6 10 7 4 2 0 ...
$ cantidad_visitas_web_mes : int 7 5 4 6 5 6 6 8 9 20 ...
$ acepto_campaña_3 : int 0 0 0 0 0 0 0 0 0 1 ...
$ acepto_campaña_4 : int 0 0 0 0 0 0 0 0 0 0 ...
$ acepto_campaña_5 : int 0 0 0 0 0 0 0 0 0 0 ...
$ acepto_campaña_1 : int 0 0 0 0 0 0 0 0 0 0 ...
$ acepto_campaña_2 : int 0 0 0 0 0 0 0 0 0 0 ...
$ registra_quejas : int 0 0 0 0 0 0 0 0 0 0 ...
$ respondio_a_campaña : int 1 0 0 0 0 0 0 0 1 0 ...
$ Edad : num 64 67 56 37 40 54 50 36 47 71 ...
$ Gasto_total : int 1617 27 776 53 422 716 590 169 46 49 ...
$ estado_civil : Factor w/ 3 levels "Alone","Casado",..: 3 3 2 2 2 2 3 2 2 2 ...
$ Cantidad_hijos : int 0 2 0 1 1 1 1 1 1 2 ...
$ Tamaño_Familia : num 1 3 2 3 3 3 2 3 3 4 ...
$ Padre_de_Familia : num 0 1 0 1 1 1 1 1 1 1 ...
$ Educacion : Factor w/ 3 levels "básica","Pos_grado",..: 3 3 3 3 2 2 3 2 2 2 ...</code></pre>
</div>
</div>
</section>
<section id="deteccion-de-posibles-outliers-en-la-data" class="level2">
<h2 class="anchored" data-anchor-id="deteccion-de-posibles-outliers-en-la-data">deteccion de posibles outliers en la data</h2>
<div class="cell">
<div class="sourceCode cell-code" id="cb35"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb35-1"><a href="#cb35-1" aria-hidden="true" tabindex="-1"></a>detectar_outliers <span class="ot"><-</span> <span class="cf">function</span>(x, <span class="at">nombre_var =</span> <span class="st">""</span>) {</span>
<span id="cb35-2"><a href="#cb35-2" aria-hidden="true" tabindex="-1"></a> Q1 <span class="ot"><-</span> <span class="fu">quantile</span>(x, <span class="fl">0.25</span>, <span class="at">na.rm =</span> <span class="cn">TRUE</span>)</span>
<span id="cb35-3"><a href="#cb35-3" aria-hidden="true" tabindex="-1"></a> Q3 <span class="ot"><-</span> <span class="fu">quantile</span>(x, <span class="fl">0.75</span>, <span class="at">na.rm =</span> <span class="cn">TRUE</span>)</span>
<span id="cb35-4"><a href="#cb35-4" aria-hidden="true" tabindex="-1"></a> IQR <span class="ot"><-</span> Q3 <span class="sc">-</span> Q1</span>
<span id="cb35-5"><a href="#cb35-5" aria-hidden="true" tabindex="-1"></a> </span>
<span id="cb35-6"><a href="#cb35-6" aria-hidden="true" tabindex="-1"></a> limite_inferior <span class="ot"><-</span> Q1 <span class="sc">-</span> <span class="fl">1.5</span> <span class="sc">*</span> IQR</span>
<span id="cb35-7"><a href="#cb35-7" aria-hidden="true" tabindex="-1"></a> limite_superior <span class="ot"><-</span> Q3 <span class="sc">+</span> <span class="fl">1.5</span> <span class="sc">*</span> IQR</span>
<span id="cb35-8"><a href="#cb35-8" aria-hidden="true" tabindex="-1"></a> </span>
<span id="cb35-9"><a href="#cb35-9" aria-hidden="true" tabindex="-1"></a> outliers <span class="ot"><-</span> x <span class="sc"><</span> limite_inferior <span class="sc">|</span> x <span class="sc">></span> limite_superior</span>
<span id="cb35-10"><a href="#cb35-10" aria-hidden="true" tabindex="-1"></a> </span>
<span id="cb35-11"><a href="#cb35-11" aria-hidden="true" tabindex="-1"></a> <span class="fu">boxplot</span>(x, <span class="at">main =</span> <span class="fu">paste</span>(<span class="st">"Boxplot de"</span>, nombre_var), </span>
<span id="cb35-12"><a href="#cb35-12" aria-hidden="true" tabindex="-1"></a> <span class="at">ylab =</span> <span class="st">"Valor"</span>, <span class="at">col =</span> <span class="st">"orange"</span>) </span>
<span id="cb35-13"><a href="#cb35-13" aria-hidden="true" tabindex="-1"></a></span>
<span id="cb35-14"><a href="#cb35-14" aria-hidden="true" tabindex="-1"></a> <span class="fu">return</span>(<span class="fu">list</span>(</span>
<span id="cb35-15"><a href="#cb35-15" aria-hidden="true" tabindex="-1"></a> <span class="at">outliers =</span> <span class="fu">which</span>(outliers),</span>
<span id="cb35-16"><a href="#cb35-16" aria-hidden="true" tabindex="-1"></a> <span class="at">cantidad =</span> <span class="fu">sum</span>(outliers),</span>
<span id="cb35-17"><a href="#cb35-17" aria-hidden="true" tabindex="-1"></a> <span class="at">limites =</span> <span class="fu">c</span>(<span class="at">inferior =</span> limite_inferior, <span class="at">superior =</span> limite_superior)</span>
<span id="cb35-18"><a href="#cb35-18" aria-hidden="true" tabindex="-1"></a> ))</span>
<span id="cb35-19"><a href="#cb35-19" aria-hidden="true" tabindex="-1"></a>}</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
</div>
<ul>
<li><p>Variables con posibles ouliers</p>
<ul>
<li>Ingreso</li>
</ul></li>
</ul>
<div class="cell">
<div class="sourceCode cell-code" id="cb36"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb36-1"><a href="#cb36-1" aria-hidden="true" tabindex="-1"></a><span class="fu">detectar_outliers</span>(data<span class="sc">$</span>Ingreso, <span class="st">"Ingreso"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
<div class="cell-output-display">
<div>
<figure class="figure">
<p><img src="clustering-code-R_files/figure-html/unnamed-chunk-23-1.png" class="img-fluid figure-img" width="672"></p>
</figure>
</div>
</div>
<div class="cell-output cell-output-stdout">
<pre><code>$outliers
[1] 165 618 656 688 1301 1654 2133 2234
$cantidad
[1] 8
$limites
inferior.25% superior.75%
-13587.75 117416.25 </code></pre>
</div>
</div>
<pre><code>- Edad</code></pre>
<div class="cell">
<div class="sourceCode cell-code" id="cb39"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb39-1"><a href="#cb39-1" aria-hidden="true" tabindex="-1"></a><span class="fu">detectar_outliers</span>(data<span class="sc">$</span>Edad, <span class="st">"Edad"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
<div class="cell-output-display">
<div>
<figure class="figure">
<p><img src="clustering-code-R_files/figure-html/unnamed-chunk-24-1.png" class="img-fluid figure-img" width="672"></p>
</figure>
</div>
</div>
<div class="cell-output cell-output-stdout">
<pre><code>$outliers
[1] 193 240 340
$cantidad
[1] 3
$limites
inferior.25% superior.75%
17 89 </code></pre>
</div>
</div>
<pre><code>- gasto_vinos</code></pre>
<div class="cell">
<div class="sourceCode cell-code" id="cb42"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb42-1"><a href="#cb42-1" aria-hidden="true" tabindex="-1"></a><span class="fu">detectar_outliers</span>(data<span class="sc">$</span>gasto_vinos, <span class="st">"Gasto en Vinos"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
<div class="cell-output-display">
<div>
<figure class="figure">
<p><img src="clustering-code-R_files/figure-html/unnamed-chunk-25-1.png" class="img-fluid figure-img" width="672"></p>
</figure>
</div>
</div>
<div class="cell-output cell-output-stdout">
<pre><code>$outliers
[1] 112 162 296 425 431 467 498 516 524 544 560 825 827 871 918
[16] 938 988 991 1002 1011 1053 1192 1459 1489 1493 1578 1642 1750 1923 1954
[31] 1962 1993 2068 2099 2128
$cantidad
[1] 35
$limites
inferior.25% superior.75%
-697 1225 </code></pre>
</div>
</div>
<pre><code>- gasto_frutas</code></pre>
<div class="cell">
<div class="sourceCode cell-code" id="cb45"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb45-1"><a href="#cb45-1" aria-hidden="true" tabindex="-1"></a><span class="fu">detectar_outliers</span>(data<span class="sc">$</span>gasto_frutas, <span class="st">"Gasto en Frutas"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
<div class="cell-output-display">
<div>
<figure class="figure">
<p><img src="clustering-code-R_files/figure-html/unnamed-chunk-26-1.png" class="img-fluid figure-img" width="672"></p>
</figure>
</div>
</div>
<div class="cell-output cell-output-stdout">
<pre><code>$outliers
[1] 1 30 46 54 73 77 78 103 108 110 111 117 118 127 133
[16] 169 199 202 229 242 244 249 253 267 277 278 287 289 293 306
[31] 313 337 340 348 354 361 391 393 402 419 427 448 460 464 507
[46] 510 512 514 520 529 538 557 558 562 564 592 593 615 631 633
[61] 638 643 645 647 659 673 682 687 690 702 724 728 740 754 755
[76] 763 768 772 791 799 804 813 826 836 847 856 885 898 903 906
[91] 925 928 929 941 943 961 966 971 975 985 987 1017 1031 1032 1048
[106] 1051 1063 1067 1072 1076 1081 1090 1101 1135 1145 1152 1158 1173 1180 1200
[121] 1202 1207 1217 1243 1266 1281 1289 1290 1314 1321 1337 1356 1359 1390 1414
[136] 1422 1430 1445 1452 1453 1470 1481 1485 1487 1489 1493 1496 1508 1509 1511
[151] 1521 1522 1527 1538 1553 1560 1564 1573 1590 1599 1602 1613 1616 1623 1628
[166] 1631 1649 1669 1672 1673 1681 1696 1700 1701 1723 1734 1735 1746 1750 1760
[181] 1768 1783 1790 1814 1824 1829 1854 1870 1872 1878 1881 1888 1891 1892 1895
[196] 1911 1913 1922 1925 1935 1944 1947 1954 1966 1977 1983 2014 2015 2037 2038
[211] 2040 2042 2049 2057 2058 2072 2074 2075 2085 2094 2097 2169 2173 2186 2204
[226] 2207 2218
$cantidad
[1] 227
$limites
inferior.25% superior.75%
-47 81 </code></pre>
</div>
</div>
<pre><code>- gasto_carne</code></pre>
<div class="cell">
<div class="sourceCode cell-code" id="cb48"><pre class="sourceCode r code-with-copy"><code class="sourceCode r"><span id="cb48-1"><a href="#cb48-1" aria-hidden="true" tabindex="-1"></a><span class="fu">detectar_outliers</span>(data<span class="sc">$</span>gasto_carne, <span class="st">"Gasto en Carne"</span>)</span></code><button title="Copy to Clipboard" class="code-copy-button"><i class="bi"></i></button></pre></div>
<div class="cell-output-display">
<div>
<figure class="figure">
<p><img src="clustering-code-R_files/figure-html/unnamed-chunk-27-1.png" class="img-fluid figure-img" width="672"></p>
</figure>
</div>
</div>
<div class="cell-output cell-output-stdout">
<pre><code>$outliers
[1] 22 30 52 77 78 89 105 117 127 136 141 143 165 198 229
[16] 241 249 289 293 314 339 340 341 358 361 367 391 416 425 427
[31] 457 491 516 520 547 558 562 592 593 610 628 635 637 642 645
[46] 651 687 688 690 724 727 736 751 754 767 791 799 815 821 838
[61] 846 853 879 885 886 907 911 915 928 943 944 967 987 988 989
[76] 1031 1032 1035 1053 1067 1085 1091 1114 1130 1134 1135 1154 1164 1180 1205
[91] 1207 1214 1260 1262 1270 1281 1289 1299 1302 1312 1351 1359 1386 1444 1445
[106] 1452 1453 1459 1477 1487 1493 1514 1521 1522 1529 1543 1573 1582 1583 1590
[121] 1602 1611 1613 1649 1654 1672 1673 1687 1711 1714 1722 1723 1724 1744 1746
[136] 1802 1809 1810 1814 1828 1829 1842 1851 1853 1854 1859 1870 1888 1891 1892
[151] 1928 1946 1953 1954 1959 1966 1983 1991 1993 2011 2013 2042 2087 2110 2132