deeplearn@ML-RefVm-967342:~/imagenet$ wget https://cross-entropy.net/ML530/imagenet-transfer.py.txt --2022-10-31 02:44:23-- https://cross-entropy.net/ML530/imagenet-transfer.py.txt Resolving cross-entropy.net (cross-entropy.net)... 107.180.57.14 Connecting to cross-entropy.net (cross-entropy.net)|107.180.57.14|:443... connected. HTTP request sent, awaiting response... 200 OK Length: 3989 (3.9K) [text/plain] Saving to: ‘imagenet-transfer.py.txt’ imagenet-transfer.py.txt 100%[============================================================>] 3.90K --.-KB/s in 0s 2022-10-31 02:44:24 (2.86 GB/s) - ‘imagenet-transfer.py.txt’ saved [3989/3989] deeplearn@ML-RefVm-967342:~/imagenet$ time python imagenet-transfer.py.txt 2022-10-31 02:45:40.954098: I tensorflow/core/platform/cpu_feature_guard.cc:193] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations: AVX2 FMA To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags. 2022-10-31 02:45:41.523765: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1532] Created device /job:localhost/replica:0/task:0/device:GPU:0 with 10794 MB memory: -> device: 0, name: Tesla K80, pci bus id: 0001:00:00.0, compute capability: 3.7 Model: "transfer_model" _______________________________________________________________________________________________________________________________________ Layer (type) Output Shape Param # Connected to ======================================================================================================================================= input_1 (InputLayer) [(None, 64, 64, 3)] 0 [] resizing (Resizing) (None, 235, 235, 3) 0 ['input_1[0][0]'] random_crop (RandomCrop) (None, 224, 224, 3) 0 ['resizing[0][0]'] random_flip (RandomFlip) (None, 224, 224, 3) 0 ['random_crop[0][0]'] rescaling (Rescaling) (None, 224, 224, 3) 0 ['random_flip[0][0]'] normalization (Normalization) (None, 224, 224, 3) 0 ['rescaling[0][0]'] stem_conv (Conv2D) (None, 112, 112, 32) 864 ['normalization[0][0]'] stem_bn (BatchNormalization) (None, 112, 112, 32) 128 ['stem_conv[0][0]'] stem_activation (Activation) (None, 112, 112, 32) 0 ['stem_bn[0][0]'] block1a_project_conv (Conv2D) (None, 112, 112, 16) 4608 ['stem_activation[0][0]'] block1a_project_bn (BatchNormalization) (None, 112, 112, 16) 64 ['block1a_project_conv[0][0]'] block1a_project_activation (Activation) (None, 112, 112, 16) 0 ['block1a_project_bn[0][0]'] block2a_expand_conv (Conv2D) (None, 56, 56, 64) 9216 ['block1a_project_activation[0][0]'] block2a_expand_bn (BatchNormalization) (None, 56, 56, 64) 256 ['block2a_expand_conv[0][0]'] block2a_expand_activation (Activation) (None, 56, 56, 64) 0 ['block2a_expand_bn[0][0]'] block2a_project_conv (Conv2D) (None, 56, 56, 32) 2048 ['block2a_expand_activation[0][0]'] block2a_project_bn (BatchNormalization) (None, 56, 56, 32) 128 ['block2a_project_conv[0][0]'] block2b_expand_conv (Conv2D) (None, 56, 56, 128) 36864 ['block2a_project_bn[0][0]'] block2b_expand_bn (BatchNormalization) (None, 56, 56, 128) 512 ['block2b_expand_conv[0][0]'] block2b_expand_activation (Activation) (None, 56, 56, 128) 0 ['block2b_expand_bn[0][0]'] block2b_project_conv (Conv2D) (None, 56, 56, 32) 4096 ['block2b_expand_activation[0][0]'] block2b_project_bn (BatchNormalization) (None, 56, 56, 32) 128 ['block2b_project_conv[0][0]'] block2b_drop (Dropout) (None, 56, 56, 32) 0 ['block2b_project_bn[0][0]'] block2b_add (Add) (None, 56, 56, 32) 0 ['block2b_drop[0][0]', 'block2a_project_bn[0][0]'] block3a_expand_conv (Conv2D) (None, 28, 28, 128) 36864 ['block2b_add[0][0]'] block3a_expand_bn (BatchNormalization) (None, 28, 28, 128) 512 ['block3a_expand_conv[0][0]'] block3a_expand_activation (Activation) (None, 28, 28, 128) 0 ['block3a_expand_bn[0][0]'] block3a_project_conv (Conv2D) (None, 28, 28, 48) 6144 ['block3a_expand_activation[0][0]'] block3a_project_bn (BatchNormalization) (None, 28, 28, 48) 192 ['block3a_project_conv[0][0]'] block3b_expand_conv (Conv2D) (None, 28, 28, 192) 82944 ['block3a_project_bn[0][0]'] block3b_expand_bn (BatchNormalization) (None, 28, 28, 192) 768 ['block3b_expand_conv[0][0]'] block3b_expand_activation (Activation) (None, 28, 28, 192) 0 ['block3b_expand_bn[0][0]'] block3b_project_conv (Conv2D) (None, 28, 28, 48) 9216 ['block3b_expand_activation[0][0]'] block3b_project_bn (BatchNormalization) (None, 28, 28, 48) 192 ['block3b_project_conv[0][0]'] block3b_drop (Dropout) (None, 28, 28, 48) 0 ['block3b_project_bn[0][0]'] block3b_add (Add) (None, 28, 28, 48) 0 ['block3b_drop[0][0]', 'block3a_project_bn[0][0]'] block4a_expand_conv (Conv2D) (None, 28, 28, 192) 9216 ['block3b_add[0][0]'] block4a_expand_bn (BatchNormalization) (None, 28, 28, 192) 768 ['block4a_expand_conv[0][0]'] block4a_expand_activation (Activation) (None, 28, 28, 192) 0 ['block4a_expand_bn[0][0]'] block4a_dwconv2 (DepthwiseConv2D) (None, 14, 14, 192) 1728 ['block4a_expand_activation[0][0]'] block4a_bn (BatchNormalization) (None, 14, 14, 192) 768 ['block4a_dwconv2[0][0]'] block4a_activation (Activation) (None, 14, 14, 192) 0 ['block4a_bn[0][0]'] block4a_se_squeeze (GlobalAveragePooling2D (None, 192) 0 ['block4a_activation[0][0]'] ) block4a_se_reshape (Reshape) (None, 1, 1, 192) 0 ['block4a_se_squeeze[0][0]'] block4a_se_reduce (Conv2D) (None, 1, 1, 12) 2316 ['block4a_se_reshape[0][0]'] block4a_se_expand (Conv2D) (None, 1, 1, 192) 2496 ['block4a_se_reduce[0][0]'] block4a_se_excite (Multiply) (None, 14, 14, 192) 0 ['block4a_activation[0][0]', 'block4a_se_expand[0][0]'] block4a_project_conv (Conv2D) (None, 14, 14, 96) 18432 ['block4a_se_excite[0][0]'] block4a_project_bn (BatchNormalization) (None, 14, 14, 96) 384 ['block4a_project_conv[0][0]'] block4b_expand_conv (Conv2D) (None, 14, 14, 384) 36864 ['block4a_project_bn[0][0]'] block4b_expand_bn (BatchNormalization) (None, 14, 14, 384) 1536 ['block4b_expand_conv[0][0]'] block4b_expand_activation (Activation) (None, 14, 14, 384) 0 ['block4b_expand_bn[0][0]'] block4b_dwconv2 (DepthwiseConv2D) (None, 14, 14, 384) 3456 ['block4b_expand_activation[0][0]'] block4b_bn (BatchNormalization) (None, 14, 14, 384) 1536 ['block4b_dwconv2[0][0]'] block4b_activation (Activation) (None, 14, 14, 384) 0 ['block4b_bn[0][0]'] block4b_se_squeeze (GlobalAveragePooling2D (None, 384) 0 ['block4b_activation[0][0]'] ) block4b_se_reshape (Reshape) (None, 1, 1, 384) 0 ['block4b_se_squeeze[0][0]'] block4b_se_reduce (Conv2D) (None, 1, 1, 24) 9240 ['block4b_se_reshape[0][0]'] block4b_se_expand (Conv2D) (None, 1, 1, 384) 9600 ['block4b_se_reduce[0][0]'] block4b_se_excite (Multiply) (None, 14, 14, 384) 0 ['block4b_activation[0][0]', 'block4b_se_expand[0][0]'] block4b_project_conv (Conv2D) (None, 14, 14, 96) 36864 ['block4b_se_excite[0][0]'] block4b_project_bn (BatchNormalization) (None, 14, 14, 96) 384 ['block4b_project_conv[0][0]'] block4b_drop (Dropout) (None, 14, 14, 96) 0 ['block4b_project_bn[0][0]'] block4b_add (Add) (None, 14, 14, 96) 0 ['block4b_drop[0][0]', 'block4a_project_bn[0][0]'] block4c_expand_conv (Conv2D) (None, 14, 14, 384) 36864 ['block4b_add[0][0]'] block4c_expand_bn (BatchNormalization) (None, 14, 14, 384) 1536 ['block4c_expand_conv[0][0]'] block4c_expand_activation (Activation) (None, 14, 14, 384) 0 ['block4c_expand_bn[0][0]'] block4c_dwconv2 (DepthwiseConv2D) (None, 14, 14, 384) 3456 ['block4c_expand_activation[0][0]'] block4c_bn (BatchNormalization) (None, 14, 14, 384) 1536 ['block4c_dwconv2[0][0]'] block4c_activation (Activation) (None, 14, 14, 384) 0 ['block4c_bn[0][0]'] block4c_se_squeeze (GlobalAveragePooling2D (None, 384) 0 ['block4c_activation[0][0]'] ) block4c_se_reshape (Reshape) (None, 1, 1, 384) 0 ['block4c_se_squeeze[0][0]'] block4c_se_reduce (Conv2D) (None, 1, 1, 24) 9240 ['block4c_se_reshape[0][0]'] block4c_se_expand (Conv2D) (None, 1, 1, 384) 9600 ['block4c_se_reduce[0][0]'] block4c_se_excite (Multiply) (None, 14, 14, 384) 0 ['block4c_activation[0][0]', 'block4c_se_expand[0][0]'] block4c_project_conv (Conv2D) (None, 14, 14, 96) 36864 ['block4c_se_excite[0][0]'] block4c_project_bn (BatchNormalization) (None, 14, 14, 96) 384 ['block4c_project_conv[0][0]'] block4c_drop (Dropout) (None, 14, 14, 96) 0 ['block4c_project_bn[0][0]'] block4c_add (Add) (None, 14, 14, 96) 0 ['block4c_drop[0][0]', 'block4b_add[0][0]'] block5a_expand_conv (Conv2D) (None, 14, 14, 576) 55296 ['block4c_add[0][0]'] block5a_expand_bn (BatchNormalization) (None, 14, 14, 576) 2304 ['block5a_expand_conv[0][0]'] block5a_expand_activation (Activation) (None, 14, 14, 576) 0 ['block5a_expand_bn[0][0]'] block5a_dwconv2 (DepthwiseConv2D) (None, 14, 14, 576) 5184 ['block5a_expand_activation[0][0]'] block5a_bn (BatchNormalization) (None, 14, 14, 576) 2304 ['block5a_dwconv2[0][0]'] block5a_activation (Activation) (None, 14, 14, 576) 0 ['block5a_bn[0][0]'] block5a_se_squeeze (GlobalAveragePooling2D (None, 576) 0 ['block5a_activation[0][0]'] ) block5a_se_reshape (Reshape) (None, 1, 1, 576) 0 ['block5a_se_squeeze[0][0]'] block5a_se_reduce (Conv2D) (None, 1, 1, 24) 13848 ['block5a_se_reshape[0][0]'] block5a_se_expand (Conv2D) (None, 1, 1, 576) 14400 ['block5a_se_reduce[0][0]'] block5a_se_excite (Multiply) (None, 14, 14, 576) 0 ['block5a_activation[0][0]', 'block5a_se_expand[0][0]'] block5a_project_conv (Conv2D) (None, 14, 14, 112) 64512 ['block5a_se_excite[0][0]'] block5a_project_bn (BatchNormalization) (None, 14, 14, 112) 448 ['block5a_project_conv[0][0]'] block5b_expand_conv (Conv2D) (None, 14, 14, 672) 75264 ['block5a_project_bn[0][0]'] block5b_expand_bn (BatchNormalization) (None, 14, 14, 672) 2688 ['block5b_expand_conv[0][0]'] block5b_expand_activation (Activation) (None, 14, 14, 672) 0 ['block5b_expand_bn[0][0]'] block5b_dwconv2 (DepthwiseConv2D) (None, 14, 14, 672) 6048 ['block5b_expand_activation[0][0]'] block5b_bn (BatchNormalization) (None, 14, 14, 672) 2688 ['block5b_dwconv2[0][0]'] block5b_activation (Activation) (None, 14, 14, 672) 0 ['block5b_bn[0][0]'] block5b_se_squeeze (GlobalAveragePooling2D (None, 672) 0 ['block5b_activation[0][0]'] ) block5b_se_reshape (Reshape) (None, 1, 1, 672) 0 ['block5b_se_squeeze[0][0]'] block5b_se_reduce (Conv2D) (None, 1, 1, 28) 18844 ['block5b_se_reshape[0][0]'] block5b_se_expand (Conv2D) (None, 1, 1, 672) 19488 ['block5b_se_reduce[0][0]'] block5b_se_excite (Multiply) (None, 14, 14, 672) 0 ['block5b_activation[0][0]', 'block5b_se_expand[0][0]'] block5b_project_conv (Conv2D) (None, 14, 14, 112) 75264 ['block5b_se_excite[0][0]'] block5b_project_bn (BatchNormalization) (None, 14, 14, 112) 448 ['block5b_project_conv[0][0]'] block5b_drop (Dropout) (None, 14, 14, 112) 0 ['block5b_project_bn[0][0]'] block5b_add (Add) (None, 14, 14, 112) 0 ['block5b_drop[0][0]', 'block5a_project_bn[0][0]'] block5c_expand_conv (Conv2D) (None, 14, 14, 672) 75264 ['block5b_add[0][0]'] block5c_expand_bn (BatchNormalization) (None, 14, 14, 672) 2688 ['block5c_expand_conv[0][0]'] block5c_expand_activation (Activation) (None, 14, 14, 672) 0 ['block5c_expand_bn[0][0]'] block5c_dwconv2 (DepthwiseConv2D) (None, 14, 14, 672) 6048 ['block5c_expand_activation[0][0]'] block5c_bn (BatchNormalization) (None, 14, 14, 672) 2688 ['block5c_dwconv2[0][0]'] block5c_activation (Activation) (None, 14, 14, 672) 0 ['block5c_bn[0][0]'] block5c_se_squeeze (GlobalAveragePooling2D (None, 672) 0 ['block5c_activation[0][0]'] ) block5c_se_reshape (Reshape) (None, 1, 1, 672) 0 ['block5c_se_squeeze[0][0]'] block5c_se_reduce (Conv2D) (None, 1, 1, 28) 18844 ['block5c_se_reshape[0][0]'] block5c_se_expand (Conv2D) (None, 1, 1, 672) 19488 ['block5c_se_reduce[0][0]'] block5c_se_excite (Multiply) (None, 14, 14, 672) 0 ['block5c_activation[0][0]', 'block5c_se_expand[0][0]'] block5c_project_conv (Conv2D) (None, 14, 14, 112) 75264 ['block5c_se_excite[0][0]'] block5c_project_bn (BatchNormalization) (None, 14, 14, 112) 448 ['block5c_project_conv[0][0]'] block5c_drop (Dropout) (None, 14, 14, 112) 0 ['block5c_project_bn[0][0]'] block5c_add (Add) (None, 14, 14, 112) 0 ['block5c_drop[0][0]', 'block5b_add[0][0]'] block5d_expand_conv (Conv2D) (None, 14, 14, 672) 75264 ['block5c_add[0][0]'] block5d_expand_bn (BatchNormalization) (None, 14, 14, 672) 2688 ['block5d_expand_conv[0][0]'] block5d_expand_activation (Activation) (None, 14, 14, 672) 0 ['block5d_expand_bn[0][0]'] block5d_dwconv2 (DepthwiseConv2D) (None, 14, 14, 672) 6048 ['block5d_expand_activation[0][0]'] block5d_bn (BatchNormalization) (None, 14, 14, 672) 2688 ['block5d_dwconv2[0][0]'] block5d_activation (Activation) (None, 14, 14, 672) 0 ['block5d_bn[0][0]'] block5d_se_squeeze (GlobalAveragePooling2D (None, 672) 0 ['block5d_activation[0][0]'] ) block5d_se_reshape (Reshape) (None, 1, 1, 672) 0 ['block5d_se_squeeze[0][0]'] block5d_se_reduce (Conv2D) (None, 1, 1, 28) 18844 ['block5d_se_reshape[0][0]'] block5d_se_expand (Conv2D) (None, 1, 1, 672) 19488 ['block5d_se_reduce[0][0]'] block5d_se_excite (Multiply) (None, 14, 14, 672) 0 ['block5d_activation[0][0]', 'block5d_se_expand[0][0]'] block5d_project_conv (Conv2D) (None, 14, 14, 112) 75264 ['block5d_se_excite[0][0]'] block5d_project_bn (BatchNormalization) (None, 14, 14, 112) 448 ['block5d_project_conv[0][0]'] block5d_drop (Dropout) (None, 14, 14, 112) 0 ['block5d_project_bn[0][0]'] block5d_add (Add) (None, 14, 14, 112) 0 ['block5d_drop[0][0]', 'block5c_add[0][0]'] block5e_expand_conv (Conv2D) (None, 14, 14, 672) 75264 ['block5d_add[0][0]'] block5e_expand_bn (BatchNormalization) (None, 14, 14, 672) 2688 ['block5e_expand_conv[0][0]'] block5e_expand_activation (Activation) (None, 14, 14, 672) 0 ['block5e_expand_bn[0][0]'] block5e_dwconv2 (DepthwiseConv2D) (None, 14, 14, 672) 6048 ['block5e_expand_activation[0][0]'] block5e_bn (BatchNormalization) (None, 14, 14, 672) 2688 ['block5e_dwconv2[0][0]'] block5e_activation (Activation) (None, 14, 14, 672) 0 ['block5e_bn[0][0]'] block5e_se_squeeze (GlobalAveragePooling2D (None, 672) 0 ['block5e_activation[0][0]'] ) block5e_se_reshape (Reshape) (None, 1, 1, 672) 0 ['block5e_se_squeeze[0][0]'] block5e_se_reduce (Conv2D) (None, 1, 1, 28) 18844 ['block5e_se_reshape[0][0]'] block5e_se_expand (Conv2D) (None, 1, 1, 672) 19488 ['block5e_se_reduce[0][0]'] block5e_se_excite (Multiply) (None, 14, 14, 672) 0 ['block5e_activation[0][0]', 'block5e_se_expand[0][0]'] block5e_project_conv (Conv2D) (None, 14, 14, 112) 75264 ['block5e_se_excite[0][0]'] block5e_project_bn (BatchNormalization) (None, 14, 14, 112) 448 ['block5e_project_conv[0][0]'] block5e_drop (Dropout) (None, 14, 14, 112) 0 ['block5e_project_bn[0][0]'] block5e_add (Add) (None, 14, 14, 112) 0 ['block5e_drop[0][0]', 'block5d_add[0][0]'] block6a_expand_conv (Conv2D) (None, 14, 14, 672) 75264 ['block5e_add[0][0]'] block6a_expand_bn (BatchNormalization) (None, 14, 14, 672) 2688 ['block6a_expand_conv[0][0]'] block6a_expand_activation (Activation) (None, 14, 14, 672) 0 ['block6a_expand_bn[0][0]'] block6a_dwconv2 (DepthwiseConv2D) (None, 7, 7, 672) 6048 ['block6a_expand_activation[0][0]'] block6a_bn (BatchNormalization) (None, 7, 7, 672) 2688 ['block6a_dwconv2[0][0]'] block6a_activation (Activation) (None, 7, 7, 672) 0 ['block6a_bn[0][0]'] block6a_se_squeeze (GlobalAveragePooling2D (None, 672) 0 ['block6a_activation[0][0]'] ) block6a_se_reshape (Reshape) (None, 1, 1, 672) 0 ['block6a_se_squeeze[0][0]'] block6a_se_reduce (Conv2D) (None, 1, 1, 28) 18844 ['block6a_se_reshape[0][0]'] block6a_se_expand (Conv2D) (None, 1, 1, 672) 19488 ['block6a_se_reduce[0][0]'] block6a_se_excite (Multiply) (None, 7, 7, 672) 0 ['block6a_activation[0][0]', 'block6a_se_expand[0][0]'] block6a_project_conv (Conv2D) (None, 7, 7, 192) 129024 ['block6a_se_excite[0][0]'] block6a_project_bn (BatchNormalization) (None, 7, 7, 192) 768 ['block6a_project_conv[0][0]'] block6b_expand_conv (Conv2D) (None, 7, 7, 1152) 221184 ['block6a_project_bn[0][0]'] block6b_expand_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6b_expand_conv[0][0]'] block6b_expand_activation (Activation) (None, 7, 7, 1152) 0 ['block6b_expand_bn[0][0]'] block6b_dwconv2 (DepthwiseConv2D) (None, 7, 7, 1152) 10368 ['block6b_expand_activation[0][0]'] block6b_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6b_dwconv2[0][0]'] block6b_activation (Activation) (None, 7, 7, 1152) 0 ['block6b_bn[0][0]'] block6b_se_squeeze (GlobalAveragePooling2D (None, 1152) 0 ['block6b_activation[0][0]'] ) block6b_se_reshape (Reshape) (None, 1, 1, 1152) 0 ['block6b_se_squeeze[0][0]'] block6b_se_reduce (Conv2D) (None, 1, 1, 48) 55344 ['block6b_se_reshape[0][0]'] block6b_se_expand (Conv2D) (None, 1, 1, 1152) 56448 ['block6b_se_reduce[0][0]'] block6b_se_excite (Multiply) (None, 7, 7, 1152) 0 ['block6b_activation[0][0]', 'block6b_se_expand[0][0]'] block6b_project_conv (Conv2D) (None, 7, 7, 192) 221184 ['block6b_se_excite[0][0]'] block6b_project_bn (BatchNormalization) (None, 7, 7, 192) 768 ['block6b_project_conv[0][0]'] block6b_drop (Dropout) (None, 7, 7, 192) 0 ['block6b_project_bn[0][0]'] block6b_add (Add) (None, 7, 7, 192) 0 ['block6b_drop[0][0]', 'block6a_project_bn[0][0]'] block6c_expand_conv (Conv2D) (None, 7, 7, 1152) 221184 ['block6b_add[0][0]'] block6c_expand_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6c_expand_conv[0][0]'] block6c_expand_activation (Activation) (None, 7, 7, 1152) 0 ['block6c_expand_bn[0][0]'] block6c_dwconv2 (DepthwiseConv2D) (None, 7, 7, 1152) 10368 ['block6c_expand_activation[0][0]'] block6c_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6c_dwconv2[0][0]'] block6c_activation (Activation) (None, 7, 7, 1152) 0 ['block6c_bn[0][0]'] block6c_se_squeeze (GlobalAveragePooling2D (None, 1152) 0 ['block6c_activation[0][0]'] ) block6c_se_reshape (Reshape) (None, 1, 1, 1152) 0 ['block6c_se_squeeze[0][0]'] block6c_se_reduce (Conv2D) (None, 1, 1, 48) 55344 ['block6c_se_reshape[0][0]'] block6c_se_expand (Conv2D) (None, 1, 1, 1152) 56448 ['block6c_se_reduce[0][0]'] block6c_se_excite (Multiply) (None, 7, 7, 1152) 0 ['block6c_activation[0][0]', 'block6c_se_expand[0][0]'] block6c_project_conv (Conv2D) (None, 7, 7, 192) 221184 ['block6c_se_excite[0][0]'] block6c_project_bn (BatchNormalization) (None, 7, 7, 192) 768 ['block6c_project_conv[0][0]'] block6c_drop (Dropout) (None, 7, 7, 192) 0 ['block6c_project_bn[0][0]'] block6c_add (Add) (None, 7, 7, 192) 0 ['block6c_drop[0][0]', 'block6b_add[0][0]'] block6d_expand_conv (Conv2D) (None, 7, 7, 1152) 221184 ['block6c_add[0][0]'] block6d_expand_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6d_expand_conv[0][0]'] block6d_expand_activation (Activation) (None, 7, 7, 1152) 0 ['block6d_expand_bn[0][0]'] block6d_dwconv2 (DepthwiseConv2D) (None, 7, 7, 1152) 10368 ['block6d_expand_activation[0][0]'] block6d_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6d_dwconv2[0][0]'] block6d_activation (Activation) (None, 7, 7, 1152) 0 ['block6d_bn[0][0]'] block6d_se_squeeze (GlobalAveragePooling2D (None, 1152) 0 ['block6d_activation[0][0]'] ) block6d_se_reshape (Reshape) (None, 1, 1, 1152) 0 ['block6d_se_squeeze[0][0]'] block6d_se_reduce (Conv2D) (None, 1, 1, 48) 55344 ['block6d_se_reshape[0][0]'] block6d_se_expand (Conv2D) (None, 1, 1, 1152) 56448 ['block6d_se_reduce[0][0]'] block6d_se_excite (Multiply) (None, 7, 7, 1152) 0 ['block6d_activation[0][0]', 'block6d_se_expand[0][0]'] block6d_project_conv (Conv2D) (None, 7, 7, 192) 221184 ['block6d_se_excite[0][0]'] block6d_project_bn (BatchNormalization) (None, 7, 7, 192) 768 ['block6d_project_conv[0][0]'] block6d_drop (Dropout) (None, 7, 7, 192) 0 ['block6d_project_bn[0][0]'] block6d_add (Add) (None, 7, 7, 192) 0 ['block6d_drop[0][0]', 'block6c_add[0][0]'] block6e_expand_conv (Conv2D) (None, 7, 7, 1152) 221184 ['block6d_add[0][0]'] block6e_expand_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6e_expand_conv[0][0]'] block6e_expand_activation (Activation) (None, 7, 7, 1152) 0 ['block6e_expand_bn[0][0]'] block6e_dwconv2 (DepthwiseConv2D) (None, 7, 7, 1152) 10368 ['block6e_expand_activation[0][0]'] block6e_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6e_dwconv2[0][0]'] block6e_activation (Activation) (None, 7, 7, 1152) 0 ['block6e_bn[0][0]'] block6e_se_squeeze (GlobalAveragePooling2D (None, 1152) 0 ['block6e_activation[0][0]'] ) block6e_se_reshape (Reshape) (None, 1, 1, 1152) 0 ['block6e_se_squeeze[0][0]'] block6e_se_reduce (Conv2D) (None, 1, 1, 48) 55344 ['block6e_se_reshape[0][0]'] block6e_se_expand (Conv2D) (None, 1, 1, 1152) 56448 ['block6e_se_reduce[0][0]'] block6e_se_excite (Multiply) (None, 7, 7, 1152) 0 ['block6e_activation[0][0]', 'block6e_se_expand[0][0]'] block6e_project_conv (Conv2D) (None, 7, 7, 192) 221184 ['block6e_se_excite[0][0]'] block6e_project_bn (BatchNormalization) (None, 7, 7, 192) 768 ['block6e_project_conv[0][0]'] block6e_drop (Dropout) (None, 7, 7, 192) 0 ['block6e_project_bn[0][0]'] block6e_add (Add) (None, 7, 7, 192) 0 ['block6e_drop[0][0]', 'block6d_add[0][0]'] block6f_expand_conv (Conv2D) (None, 7, 7, 1152) 221184 ['block6e_add[0][0]'] block6f_expand_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6f_expand_conv[0][0]'] block6f_expand_activation (Activation) (None, 7, 7, 1152) 0 ['block6f_expand_bn[0][0]'] block6f_dwconv2 (DepthwiseConv2D) (None, 7, 7, 1152) 10368 ['block6f_expand_activation[0][0]'] block6f_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6f_dwconv2[0][0]'] block6f_activation (Activation) (None, 7, 7, 1152) 0 ['block6f_bn[0][0]'] block6f_se_squeeze (GlobalAveragePooling2D (None, 1152) 0 ['block6f_activation[0][0]'] ) block6f_se_reshape (Reshape) (None, 1, 1, 1152) 0 ['block6f_se_squeeze[0][0]'] block6f_se_reduce (Conv2D) (None, 1, 1, 48) 55344 ['block6f_se_reshape[0][0]'] block6f_se_expand (Conv2D) (None, 1, 1, 1152) 56448 ['block6f_se_reduce[0][0]'] block6f_se_excite (Multiply) (None, 7, 7, 1152) 0 ['block6f_activation[0][0]', 'block6f_se_expand[0][0]'] block6f_project_conv (Conv2D) (None, 7, 7, 192) 221184 ['block6f_se_excite[0][0]'] block6f_project_bn (BatchNormalization) (None, 7, 7, 192) 768 ['block6f_project_conv[0][0]'] block6f_drop (Dropout) (None, 7, 7, 192) 0 ['block6f_project_bn[0][0]'] block6f_add (Add) (None, 7, 7, 192) 0 ['block6f_drop[0][0]', 'block6e_add[0][0]'] block6g_expand_conv (Conv2D) (None, 7, 7, 1152) 221184 ['block6f_add[0][0]'] block6g_expand_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6g_expand_conv[0][0]'] block6g_expand_activation (Activation) (None, 7, 7, 1152) 0 ['block6g_expand_bn[0][0]'] block6g_dwconv2 (DepthwiseConv2D) (None, 7, 7, 1152) 10368 ['block6g_expand_activation[0][0]'] block6g_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6g_dwconv2[0][0]'] block6g_activation (Activation) (None, 7, 7, 1152) 0 ['block6g_bn[0][0]'] block6g_se_squeeze (GlobalAveragePooling2D (None, 1152) 0 ['block6g_activation[0][0]'] ) block6g_se_reshape (Reshape) (None, 1, 1, 1152) 0 ['block6g_se_squeeze[0][0]'] block6g_se_reduce (Conv2D) (None, 1, 1, 48) 55344 ['block6g_se_reshape[0][0]'] block6g_se_expand (Conv2D) (None, 1, 1, 1152) 56448 ['block6g_se_reduce[0][0]'] block6g_se_excite (Multiply) (None, 7, 7, 1152) 0 ['block6g_activation[0][0]', 'block6g_se_expand[0][0]'] block6g_project_conv (Conv2D) (None, 7, 7, 192) 221184 ['block6g_se_excite[0][0]'] block6g_project_bn (BatchNormalization) (None, 7, 7, 192) 768 ['block6g_project_conv[0][0]'] block6g_drop (Dropout) (None, 7, 7, 192) 0 ['block6g_project_bn[0][0]'] block6g_add (Add) (None, 7, 7, 192) 0 ['block6g_drop[0][0]', 'block6f_add[0][0]'] block6h_expand_conv (Conv2D) (None, 7, 7, 1152) 221184 ['block6g_add[0][0]'] block6h_expand_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6h_expand_conv[0][0]'] block6h_expand_activation (Activation) (None, 7, 7, 1152) 0 ['block6h_expand_bn[0][0]'] block6h_dwconv2 (DepthwiseConv2D) (None, 7, 7, 1152) 10368 ['block6h_expand_activation[0][0]'] block6h_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6h_dwconv2[0][0]'] block6h_activation (Activation) (None, 7, 7, 1152) 0 ['block6h_bn[0][0]'] block6h_se_squeeze (GlobalAveragePooling2D (None, 1152) 0 ['block6h_activation[0][0]'] ) block6h_se_reshape (Reshape) (None, 1, 1, 1152) 0 ['block6h_se_squeeze[0][0]'] block6h_se_reduce (Conv2D) (None, 1, 1, 48) 55344 ['block6h_se_reshape[0][0]'] block6h_se_expand (Conv2D) (None, 1, 1, 1152) 56448 ['block6h_se_reduce[0][0]'] block6h_se_excite (Multiply) (None, 7, 7, 1152) 0 ['block6h_activation[0][0]', 'block6h_se_expand[0][0]'] block6h_project_conv (Conv2D) (None, 7, 7, 192) 221184 ['block6h_se_excite[0][0]'] block6h_project_bn (BatchNormalization) (None, 7, 7, 192) 768 ['block6h_project_conv[0][0]'] block6h_drop (Dropout) (None, 7, 7, 192) 0 ['block6h_project_bn[0][0]'] block6h_add (Add) (None, 7, 7, 192) 0 ['block6h_drop[0][0]', 'block6g_add[0][0]'] top_conv (Conv2D) (None, 7, 7, 1280) 245760 ['block6h_add[0][0]'] top_bn (BatchNormalization) (None, 7, 7, 1280) 5120 ['top_conv[0][0]'] top_activation (Activation) (None, 7, 7, 1280) 0 ['top_bn[0][0]'] avg_pool (GlobalAveragePooling2D) (None, 1280) 0 ['top_activation[0][0]'] pool_dropout (Dropout) (None, 1280) 0 ['avg_pool[0][0]'] dense_features (Dense) (None, 512) 655872 ['pool_dropout[0][0]'] dense_dropout (Dropout) (None, 512) 0 ['dense_features[0][0]'] classifier (Dense) (None, 200) 102600 ['dense_dropout[0][0]'] ======================================================================================================================================= Total params: 6,677,784 Trainable params: 758,472 Non-trainable params: 5,919,312 _______________________________________________________________________________________________________________________________________ 2022-10-31 02:45:43.925929: W tensorflow/core/framework/cpu_allocator_impl.cc:82] Allocation of 4915200000 exceeds 10% of free system memory. 2022-10-31 02:45:46.736851: W tensorflow/core/framework/cpu_allocator_impl.cc:82] Allocation of 4915200000 exceeds 10% of free system memory. Epoch 1/8 2022-10-31 02:45:56.532206: I tensorflow/stream_executor/cuda/cuda_dnn.cc:384] Loaded cuDNN version 8500 1563/1563 [==============================] - 354s 221ms/step - loss: 1.1635 - accuracy: 0.7214 - map: 0.7896 - val_loss: 0.7489 - val_accuracy: 0.8003 - val_map: 0.8577 Epoch 2/8 1563/1563 [==============================] - 344s 220ms/step - loss: 0.8538 - accuracy: 0.7760 - map: 0.8386 - val_loss: 0.7220 - val_accuracy: 0.8051 - val_map: 0.8618 Epoch 3/8 1563/1563 [==============================] - 345s 221ms/step - loss: 0.7804 - accuracy: 0.7920 - map: 0.8519 - val_loss: 0.7080 - val_accuracy: 0.8098 - val_map: 0.8646 Epoch 4/8 1563/1563 [==============================] - 346s 221ms/step - loss: 0.7329 - accuracy: 0.8018 - map: 0.8605 - val_loss: 0.7063 - val_accuracy: 0.8113 - val_map: 0.8661 Epoch 5/8 1563/1563 [==============================] - 346s 221ms/step - loss: 0.6910 - accuracy: 0.8105 - map: 0.8684 - val_loss: 0.7071 - val_accuracy: 0.8139 - val_map: 0.8669 Epoch 6/8 1563/1563 [==============================] - 346s 221ms/step - loss: 0.6571 - accuracy: 0.8180 - map: 0.8744 - val_loss: 0.7080 - val_accuracy: 0.8148 - val_map: 0.8681 Epoch 7/8 1563/1563 [==============================] - 346s 221ms/step - loss: 0.6283 - accuracy: 0.8246 - map: 0.8796 - val_loss: 0.7114 - val_accuracy: 0.8132 - val_map: 0.8669 Epoch 8/8 1563/1563 [==============================] - 346s 221ms/step - loss: 0.5978 - accuracy: 0.8300 - map: 0.8847 - val_loss: 0.7116 - val_accuracy: 0.8142 - val_map: 0.8673 Model: "transfer_model" _______________________________________________________________________________________________________________________________________ Layer (type) Output Shape Param # Connected to ======================================================================================================================================= input_1 (InputLayer) [(None, 64, 64, 3)] 0 [] resizing (Resizing) (None, 235, 235, 3) 0 ['input_1[0][0]'] random_crop (RandomCrop) (None, 224, 224, 3) 0 ['resizing[0][0]'] random_flip (RandomFlip) (None, 224, 224, 3) 0 ['random_crop[0][0]'] rescaling (Rescaling) (None, 224, 224, 3) 0 ['random_flip[0][0]'] normalization (Normalization) (None, 224, 224, 3) 0 ['rescaling[0][0]'] stem_conv (Conv2D) (None, 112, 112, 32) 864 ['normalization[0][0]'] stem_bn (BatchNormalization) (None, 112, 112, 32) 128 ['stem_conv[0][0]'] stem_activation (Activation) (None, 112, 112, 32) 0 ['stem_bn[0][0]'] block1a_project_conv (Conv2D) (None, 112, 112, 16) 4608 ['stem_activation[0][0]'] block1a_project_bn (BatchNormalization) (None, 112, 112, 16) 64 ['block1a_project_conv[0][0]'] block1a_project_activation (Activation) (None, 112, 112, 16) 0 ['block1a_project_bn[0][0]'] block2a_expand_conv (Conv2D) (None, 56, 56, 64) 9216 ['block1a_project_activation[0][0]'] block2a_expand_bn (BatchNormalization) (None, 56, 56, 64) 256 ['block2a_expand_conv[0][0]'] block2a_expand_activation (Activation) (None, 56, 56, 64) 0 ['block2a_expand_bn[0][0]'] block2a_project_conv (Conv2D) (None, 56, 56, 32) 2048 ['block2a_expand_activation[0][0]'] block2a_project_bn (BatchNormalization) (None, 56, 56, 32) 128 ['block2a_project_conv[0][0]'] block2b_expand_conv (Conv2D) (None, 56, 56, 128) 36864 ['block2a_project_bn[0][0]'] block2b_expand_bn (BatchNormalization) (None, 56, 56, 128) 512 ['block2b_expand_conv[0][0]'] block2b_expand_activation (Activation) (None, 56, 56, 128) 0 ['block2b_expand_bn[0][0]'] block2b_project_conv (Conv2D) (None, 56, 56, 32) 4096 ['block2b_expand_activation[0][0]'] block2b_project_bn (BatchNormalization) (None, 56, 56, 32) 128 ['block2b_project_conv[0][0]'] block2b_drop (Dropout) (None, 56, 56, 32) 0 ['block2b_project_bn[0][0]'] block2b_add (Add) (None, 56, 56, 32) 0 ['block2b_drop[0][0]', 'block2a_project_bn[0][0]'] block3a_expand_conv (Conv2D) (None, 28, 28, 128) 36864 ['block2b_add[0][0]'] block3a_expand_bn (BatchNormalization) (None, 28, 28, 128) 512 ['block3a_expand_conv[0][0]'] block3a_expand_activation (Activation) (None, 28, 28, 128) 0 ['block3a_expand_bn[0][0]'] block3a_project_conv (Conv2D) (None, 28, 28, 48) 6144 ['block3a_expand_activation[0][0]'] block3a_project_bn (BatchNormalization) (None, 28, 28, 48) 192 ['block3a_project_conv[0][0]'] block3b_expand_conv (Conv2D) (None, 28, 28, 192) 82944 ['block3a_project_bn[0][0]'] block3b_expand_bn (BatchNormalization) (None, 28, 28, 192) 768 ['block3b_expand_conv[0][0]'] block3b_expand_activation (Activation) (None, 28, 28, 192) 0 ['block3b_expand_bn[0][0]'] block3b_project_conv (Conv2D) (None, 28, 28, 48) 9216 ['block3b_expand_activation[0][0]'] block3b_project_bn (BatchNormalization) (None, 28, 28, 48) 192 ['block3b_project_conv[0][0]'] block3b_drop (Dropout) (None, 28, 28, 48) 0 ['block3b_project_bn[0][0]'] block3b_add (Add) (None, 28, 28, 48) 0 ['block3b_drop[0][0]', 'block3a_project_bn[0][0]'] block4a_expand_conv (Conv2D) (None, 28, 28, 192) 9216 ['block3b_add[0][0]'] block4a_expand_bn (BatchNormalization) (None, 28, 28, 192) 768 ['block4a_expand_conv[0][0]'] block4a_expand_activation (Activation) (None, 28, 28, 192) 0 ['block4a_expand_bn[0][0]'] block4a_dwconv2 (DepthwiseConv2D) (None, 14, 14, 192) 1728 ['block4a_expand_activation[0][0]'] block4a_bn (BatchNormalization) (None, 14, 14, 192) 768 ['block4a_dwconv2[0][0]'] block4a_activation (Activation) (None, 14, 14, 192) 0 ['block4a_bn[0][0]'] block4a_se_squeeze (GlobalAveragePooling2D (None, 192) 0 ['block4a_activation[0][0]'] ) block4a_se_reshape (Reshape) (None, 1, 1, 192) 0 ['block4a_se_squeeze[0][0]'] block4a_se_reduce (Conv2D) (None, 1, 1, 12) 2316 ['block4a_se_reshape[0][0]'] block4a_se_expand (Conv2D) (None, 1, 1, 192) 2496 ['block4a_se_reduce[0][0]'] block4a_se_excite (Multiply) (None, 14, 14, 192) 0 ['block4a_activation[0][0]', 'block4a_se_expand[0][0]'] block4a_project_conv (Conv2D) (None, 14, 14, 96) 18432 ['block4a_se_excite[0][0]'] block4a_project_bn (BatchNormalization) (None, 14, 14, 96) 384 ['block4a_project_conv[0][0]'] block4b_expand_conv (Conv2D) (None, 14, 14, 384) 36864 ['block4a_project_bn[0][0]'] block4b_expand_bn (BatchNormalization) (None, 14, 14, 384) 1536 ['block4b_expand_conv[0][0]'] block4b_expand_activation (Activation) (None, 14, 14, 384) 0 ['block4b_expand_bn[0][0]'] block4b_dwconv2 (DepthwiseConv2D) (None, 14, 14, 384) 3456 ['block4b_expand_activation[0][0]'] block4b_bn (BatchNormalization) (None, 14, 14, 384) 1536 ['block4b_dwconv2[0][0]'] block4b_activation (Activation) (None, 14, 14, 384) 0 ['block4b_bn[0][0]'] block4b_se_squeeze (GlobalAveragePooling2D (None, 384) 0 ['block4b_activation[0][0]'] ) block4b_se_reshape (Reshape) (None, 1, 1, 384) 0 ['block4b_se_squeeze[0][0]'] block4b_se_reduce (Conv2D) (None, 1, 1, 24) 9240 ['block4b_se_reshape[0][0]'] block4b_se_expand (Conv2D) (None, 1, 1, 384) 9600 ['block4b_se_reduce[0][0]'] block4b_se_excite (Multiply) (None, 14, 14, 384) 0 ['block4b_activation[0][0]', 'block4b_se_expand[0][0]'] block4b_project_conv (Conv2D) (None, 14, 14, 96) 36864 ['block4b_se_excite[0][0]'] block4b_project_bn (BatchNormalization) (None, 14, 14, 96) 384 ['block4b_project_conv[0][0]'] block4b_drop (Dropout) (None, 14, 14, 96) 0 ['block4b_project_bn[0][0]'] block4b_add (Add) (None, 14, 14, 96) 0 ['block4b_drop[0][0]', 'block4a_project_bn[0][0]'] block4c_expand_conv (Conv2D) (None, 14, 14, 384) 36864 ['block4b_add[0][0]'] block4c_expand_bn (BatchNormalization) (None, 14, 14, 384) 1536 ['block4c_expand_conv[0][0]'] block4c_expand_activation (Activation) (None, 14, 14, 384) 0 ['block4c_expand_bn[0][0]'] block4c_dwconv2 (DepthwiseConv2D) (None, 14, 14, 384) 3456 ['block4c_expand_activation[0][0]'] block4c_bn (BatchNormalization) (None, 14, 14, 384) 1536 ['block4c_dwconv2[0][0]'] block4c_activation (Activation) (None, 14, 14, 384) 0 ['block4c_bn[0][0]'] block4c_se_squeeze (GlobalAveragePooling2D (None, 384) 0 ['block4c_activation[0][0]'] ) block4c_se_reshape (Reshape) (None, 1, 1, 384) 0 ['block4c_se_squeeze[0][0]'] block4c_se_reduce (Conv2D) (None, 1, 1, 24) 9240 ['block4c_se_reshape[0][0]'] block4c_se_expand (Conv2D) (None, 1, 1, 384) 9600 ['block4c_se_reduce[0][0]'] block4c_se_excite (Multiply) (None, 14, 14, 384) 0 ['block4c_activation[0][0]', 'block4c_se_expand[0][0]'] block4c_project_conv (Conv2D) (None, 14, 14, 96) 36864 ['block4c_se_excite[0][0]'] block4c_project_bn (BatchNormalization) (None, 14, 14, 96) 384 ['block4c_project_conv[0][0]'] block4c_drop (Dropout) (None, 14, 14, 96) 0 ['block4c_project_bn[0][0]'] block4c_add (Add) (None, 14, 14, 96) 0 ['block4c_drop[0][0]', 'block4b_add[0][0]'] block5a_expand_conv (Conv2D) (None, 14, 14, 576) 55296 ['block4c_add[0][0]'] block5a_expand_bn (BatchNormalization) (None, 14, 14, 576) 2304 ['block5a_expand_conv[0][0]'] block5a_expand_activation (Activation) (None, 14, 14, 576) 0 ['block5a_expand_bn[0][0]'] block5a_dwconv2 (DepthwiseConv2D) (None, 14, 14, 576) 5184 ['block5a_expand_activation[0][0]'] block5a_bn (BatchNormalization) (None, 14, 14, 576) 2304 ['block5a_dwconv2[0][0]'] block5a_activation (Activation) (None, 14, 14, 576) 0 ['block5a_bn[0][0]'] block5a_se_squeeze (GlobalAveragePooling2D (None, 576) 0 ['block5a_activation[0][0]'] ) block5a_se_reshape (Reshape) (None, 1, 1, 576) 0 ['block5a_se_squeeze[0][0]'] block5a_se_reduce (Conv2D) (None, 1, 1, 24) 13848 ['block5a_se_reshape[0][0]'] block5a_se_expand (Conv2D) (None, 1, 1, 576) 14400 ['block5a_se_reduce[0][0]'] block5a_se_excite (Multiply) (None, 14, 14, 576) 0 ['block5a_activation[0][0]', 'block5a_se_expand[0][0]'] block5a_project_conv (Conv2D) (None, 14, 14, 112) 64512 ['block5a_se_excite[0][0]'] block5a_project_bn (BatchNormalization) (None, 14, 14, 112) 448 ['block5a_project_conv[0][0]'] block5b_expand_conv (Conv2D) (None, 14, 14, 672) 75264 ['block5a_project_bn[0][0]'] block5b_expand_bn (BatchNormalization) (None, 14, 14, 672) 2688 ['block5b_expand_conv[0][0]'] block5b_expand_activation (Activation) (None, 14, 14, 672) 0 ['block5b_expand_bn[0][0]'] block5b_dwconv2 (DepthwiseConv2D) (None, 14, 14, 672) 6048 ['block5b_expand_activation[0][0]'] block5b_bn (BatchNormalization) (None, 14, 14, 672) 2688 ['block5b_dwconv2[0][0]'] block5b_activation (Activation) (None, 14, 14, 672) 0 ['block5b_bn[0][0]'] block5b_se_squeeze (GlobalAveragePooling2D (None, 672) 0 ['block5b_activation[0][0]'] ) block5b_se_reshape (Reshape) (None, 1, 1, 672) 0 ['block5b_se_squeeze[0][0]'] block5b_se_reduce (Conv2D) (None, 1, 1, 28) 18844 ['block5b_se_reshape[0][0]'] block5b_se_expand (Conv2D) (None, 1, 1, 672) 19488 ['block5b_se_reduce[0][0]'] block5b_se_excite (Multiply) (None, 14, 14, 672) 0 ['block5b_activation[0][0]', 'block5b_se_expand[0][0]'] block5b_project_conv (Conv2D) (None, 14, 14, 112) 75264 ['block5b_se_excite[0][0]'] block5b_project_bn (BatchNormalization) (None, 14, 14, 112) 448 ['block5b_project_conv[0][0]'] block5b_drop (Dropout) (None, 14, 14, 112) 0 ['block5b_project_bn[0][0]'] block5b_add (Add) (None, 14, 14, 112) 0 ['block5b_drop[0][0]', 'block5a_project_bn[0][0]'] block5c_expand_conv (Conv2D) (None, 14, 14, 672) 75264 ['block5b_add[0][0]'] block5c_expand_bn (BatchNormalization) (None, 14, 14, 672) 2688 ['block5c_expand_conv[0][0]'] block5c_expand_activation (Activation) (None, 14, 14, 672) 0 ['block5c_expand_bn[0][0]'] block5c_dwconv2 (DepthwiseConv2D) (None, 14, 14, 672) 6048 ['block5c_expand_activation[0][0]'] block5c_bn (BatchNormalization) (None, 14, 14, 672) 2688 ['block5c_dwconv2[0][0]'] block5c_activation (Activation) (None, 14, 14, 672) 0 ['block5c_bn[0][0]'] block5c_se_squeeze (GlobalAveragePooling2D (None, 672) 0 ['block5c_activation[0][0]'] ) block5c_se_reshape (Reshape) (None, 1, 1, 672) 0 ['block5c_se_squeeze[0][0]'] block5c_se_reduce (Conv2D) (None, 1, 1, 28) 18844 ['block5c_se_reshape[0][0]'] block5c_se_expand (Conv2D) (None, 1, 1, 672) 19488 ['block5c_se_reduce[0][0]'] block5c_se_excite (Multiply) (None, 14, 14, 672) 0 ['block5c_activation[0][0]', 'block5c_se_expand[0][0]'] block5c_project_conv (Conv2D) (None, 14, 14, 112) 75264 ['block5c_se_excite[0][0]'] block5c_project_bn (BatchNormalization) (None, 14, 14, 112) 448 ['block5c_project_conv[0][0]'] block5c_drop (Dropout) (None, 14, 14, 112) 0 ['block5c_project_bn[0][0]'] block5c_add (Add) (None, 14, 14, 112) 0 ['block5c_drop[0][0]', 'block5b_add[0][0]'] block5d_expand_conv (Conv2D) (None, 14, 14, 672) 75264 ['block5c_add[0][0]'] block5d_expand_bn (BatchNormalization) (None, 14, 14, 672) 2688 ['block5d_expand_conv[0][0]'] block5d_expand_activation (Activation) (None, 14, 14, 672) 0 ['block5d_expand_bn[0][0]'] block5d_dwconv2 (DepthwiseConv2D) (None, 14, 14, 672) 6048 ['block5d_expand_activation[0][0]'] block5d_bn (BatchNormalization) (None, 14, 14, 672) 2688 ['block5d_dwconv2[0][0]'] block5d_activation (Activation) (None, 14, 14, 672) 0 ['block5d_bn[0][0]'] block5d_se_squeeze (GlobalAveragePooling2D (None, 672) 0 ['block5d_activation[0][0]'] ) block5d_se_reshape (Reshape) (None, 1, 1, 672) 0 ['block5d_se_squeeze[0][0]'] block5d_se_reduce (Conv2D) (None, 1, 1, 28) 18844 ['block5d_se_reshape[0][0]'] block5d_se_expand (Conv2D) (None, 1, 1, 672) 19488 ['block5d_se_reduce[0][0]'] block5d_se_excite (Multiply) (None, 14, 14, 672) 0 ['block5d_activation[0][0]', 'block5d_se_expand[0][0]'] block5d_project_conv (Conv2D) (None, 14, 14, 112) 75264 ['block5d_se_excite[0][0]'] block5d_project_bn (BatchNormalization) (None, 14, 14, 112) 448 ['block5d_project_conv[0][0]'] block5d_drop (Dropout) (None, 14, 14, 112) 0 ['block5d_project_bn[0][0]'] block5d_add (Add) (None, 14, 14, 112) 0 ['block5d_drop[0][0]', 'block5c_add[0][0]'] block5e_expand_conv (Conv2D) (None, 14, 14, 672) 75264 ['block5d_add[0][0]'] block5e_expand_bn (BatchNormalization) (None, 14, 14, 672) 2688 ['block5e_expand_conv[0][0]'] block5e_expand_activation (Activation) (None, 14, 14, 672) 0 ['block5e_expand_bn[0][0]'] block5e_dwconv2 (DepthwiseConv2D) (None, 14, 14, 672) 6048 ['block5e_expand_activation[0][0]'] block5e_bn (BatchNormalization) (None, 14, 14, 672) 2688 ['block5e_dwconv2[0][0]'] block5e_activation (Activation) (None, 14, 14, 672) 0 ['block5e_bn[0][0]'] block5e_se_squeeze (GlobalAveragePooling2D (None, 672) 0 ['block5e_activation[0][0]'] ) block5e_se_reshape (Reshape) (None, 1, 1, 672) 0 ['block5e_se_squeeze[0][0]'] block5e_se_reduce (Conv2D) (None, 1, 1, 28) 18844 ['block5e_se_reshape[0][0]'] block5e_se_expand (Conv2D) (None, 1, 1, 672) 19488 ['block5e_se_reduce[0][0]'] block5e_se_excite (Multiply) (None, 14, 14, 672) 0 ['block5e_activation[0][0]', 'block5e_se_expand[0][0]'] block5e_project_conv (Conv2D) (None, 14, 14, 112) 75264 ['block5e_se_excite[0][0]'] block5e_project_bn (BatchNormalization) (None, 14, 14, 112) 448 ['block5e_project_conv[0][0]'] block5e_drop (Dropout) (None, 14, 14, 112) 0 ['block5e_project_bn[0][0]'] block5e_add (Add) (None, 14, 14, 112) 0 ['block5e_drop[0][0]', 'block5d_add[0][0]'] block6a_expand_conv (Conv2D) (None, 14, 14, 672) 75264 ['block5e_add[0][0]'] block6a_expand_bn (BatchNormalization) (None, 14, 14, 672) 2688 ['block6a_expand_conv[0][0]'] block6a_expand_activation (Activation) (None, 14, 14, 672) 0 ['block6a_expand_bn[0][0]'] block6a_dwconv2 (DepthwiseConv2D) (None, 7, 7, 672) 6048 ['block6a_expand_activation[0][0]'] block6a_bn (BatchNormalization) (None, 7, 7, 672) 2688 ['block6a_dwconv2[0][0]'] block6a_activation (Activation) (None, 7, 7, 672) 0 ['block6a_bn[0][0]'] block6a_se_squeeze (GlobalAveragePooling2D (None, 672) 0 ['block6a_activation[0][0]'] ) block6a_se_reshape (Reshape) (None, 1, 1, 672) 0 ['block6a_se_squeeze[0][0]'] block6a_se_reduce (Conv2D) (None, 1, 1, 28) 18844 ['block6a_se_reshape[0][0]'] block6a_se_expand (Conv2D) (None, 1, 1, 672) 19488 ['block6a_se_reduce[0][0]'] block6a_se_excite (Multiply) (None, 7, 7, 672) 0 ['block6a_activation[0][0]', 'block6a_se_expand[0][0]'] block6a_project_conv (Conv2D) (None, 7, 7, 192) 129024 ['block6a_se_excite[0][0]'] block6a_project_bn (BatchNormalization) (None, 7, 7, 192) 768 ['block6a_project_conv[0][0]'] block6b_expand_conv (Conv2D) (None, 7, 7, 1152) 221184 ['block6a_project_bn[0][0]'] block6b_expand_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6b_expand_conv[0][0]'] block6b_expand_activation (Activation) (None, 7, 7, 1152) 0 ['block6b_expand_bn[0][0]'] block6b_dwconv2 (DepthwiseConv2D) (None, 7, 7, 1152) 10368 ['block6b_expand_activation[0][0]'] block6b_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6b_dwconv2[0][0]'] block6b_activation (Activation) (None, 7, 7, 1152) 0 ['block6b_bn[0][0]'] block6b_se_squeeze (GlobalAveragePooling2D (None, 1152) 0 ['block6b_activation[0][0]'] ) block6b_se_reshape (Reshape) (None, 1, 1, 1152) 0 ['block6b_se_squeeze[0][0]'] block6b_se_reduce (Conv2D) (None, 1, 1, 48) 55344 ['block6b_se_reshape[0][0]'] block6b_se_expand (Conv2D) (None, 1, 1, 1152) 56448 ['block6b_se_reduce[0][0]'] block6b_se_excite (Multiply) (None, 7, 7, 1152) 0 ['block6b_activation[0][0]', 'block6b_se_expand[0][0]'] block6b_project_conv (Conv2D) (None, 7, 7, 192) 221184 ['block6b_se_excite[0][0]'] block6b_project_bn (BatchNormalization) (None, 7, 7, 192) 768 ['block6b_project_conv[0][0]'] block6b_drop (Dropout) (None, 7, 7, 192) 0 ['block6b_project_bn[0][0]'] block6b_add (Add) (None, 7, 7, 192) 0 ['block6b_drop[0][0]', 'block6a_project_bn[0][0]'] block6c_expand_conv (Conv2D) (None, 7, 7, 1152) 221184 ['block6b_add[0][0]'] block6c_expand_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6c_expand_conv[0][0]'] block6c_expand_activation (Activation) (None, 7, 7, 1152) 0 ['block6c_expand_bn[0][0]'] block6c_dwconv2 (DepthwiseConv2D) (None, 7, 7, 1152) 10368 ['block6c_expand_activation[0][0]'] block6c_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6c_dwconv2[0][0]'] block6c_activation (Activation) (None, 7, 7, 1152) 0 ['block6c_bn[0][0]'] block6c_se_squeeze (GlobalAveragePooling2D (None, 1152) 0 ['block6c_activation[0][0]'] ) block6c_se_reshape (Reshape) (None, 1, 1, 1152) 0 ['block6c_se_squeeze[0][0]'] block6c_se_reduce (Conv2D) (None, 1, 1, 48) 55344 ['block6c_se_reshape[0][0]'] block6c_se_expand (Conv2D) (None, 1, 1, 1152) 56448 ['block6c_se_reduce[0][0]'] block6c_se_excite (Multiply) (None, 7, 7, 1152) 0 ['block6c_activation[0][0]', 'block6c_se_expand[0][0]'] block6c_project_conv (Conv2D) (None, 7, 7, 192) 221184 ['block6c_se_excite[0][0]'] block6c_project_bn (BatchNormalization) (None, 7, 7, 192) 768 ['block6c_project_conv[0][0]'] block6c_drop (Dropout) (None, 7, 7, 192) 0 ['block6c_project_bn[0][0]'] block6c_add (Add) (None, 7, 7, 192) 0 ['block6c_drop[0][0]', 'block6b_add[0][0]'] block6d_expand_conv (Conv2D) (None, 7, 7, 1152) 221184 ['block6c_add[0][0]'] block6d_expand_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6d_expand_conv[0][0]'] block6d_expand_activation (Activation) (None, 7, 7, 1152) 0 ['block6d_expand_bn[0][0]'] block6d_dwconv2 (DepthwiseConv2D) (None, 7, 7, 1152) 10368 ['block6d_expand_activation[0][0]'] block6d_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6d_dwconv2[0][0]'] block6d_activation (Activation) (None, 7, 7, 1152) 0 ['block6d_bn[0][0]'] block6d_se_squeeze (GlobalAveragePooling2D (None, 1152) 0 ['block6d_activation[0][0]'] ) block6d_se_reshape (Reshape) (None, 1, 1, 1152) 0 ['block6d_se_squeeze[0][0]'] block6d_se_reduce (Conv2D) (None, 1, 1, 48) 55344 ['block6d_se_reshape[0][0]'] block6d_se_expand (Conv2D) (None, 1, 1, 1152) 56448 ['block6d_se_reduce[0][0]'] block6d_se_excite (Multiply) (None, 7, 7, 1152) 0 ['block6d_activation[0][0]', 'block6d_se_expand[0][0]'] block6d_project_conv (Conv2D) (None, 7, 7, 192) 221184 ['block6d_se_excite[0][0]'] block6d_project_bn (BatchNormalization) (None, 7, 7, 192) 768 ['block6d_project_conv[0][0]'] block6d_drop (Dropout) (None, 7, 7, 192) 0 ['block6d_project_bn[0][0]'] block6d_add (Add) (None, 7, 7, 192) 0 ['block6d_drop[0][0]', 'block6c_add[0][0]'] block6e_expand_conv (Conv2D) (None, 7, 7, 1152) 221184 ['block6d_add[0][0]'] block6e_expand_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6e_expand_conv[0][0]'] block6e_expand_activation (Activation) (None, 7, 7, 1152) 0 ['block6e_expand_bn[0][0]'] block6e_dwconv2 (DepthwiseConv2D) (None, 7, 7, 1152) 10368 ['block6e_expand_activation[0][0]'] block6e_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6e_dwconv2[0][0]'] block6e_activation (Activation) (None, 7, 7, 1152) 0 ['block6e_bn[0][0]'] block6e_se_squeeze (GlobalAveragePooling2D (None, 1152) 0 ['block6e_activation[0][0]'] ) block6e_se_reshape (Reshape) (None, 1, 1, 1152) 0 ['block6e_se_squeeze[0][0]'] block6e_se_reduce (Conv2D) (None, 1, 1, 48) 55344 ['block6e_se_reshape[0][0]'] block6e_se_expand (Conv2D) (None, 1, 1, 1152) 56448 ['block6e_se_reduce[0][0]'] block6e_se_excite (Multiply) (None, 7, 7, 1152) 0 ['block6e_activation[0][0]', 'block6e_se_expand[0][0]'] block6e_project_conv (Conv2D) (None, 7, 7, 192) 221184 ['block6e_se_excite[0][0]'] block6e_project_bn (BatchNormalization) (None, 7, 7, 192) 768 ['block6e_project_conv[0][0]'] block6e_drop (Dropout) (None, 7, 7, 192) 0 ['block6e_project_bn[0][0]'] block6e_add (Add) (None, 7, 7, 192) 0 ['block6e_drop[0][0]', 'block6d_add[0][0]'] block6f_expand_conv (Conv2D) (None, 7, 7, 1152) 221184 ['block6e_add[0][0]'] block6f_expand_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6f_expand_conv[0][0]'] block6f_expand_activation (Activation) (None, 7, 7, 1152) 0 ['block6f_expand_bn[0][0]'] block6f_dwconv2 (DepthwiseConv2D) (None, 7, 7, 1152) 10368 ['block6f_expand_activation[0][0]'] block6f_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6f_dwconv2[0][0]'] block6f_activation (Activation) (None, 7, 7, 1152) 0 ['block6f_bn[0][0]'] block6f_se_squeeze (GlobalAveragePooling2D (None, 1152) 0 ['block6f_activation[0][0]'] ) block6f_se_reshape (Reshape) (None, 1, 1, 1152) 0 ['block6f_se_squeeze[0][0]'] block6f_se_reduce (Conv2D) (None, 1, 1, 48) 55344 ['block6f_se_reshape[0][0]'] block6f_se_expand (Conv2D) (None, 1, 1, 1152) 56448 ['block6f_se_reduce[0][0]'] block6f_se_excite (Multiply) (None, 7, 7, 1152) 0 ['block6f_activation[0][0]', 'block6f_se_expand[0][0]'] block6f_project_conv (Conv2D) (None, 7, 7, 192) 221184 ['block6f_se_excite[0][0]'] block6f_project_bn (BatchNormalization) (None, 7, 7, 192) 768 ['block6f_project_conv[0][0]'] block6f_drop (Dropout) (None, 7, 7, 192) 0 ['block6f_project_bn[0][0]'] block6f_add (Add) (None, 7, 7, 192) 0 ['block6f_drop[0][0]', 'block6e_add[0][0]'] block6g_expand_conv (Conv2D) (None, 7, 7, 1152) 221184 ['block6f_add[0][0]'] block6g_expand_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6g_expand_conv[0][0]'] block6g_expand_activation (Activation) (None, 7, 7, 1152) 0 ['block6g_expand_bn[0][0]'] block6g_dwconv2 (DepthwiseConv2D) (None, 7, 7, 1152) 10368 ['block6g_expand_activation[0][0]'] block6g_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6g_dwconv2[0][0]'] block6g_activation (Activation) (None, 7, 7, 1152) 0 ['block6g_bn[0][0]'] block6g_se_squeeze (GlobalAveragePooling2D (None, 1152) 0 ['block6g_activation[0][0]'] ) block6g_se_reshape (Reshape) (None, 1, 1, 1152) 0 ['block6g_se_squeeze[0][0]'] block6g_se_reduce (Conv2D) (None, 1, 1, 48) 55344 ['block6g_se_reshape[0][0]'] block6g_se_expand (Conv2D) (None, 1, 1, 1152) 56448 ['block6g_se_reduce[0][0]'] block6g_se_excite (Multiply) (None, 7, 7, 1152) 0 ['block6g_activation[0][0]', 'block6g_se_expand[0][0]'] block6g_project_conv (Conv2D) (None, 7, 7, 192) 221184 ['block6g_se_excite[0][0]'] block6g_project_bn (BatchNormalization) (None, 7, 7, 192) 768 ['block6g_project_conv[0][0]'] block6g_drop (Dropout) (None, 7, 7, 192) 0 ['block6g_project_bn[0][0]'] block6g_add (Add) (None, 7, 7, 192) 0 ['block6g_drop[0][0]', 'block6f_add[0][0]'] block6h_expand_conv (Conv2D) (None, 7, 7, 1152) 221184 ['block6g_add[0][0]'] block6h_expand_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6h_expand_conv[0][0]'] block6h_expand_activation (Activation) (None, 7, 7, 1152) 0 ['block6h_expand_bn[0][0]'] block6h_dwconv2 (DepthwiseConv2D) (None, 7, 7, 1152) 10368 ['block6h_expand_activation[0][0]'] block6h_bn (BatchNormalization) (None, 7, 7, 1152) 4608 ['block6h_dwconv2[0][0]'] block6h_activation (Activation) (None, 7, 7, 1152) 0 ['block6h_bn[0][0]'] block6h_se_squeeze (GlobalAveragePooling2D (None, 1152) 0 ['block6h_activation[0][0]'] ) block6h_se_reshape (Reshape) (None, 1, 1, 1152) 0 ['block6h_se_squeeze[0][0]'] block6h_se_reduce (Conv2D) (None, 1, 1, 48) 55344 ['block6h_se_reshape[0][0]'] block6h_se_expand (Conv2D) (None, 1, 1, 1152) 56448 ['block6h_se_reduce[0][0]'] block6h_se_excite (Multiply) (None, 7, 7, 1152) 0 ['block6h_activation[0][0]', 'block6h_se_expand[0][0]'] block6h_project_conv (Conv2D) (None, 7, 7, 192) 221184 ['block6h_se_excite[0][0]'] block6h_project_bn (BatchNormalization) (None, 7, 7, 192) 768 ['block6h_project_conv[0][0]'] block6h_drop (Dropout) (None, 7, 7, 192) 0 ['block6h_project_bn[0][0]'] block6h_add (Add) (None, 7, 7, 192) 0 ['block6h_drop[0][0]', 'block6g_add[0][0]'] top_conv (Conv2D) (None, 7, 7, 1280) 245760 ['block6h_add[0][0]'] top_bn (BatchNormalization) (None, 7, 7, 1280) 5120 ['top_conv[0][0]'] top_activation (Activation) (None, 7, 7, 1280) 0 ['top_bn[0][0]'] avg_pool (GlobalAveragePooling2D) (None, 1280) 0 ['top_activation[0][0]'] pool_dropout (Dropout) (None, 1280) 0 ['avg_pool[0][0]'] dense_features (Dense) (None, 512) 655872 ['pool_dropout[0][0]'] dense_dropout (Dropout) (None, 512) 0 ['dense_features[0][0]'] classifier (Dense) (None, 200) 102600 ['dense_dropout[0][0]'] ======================================================================================================================================= Total params: 6,677,784 Trainable params: 6,617,176 Non-trainable params: 60,608 _______________________________________________________________________________________________________________________________________ 2022-10-31 03:32:00.945225: W tensorflow/core/framework/cpu_allocator_impl.cc:82] Allocation of 4915200000 exceeds 10% of free system memory. 2022-10-31 03:32:03.759054: W tensorflow/core/framework/cpu_allocator_impl.cc:82] Allocation of 4915200000 exceeds 10% of free system memory. Epoch 1/2 1563/1563 [==============================] - 1105s 697ms/step - loss: 0.5930 - accuracy: 0.8338 - map: 0.8877 - val_loss: 0.4975 - val_accuracy: 0.8665 - val_map: 0.9084 Epoch 2/2 1563/1563 [==============================] - 1087s 695ms/step - loss: 0.4633 - accuracy: 0.8674 - map: 0.9131 - val_loss: 0.4724 - val_accuracy: 0.8742 - val_map: 0.9144 313/313 [==============================] - 31s 94ms/step real 84m39.653s user 51m14.922s sys 4m51.928s deeplearn@ML-RefVm-967342:~/imagenet$ kaggle competitions submit -c ml530-2022-fall-imagenet -f predictions.csv -m "84:39" 100%|████████████████████████████████████████████████████████████████████████████████████████████████| 229k/229k [00:00<00:00, 546kB/s] Successfully submitted to ml530-2022-fall-imagenetdeeplearn@ML-RefVm-967342:~/imagenet$