#1
|
||||
|
||||
![]()
( Overview to Entropy of a File )
Simplistically entropy is disorder, or better in computing the density of information that a data stream can contain. So the more the content of a file will be predictable, the higher the entropy. The software ENT (Pseudorandom Number Sequence Test Program, http://www.fourmilab.ch/random) performs various statistical tests, providing output in the following report: 1) Entropy is the density of information contained in a file expressed in number of bits per character. The maximum entropy is 8, when we find a file with entropy or 8 means that it is perfectly random, or is compressed. In fact, taken a bitmap, its entropy is 4.724502 bits per byte, if you turned it into jpeg becomes 7.938038 bits per byte. If you compress the bmp with winrar I get 7.996259 bits per byte. This is clear. I take a text file where I have written a thousand times the same word we get is that its entropy Entropy = 2.545246 bits per byte. If we compress with winrar we get Entropy = 6.747827, and winrar maximum compression Entropy = 6.756800. 2) Test of Chi square Used for the study of random data streams. If we apply it to image files as a result they are random data. In practice, it occupies the deviation percentage of the flow of data from a real random sequence. However, if the result is> 99% or <1% of the data stream is not random. If it is between <5% and> 95% of the flow is random suspiciously, if intermediate then we are on random. 3) Arithmetic Mean Sum all the bytes and divides them for the length: it is a type of arithmetic mean. The closer the number is 127.5 more random. 4) Test of Pi-Greco Montecarlo The more the value is close to pi-greco (3.14 ..) plus the data stream is random / compressed. 5) Coefficient of Correlation Ie how predictable a byte knowing his previous. More the value is close to 1 and more is predictable, more and more close to 0 is random. SOME EXAMPLES ON THE IMAGES : We analyze a bitmap file ... Quote:
The chi-square gives us a value of 0.01 it says that the flow is not accidental but it is a picture is not reliable. The average is 160 and deviates from 125 and is therefore not random. Even the monte carlo is far from 3.14. The correlation is 0.81: bitmaps are always close correlation to 1. If they were random data would be 0. Now let's look at the same bitmap converted to jpg ... Quote:
Here the entropy is very high (7.9), the file is very compressed. Test Monte Carlo 3.21, quite close to the pi greek, so close to random. Correlation coefficient close to 0. understand that it is compressed. ![]() ![]() ![]() After this short overview this GUI natively leverages the application ENT, to calculate the entropy of a file, or an full data folder, providing a report based on the reduction of the file or folder data, and its total compression ratio, for know quickly if a file type and / or folder, will have a high or low compression. Classification of the file or group of files: The scanning of the file or folder data, it's divided into 5 blocks with a calculation of the entropy range from 1.0 to 7.0 for Deflate e Text, and 1.0 to 7.5 for Void and Msrsolid, through direct reading of the file arc.group during the scanning, and based on the reading of the extensions of the 4 masks "Void, deflate, Msrsolid and Text" and the basic method. The files with higher entropy than 7.0 or 7.5 or an extension not set in arc.group file are classified and added to the basic method. I chose the level 7 and 7.5 on the basis of various tests performed out on individual files of various formats, a file with the entropy level from 7.0 to 7.5, with a strong compression carried out with different samples of compressors you get a reduction of 20-25% and a compression ratio of 75-80%. Each block contains additional information, according to the main method and masks, number of files scanned and belonging, percentage of size reduction, percentage of compression ratio, total size of files added in the belonging block. ![]() Creating a masked method estimate in based on the entropy and scanning of files: With a choice of 44 compressors on 4 masks, these will be activated or deactivated, in based on the scan and the entropy of the previously evaluated files, in order to speed up and simplify the creation of the final masked method with a correct of compression estimated of 90% on the compressed files. ![]() LZbench v1.7.1 by Inikep : " Benchmark Compressors LZ77/LZSS/LZMA " https://github.com/inikep/lzbench Thank Inikep from encode.ru, the application incorporates a modified and adapted for the complete benchmark on a single file or entire directory of 63 compressors of the Family LZ77/LZSS/LZMA, with a full report out for each file (compression speed in Mb/s, decompression speed in Mb/s, original size, compressed size, ratio and file name). In the same way we will have a final scan with the reduction in size and compression ratio, to compare the various compressors and choose based to the speed of compression, decompression and ratio, on the types of scanned files. In LZBench no size file limit, even using a low amount of memory the average of the ratio is calculated for the number of divided parts obtaining the overall result of the compression ratio. Quote:
![]() "arc.groups" updated to version 3.0, based on version 2.5 of Panker1992, they were added over 200 popular formats used in the area of gaming. We Avoid the Vultures, those who not give credit and thanks for all the work, please do not use the application and not download... ![]() UPDATED : BE_Parent_Dir The file parent directory is displayed in the masks box. Other minor fix. In Down.
__________________
≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ ≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ « I Mediocri Imitano, I Geni Copiano, Dio Crea & Distrugge » (Io Ridefinisco & Perfeziono le Loro Opere Rendendole Uniche) ![]() ![]() ≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ ≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ « Mediocrities Imitate, Genius Copy, God Creates & Destroys » (I Reconsider & Improve Their Works, Rending Them One And Only) ![]() ![]() Last edited by felice2011; 25-04-2017 at 00:59. Reason: Added BE_Parent_Dir |
The Following 17 Users Say Thank You to felice2011 For This Useful Post: | ||
-XCX- (29-03-2017), arkantos7 (01-04-2017), ChronoCross (29-03-2017), COPyCAT (24-01-2018), elit (28-09-2017), EzzEldin16 (29-03-2017), gozarck (04-04-2017), JRD! (04-04-2017), kassane (29-09-2017), knife16 (29-03-2017), mikey26 (29-03-2017), ramazan19833 (29-03-2017), Razor12911 (04-04-2017), rinaldo (04-04-2017), romulus_ut3 (03-04-2017), Simorq (29-03-2017), Stor31 (21-04-2017) |
Sponsored Links |
#2
|
||||
|
||||
can i ask you something i know it's not about this program but i was wondering is it possible to add files after compressed with specific compressors into inno script like WPI
exe so it can work as self extracting archive ?? so no archive needed seprately ?
__________________
learn it |
#3
|
||||
|
||||
@EzzEldin16 if you want to ask me some questions not relative to the thread or the program, please contact me in private, or open a new thread of help.
__________________
≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ ≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ « I Mediocri Imitano, I Geni Copiano, Dio Crea & Distrugge » (Io Ridefinisco & Perfeziono le Loro Opere Rendendole Uniche) ![]() ![]() ≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ ≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ « Mediocrities Imitate, Genius Copy, God Creates & Destroys » (I Reconsider & Improve Their Works, Rending Them One And Only) ![]() ![]() |
The Following User Says Thank You to felice2011 For This Useful Post: | ||
EzzEldin16 (29-03-2017) |
#4
|
||||
|
||||
Excellent Job!!.
But I do not know if all users know how to get the maximum use of this great utility. |
The Following User Says Thank You to ChronoCross For This Useful Post: | ||
felice2011 (01-04-2017) |
#5
|
||||
|
||||
@felice2011 okay i'll message you
__________________
learn it |
#6
|
|||
|
|||
i don't get any of this at all
![]() |
The Following User Says Thank You to TWOELV For This Useful Post: | ||
felice2011 (01-04-2017) |
#7
|
||||
|
||||
Quote:
Quote:
![]() I had great ideas for this application, with new updates, but if results and interest are low, do not update my works here on the forum.
__________________
≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ ≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ « I Mediocri Imitano, I Geni Copiano, Dio Crea & Distrugge » (Io Ridefinisco & Perfeziono le Loro Opere Rendendole Uniche) ![]() ![]() ≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ ≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ « Mediocrities Imitate, Genius Copy, God Creates & Destroys » (I Reconsider & Improve Their Works, Rending Them One And Only) ![]() ![]() |
The Following 2 Users Say Thank You to felice2011 For This Useful Post: | ||
EzzEldin16 (01-04-2017), knife16 (01-04-2017) |
#8
|
||||
|
||||
@PrinceGupta2000 ... For the simple reason, that to get a good and right compression is necessary to know the structure of a file, and in this case its entropy.
I give you an example, the file (RE7) "re_chunk_000.pak.patch_001.pak" from 2.5GB get a compression ratio of 87.46% scanned in 30 seconds. Now I know that giving an estimate of 7-8% scrap, should get the maximum with any compressor or precompressor existing today in a method or mask, a decrease in the ratio of 7-8%, in fact, with such a method "pzlib+srep64+delta+exe2+4x4:lzma" get the 79.6 %, the 8% of less. Also I can know in advance the estimated compression ratio of any file in seconds or minutes, without having to test thirty compressors, with their methods or masks, with having to compress for half a day using continuous writes on HD and occupying space useless. Let's see what altro..ahhh .. ![]() Ability to create your own mask, so right and correct, but I think, that you make little use of masks, to understand the real need to use a mask is a pipe dream for many of you. Integra the possibility to verify that I have said in the first lines in the example, with LZ-Bench, without the use of space and writing in HD, in fact with the same file get a compression ratio of the 84.31% with lzma test a level 6. What else you had to do even coffee. ![]() EDIT: Prince .. what do you do, first ask the questions and then they delete. ![]() Same thing my reply goes for who else is interested. ![]()
__________________
≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ ≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ « I Mediocri Imitano, I Geni Copiano, Dio Crea & Distrugge » (Io Ridefinisco & Perfeziono le Loro Opere Rendendole Uniche) ![]() ![]() ≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ ≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ « Mediocrities Imitate, Genius Copy, God Creates & Destroys » (I Reconsider & Improve Their Works, Rending Them One And Only) ![]() ![]() Last edited by felice2011; 03-04-2017 at 09:10. |
#9
|
|||
|
|||
the file (RE7) "re_chunk_000.pak.patch_001.pak" actually contains about 3000 RIFF waves about 1 GB, feed them to Ogg and u saved about 800 MB
a little bit of hard work and gradual increase in compression ratio without going into many details you can save more retro porns in those 800 MB masks never works for me another pak in the game actually have a lot more RIFF waves and some biks so if u r a nerd like me you can save more space and knowing only the entropy will never save this much space for retro porns >>without having to test thirty compressors I never test compressor, Lzma is the best >>Prince .. what do you do, first ask the questions and then they delete Shhhhhhhhhh. Someone is Watching >>pzlib+srep64+delta+exe2+4x4:lzma never use srep64 and exe2 after srep or rep compressed data, this will decrease the overall compression ratio, if really want delta and exe2 use them before this is increase overall speed with least decrease in compression ratio
__________________
XD Last edited by Gupta; 03-04-2017 at 10:18. |
#10
|
||||
|
||||
Prince, I trouble understanding what you say, between porn and entropy, the method mentioned was just an example to make you understand the concept, but you can use any other method or compressor, put first, remove or add "srep, rep, lzma with or without 4x4 etc.." you'll get to max than 10% less than the estimated compression ratio on the entropy of a file.
Or you can make magic with a compressor as "LZAF" Shhhh ... ehhh Oh Yes, someone has already read what you write. ![]()
__________________
≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ ≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ « I Mediocri Imitano, I Geni Copiano, Dio Crea & Distrugge » (Io Ridefinisco & Perfeziono le Loro Opere Rendendole Uniche) ![]() ![]() ≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ ≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ « Mediocrities Imitate, Genius Copy, God Creates & Destroys » (I Reconsider & Improve Their Works, Rending Them One And Only) ![]() ![]() |
#11
|
||||
|
||||
ci credo che hai difficoltà a capirlo ahah.. dagli una leccalecca
![]() very complicated pg
__________________
if you understand read more Last edited by rinaldo; 04-04-2017 at 07:57. |
The Following User Says Thank You to rinaldo For This Useful Post: | ||
felice2011 (04-04-2017) |
#12
|
||||
|
||||
Nice work felice, optimum software for large file batches. You should take a look at datasmoke by Bulat Ziganshin
![]()
__________________
-the science of the useless- |
The Following User Says Thank You to gozarck For This Useful Post: | ||
felice2011 (05-04-2017) |
#13
|
|||
|
|||
An option to save the file names including categories and their full path including sub-directories would be nice.
Last edited by romulus_ut3; 04-04-2017 at 19:57. |
The Following User Says Thank You to romulus_ut3 For This Useful Post: | ||
felice2011 (05-04-2017) |
#14
|
||||
|
||||
![]() Quote:
![]() EDIT: ![]() ![]() ![]() I think something good comes out .... ![]()
__________________
≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ ≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ « I Mediocri Imitano, I Geni Copiano, Dio Crea & Distrugge » (Io Ridefinisco & Perfeziono le Loro Opere Rendendole Uniche) ![]() ![]() ≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ ≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈≈ « Mediocrities Imitate, Genius Copy, God Creates & Destroys » (I Reconsider & Improve Their Works, Rending Them One And Only) ![]() ![]() Last edited by felice2011; 05-04-2017 at 03:49. |
The Following User Says Thank You to felice2011 For This Useful Post: | ||
romulus_ut3 (05-04-2017) |
#15
|
|||
|
|||
Hello, Felice.
Firstly, I'd like to take this opportunity to thank you for this brilliant tool! I would like to request a simple feature. (I know you're busy with Datasmoke) Is it possible to list files category wise with their file path, and save the results once a directory has been scanned? The ratio for the compression isn't necessary to be included. For example: Code:
Method: Game\files\xxxxxxxxx.xx Game\files\xxxxxxxxx1.xx Void: Game\Movies\xxxxxxxxx.bik Game\Movies\xxxxxxxxx1.bik Deflate: Game\Sound\xxxxxxxxx.ogg Game\Sound\xxxxxxxxx1.ogg MSRSolid: Game\Help\xxxxxxxxx.jpg Game\Help\xxxxxxxxx1.jpg Text: Game\Docs\xxxxxxxxx.txt Game\Docs\xxxxxxxxx1.ini |
The Following User Says Thank You to romulus_ut3 For This Useful Post: | ||
felice2011 (19-04-2017) |
![]() |
Thread Tools | |
Display Modes | |
|
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
(Arrow) Cmd -Info - Bench -Test | felice2011 | Conversion Tutorials | 36 | 28-11-2016 12:39 |
Fast Brute (test) | Razor12911 | Conversion Tutorials | 49 | 07-06-2016 03:44 |
test bench | rinaldo | Conversion Tutorials | 8 | 28-02-2016 04:55 |
Bejeweled 2 | smoggey | PC Games | 3 | 28-09-2005 17:11 |